Join us for the
This weekly series brings you leading experts from Google to share the latest insights and practical knowledge on AI, LLMs, and AI Agents.
Each session includes ▶️ a deep-dive talk, live demo, ?? hands-on code labs, and ? networking with speakers and a global tech community (developers, engineers, startup founders and tech leaders).
Tech Talk: Effortless AI Serving with GKE Inference Gateway
Speaker: Anmol Krishan Sachdeva (Google)
Abstract: Deploying, scaling, and managing diverse LLMs on Kubernetes is often complex and resource-intensive. Discover the GKE Inference Gateway. We’ll demo how this unified, standard-based solution delivers model-aware routing, optimized load balancing, and dynamic LoRA serving, making sophisticated, cost-effective inference a reality on GKE.
Venue:
virtual, join from anywhere.
Upcoming and Past Sessions
Global AI Tech Community on Discord
Join us on discord for local and global AI tech community:
- Events chat: chat and connect with speakers and global and local attendees;
- Learning AI: events, learning materials, study groups;
- Startups: innovation, projects collaborations, founders/co-founders;
- Jobs and Careers: job openings, post resumes, hiring managers