
Welcome to the AI meetup in San Francisco, in collaboration with Arm. Join us for deep-dive tech talks on AI, GenAI, LLMs, and agents, plus hands-on code labs, workshops, and networking with speakers and fellow developers.
Agenda:
- 5:30pm~6:00pm: Check-in, food and networking
- 6:00pm~6:10pm: Welcome, Community update
- 6:10pm~8:00pm: Tech talks and Q&A
- 8:00pm~8:30pm: Open discussion, Mixer and Closing.
Tech Talk: Device to Datacenter: How Mobile and Cloud AI Coexist on Arm
Speakers: Avin Zarlez (Arm) | Pranay Bakre (Arm)
Abstract: As AI becomes increasingly ubiquitous, developers are faced with key architectural decisions: should inference run on-device, in the cloud, or as a hybrid? This talk explores how Arm compute platforms—from mobile devices to hyperscale datacenters—enable seamless and efficient AI execution across this entire spectrum.
We discuss real-world performance and efficiency considerations and break down where different types of AI workloads naturally align. Whether it’s low-latency inferencing on Arm-powered smartphones or scaling large models in Arm-based cloud environments, the goal is not to choose one over the other but to design systems that intelligently leverage both.
Join us to learn how the unified Arm architecture empowers developers to build portable, performant AI applications that scale from the palm of your hand to the datacenter rack—without compromise.
Tech Talk: Optimizing AI Inference with KleidiAI-Powered Quantization
Speaker: Kieran Hejmadi (Arm)
Abstract: Efficient large language model (LLM) inference is critical for deploying AI applications across both cloud and edge environments. Quantization plays a key role in this effort: representing weights and activations in lower-precision formats reduces memory footprint and speeds up computation. In this workshop, we explore tensor-wise, channel-wise, and group-wise quantization techniques to accelerate LLM inference on AWS Graviton using PyTorch and Arm KleidiAI, evaluating the trade-offs and benefits of each approach.
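For attendees new to the topic, the three granularities named in the abstract can be sketched in a few lines. This is only an illustrative sketch in plain Python, assuming symmetric int8 quantization with a zero-point of 0; it is not the KleidiAI or PyTorch API, and the helper names (`scale_for`, `quantize`, `quant_error`) and the sample weights are made up for the example. The only difference between the modes is how many values share one scale.

```python
# Illustrative sketch only: symmetric int8 quantization at three granularities.
# Helper names and sample weights are hypothetical, not from KleidiAI or PyTorch.

def scale_for(values):
    """Pick a scale so the largest magnitude maps to 127 (1.0 if all zeros)."""
    peak = max(abs(v) for v in values)
    return peak / 127 if peak > 0 else 1.0

def quantize(values, scale):
    """Map floats to int8 codes in [-127, 127] using one shared scale."""
    return [max(-127, min(127, round(v / scale))) for v in values]

def dequantize(codes, scale):
    """Map int8 codes back to approximate floats."""
    return [c * scale for c in codes]

def quant_error(weights, mode, group_size=2):
    """Mean absolute round-trip error for a 2-D weight matrix (list of rows)."""
    flat = [v for row in weights for v in row]
    tensor_scale = scale_for(flat)
    total = 0.0
    for row in weights:
        if mode == "tensor":      # one scale for the whole matrix
            segments = [(row, tensor_scale)]
        elif mode == "channel":   # one scale per row (output channel)
            segments = [(row, scale_for(row))]
        elif mode == "group":     # one scale per block of group_size values
            blocks = [row[i:i + group_size] for i in range(0, len(row), group_size)]
            segments = [(block, scale_for(block)) for block in blocks]
        else:
            raise ValueError(f"unknown mode: {mode}")
        for segment, scale in segments:
            restored = dequantize(quantize(segment, scale), scale)
            total += sum(abs(a - b) for a, b in zip(segment, restored))
    return total / len(flat)

# A tiny matrix with one outlier (8.0) that dominates any scale it shares.
weights = [
    [0.01, -0.02, 0.03, 8.0],
    [0.5, -0.4, 0.3, -0.2],
]
errors = {mode: quant_error(weights, mode) for mode in ("tensor", "channel", "group")}

if __name__ == "__main__":
    for mode, err in errors.items():
        print(f"{mode:8s} mean abs error: {err:.6f}")
```

On this sample, finer granularity gives lower round-trip error because the outlier affects fewer values, at the cost of storing more scales. That ordering is not guaranteed for every matrix, and the sketch says nothing about speed.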
Speakers/Topics:
Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics
Venue:
AWS Loft | San Francisco, 525 Market Street, 2nd Floor, San Francisco, CA 94105
Sponsors:
We are actively seeking sponsors to support the AI developer community, whether by offering venue space, providing food, or contributing cash sponsorship. Sponsors not only speak at the meetups and receive prominent recognition, but also gain exposure to our extensive membership base of 50,000+ AI developers in the San Francisco Bay Area and 500K+ worldwide.
Local and Global AI Community on Discord
Join us on Discord for the local and global AI tech community:
- Events chat: chat and connect with speakers and with local and global attendees;
- Learning AI: events, learning materials, study groups;
- Startups: innovation, project collaborations, founders/co-founders;
- Jobs and Careers: job openings, resume postings, hiring managers.