AI meetup (SF) - AI on Mobile


Aug 21, 05:30PM PST(12:30AM GMT). Add to Calendar: Google Yahoo
  • Free 52 Attendees
Description
Speaker

Welcome to the AI meetup in San Francisco, in collaboration with Arm. Join us for deep dive tech talks on AI, GenAI, LLMs and Agent, hands-on experiences on code labs, workshops, and networking with speakers and fellow developers.

Agenda:
- 5:30pm~6:00pm: Checkin, food and networking
- 6:00pm~6:10pm: Welcome, Community update
- 6:10pm~8:00pm: Tech talks and Q&A
- 8:00pm~8:30pm: Open discussion, Mixer and Closing.

Tech Talk: Device to Datacenter: How Mobile and Cloud AI Coexist on Arm
Speakers: Avin Zarlez (Arm) | Pranay Bakre (Arm)
Abstract: As AI becomes increasingly ubiquitous, developers are faced with key architectural decisions: should inference run on-device, in the cloud, or as a hybrid? This talk explores how Arm compute platforms—from mobile devices to hyperscale datacenters—enable seamless and efficient AI execution across this entire spectrum.
We discuss real-world performance and efficiency considerations and break down where different types of AI workloads naturally align. Whether it’s low-latency inferencing on Arm-powered smartphones or scaling large models in Arm-based cloud environments, the goal is not to choose one over the other but to design systems that intelligently leverage both.
Join us to learn how the unified Arm architecture empowers developers to build portable, performant AI applications that scale from the palm of your hand to the datacenter rack—without compromise.

Tech Talk: Optimizing AI Inference With KleidiAI-powered quantization
Speakers: Kieran Hejmadi (Arm)
Abstract: Efficient large language model (LLM) inference is critical for deploying AI applications across both cloud and edge environments. Quantization of models plays a key role in this effort, enabling reduced memory footprint and faster computation by representing weights and activations in lower precision formats. In this workshop, we explore tensor-wise, channel-wise, and group-wise quantization techniques to accelerate LLM inference on AWS Graviton using PyTorch and Arm KleidiAI, evaluating the trade-offs and benefits of each approach.

Speakers/Topics:
Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics

Venue:
AWS Loft | San Francisco, 525 Market Street, 2nd Floor, San Francisco, CA 94105

Sponsors:
- Arm: Arm is the industry’s highest-performing and most power-efficient compute platform with unmatched scale that touches 100 percent of the connected global population. Learn More

We are actively seeking sponsors to support AI developers community.  Whether it is by offering venue spaces, providing food, or cash sponsorship. Sponsors will not only speak at the meetups, receive prominent recognition, but also gain exposure to our extensive membership base of 50,000+ AI developers in San Francisco Bay Area and 500K+ worldwide.

Local and Global AI Community on Discord
Join us on discord for local and global AI tech community:
- Events chat: chat and connect with speakers and global and local attendees;
- Learning AI: events, learning materials, study groups;
- Startups: innovation, projects collaborations, founders/co-founders;
- Jobs and Careers: job openings, post resumes, hiring managers;

Avin Zarlez (Arm), Pranay Bakre (Arm)

*By RSVP, you submit information to event hosts, who will email you regarding events and services. You can opt-out from the emails. FAQ