Roy Kim
Microsoft
DISCOVER, CONNECT, GROW
Ready to get started with AI? Microsoft Reactor provides events, training, and community resources to help startups, entrepreneurs and developers build their next business on AI technology. Join us!
16 July 2024 | 9:00 PM - 10:00 PM UTC (Coordinated Universal Time)
Topic: Infrastructure for AI
Language: English
About this session:
Roy Kim will present Kaito (Kubernetes AI Toolchain Operator), an operator that streamlines the deployment of AI/ML inference models in Kubernetes. Discover how Kaito simplifies deploying large open-source inference models such as Falcon and Llama 2. Learn about its key features: managing large model files as container images, preset GPU configurations, automatic provisioning of GPU nodes, and model images hosted on Microsoft Container Registry (MCR). See how Kaito simplifies the workflow for onboarding large AI inference models in Kubernetes.
Learn more and develop your skills in Azure Kubernetes Service with this Microsoft Learn training module:
https://aka.ms/IntroToAKSLearn1
For questions please contact us at reactor@microsoft.com