DISCOVER, CONNECT, GROW

Microsoft Reactor

Join Microsoft Reactor and engage with startups and developers live

Ready to get started with AI?  Microsoft Reactor provides events, training, and community resources to help startups, entrepreneurs and developers build their next business on AI technology. Join us!

Exploring Kaito to streamline AI inference model deployment in Azure Kubernetes

16 July, 2024 | 9:00 PM - 10:00 PM (UTC) Coordinated Universal Time

Format: Livestream

Topic: Infrastructure for AI

Language: English

About this session:
Roy Kim presents Kaito, a Kubernetes operator that streamlines AI/ML inference model deployment. Discover how Kaito simplifies deploying large open-source inference models such as Falcon and Llama 2. Learn about its distinguishing features: managing large model files as container images, preset GPU configurations, automatic provisioning of GPU nodes, and hosting model images on Microsoft Container Registry (MCR). See how Kaito streamlines the workflow of onboarding large AI inference models in Kubernetes.

Learn more and develop your skills in Azure Kubernetes Service with this Microsoft Learn training module:
https://aka.ms/IntroToAKSLearn1

By registering for this event you agree to abide by the Microsoft Reactor Code of Conduct.

For questions please contact us at reactor@microsoft.com