Ishaan Sehgal
Microsoft
DISCOVER, CONNECT, GROW
Ready to get started with AI? Microsoft Reactor provides events, training, and community resources to help startups, entrepreneurs and developers build their next business on AI technology. Join us!
28 February 2024 | 5:00 PM - 6:30 PM UTC (Coordinated Universal Time)
Topic: Microservices & APIs
Language: English
Join us to learn how to run open-source Large Language Models (LLMs) with HTTP-based inference endpoints inside your AKS cluster using the Kubernetes AI Toolchain Operator (KAITO). We'll walk through the setup and deployment of containerized LLMs on GPU node pools, and see how KAITO can reduce the operational burden of provisioning GPU nodes and tuning model deployment parameters to fit GPU profiles.
Speakers
Format: Livestream
This event is part of the Learn Live: Intelligent Apps on AKS Series.
Click here to visit the Series Page, where you can see all the upcoming and on-demand events.
For questions, please contact us at reactor@microsoft.com.