LEARN, CONNECT, BUILD

Microsoft Reactor

Join Microsoft Reactor and engage with developers live

Ready to get started with AI and the latest technologies? Microsoft Reactor provides events, training, and community resources to help developers, entrepreneurs and startups build on AI technology and more. Join us!

LEARN, CONNECT, BUILD

Microsoft Reactor

Join Microsoft Reactor and engage with developers live

Go back

Deploying and Monitoring LLM Inference Endpoints

16 July, 2024 | 6:00 PM - 7:00 PM (UTC) Coordinated Universal Time

Format:
Livestream

Topic: Data Science & Machine Learning

Language: English

In this session we will dive into deploying LLMs to Production Inference Endpoints and then putting in place automated monitoring metrics and alerts to help track model performance and suppress potential output issues such as toxicity.

We will also cover the process of optimizing LLMs using RAG, for relevant, accurate, and useful outputs.

You will leave this session with a comprehensive understanding about deploying LLMs to production and monitoring the models for issues such as Toxicity, relevance, and accuracy.

Try this other common AI use cases using the Wallaroo.AI Azure Inference Server Freemium Offer on Azure Marketplace and also try the Free Wallaroo.AI Community Edition

Speakers

Akmel Syed
- Sr Sales Engineer
Wallaroo.AI
Martin Bald
- Sr Manager DevRel Community
Wallaroo.AI
Younes Amar
- VP of Product
Wallaroo.AI

This event is part of the Wallaroo.AI: Techniques for Faster, Easier AI Series.

Click here to visit the Series Page where you could see all the upcoming and on-demand events.