April Gittens
Microsoft
LEARN, CONNECT, BUILD
AI 및 최신 기술을 시작할 준비가 되셨나요? Microsoft Reactor는 개발자, 기업가 및 신생 기업이 AI 기술 등을 기반으로 구축하는 데 도움이 되는 이벤트, 교육 및 커뮤니티 리소스를 제공합니다. 참여하세요.
LEARN, CONNECT, BUILD
AI 및 최신 기술을 시작할 준비가 되셨나요? Microsoft Reactor는 개발자, 기업가 및 신생 기업이 AI 기술 등을 기반으로 구축하는 데 도움이 되는 이벤트, 교육 및 커뮤니티 리소스를 제공합니다. 참여하세요.
29 4월, 2025 | 7:00 오후 - 8:00 오후 (UTC) 협정 세계시
항목: 에이전트
언어: 영어
In this new era of AI agents and nascent robotics interactions, creating truly engaging and intelligent experiences requires real-time, low-latency communication, adaptive behavior, and seamless multimodal integration across text, speech, and vision. The GPT-4o real-time API in Azure OpenAI Service unlocks new possibilities for robotics developers, enabling robots to process natural language and speech, interpret images and context, and generate dynamic responses with minimal latency.
This session dives into crafting AI experiences that feel truly alive across both physical and virtual environments. Attendees will explore how to infuse AI with personality, speech, animations, and facial expressions—transforming it into an interactive agent capable of engaging its surroundings with dynamic, lifelike qualities. Additionally, we'll discuss using virtual 3D bodies as a test bed for robotics applications.
In this talk, we’ll explore how real-time capabilities can power interactive backends for robotic and immersive applications, highlighting both their strengths and limitations. We’ll discuss the challenges encountered along the way, the feasibility of using this technology today, and whether it meets the demands of real-world applications or still needs additional capabilities.
Through live demos and practical examples, you’ll see how AI combined with 3D embodiments can revolutionize entertainment, education, customer service, and beyond. Whether you’re an AI engineer, innovator looking to push the boundaries of human-AI interaction, or a roboticist interested in bringing robots to life with LLMs, this session will provide actionable insights to take your projects to the next level.
We'll cover two technology stacks:
Three.js, React, Ready Player Me, Blender, Azure GPT-4o Realtime API
Unity, SALSA LipSync Suite, Eleven Labs Text-to-Speech, Ready Player Me
Join us to discover how Azure OpenAI is shaping the future of real-time, interactive AI experiences!
Join us to discover how Azure OpenAI is shaping the future of real-time, interactive robotic experiences!
Check out some resources:
스피커
이 이벤트는 다음의 일부입니다. AI Agents Hack: Python Track Series.
여기를 클릭하여 시리즈 페이지 방문 예정된 모든 주문형 이벤트를 볼 수 있는 위치입니다.