April Gittens
Microsoft
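Learn, Connect, Build
Ready to get started with AI and the latest technologies? Microsoft Reactor provides events, training, and community resources to help developers, entrepreneurs, and startups build on AI technology and more. Join us!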
April 29, 2025 | 7:00 PM - 8:00 PM (UTC) Coordinated Universal Time
Topic: Agents
Language: English
In this new era of AI agents and nascent robotics interactions, creating truly engaging and intelligent experiences requires real-time, low-latency communication, adaptive behavior, and seamless multimodal integration across text, speech, and vision. The GPT-4o real-time API in Azure OpenAI Service unlocks new possibilities for robotics developers, enabling robots to process natural language and speech, interpret images and context, and generate dynamic responses with minimal latency.
This session dives into crafting AI experiences that feel truly alive across both physical and virtual environments. Attendees will explore how to infuse AI with personality, speech, animations, and facial expressions—transforming it into an interactive agent capable of engaging its surroundings with dynamic, lifelike qualities. Additionally, we'll discuss using virtual 3D bodies as a test bed for robotics applications.
In this talk, we’ll explore how real-time capabilities can power interactive backends for robotic and immersive applications, highlighting both their strengths and limitations. We’ll discuss the challenges encountered along the way, the feasibility of using this technology today, and whether it meets the demands of real-world applications or still needs additional capabilities.
Through live demos and practical examples, you’ll see how AI combined with 3D embodiments can revolutionize entertainment, education, customer service, and beyond. Whether you’re an AI engineer, an innovator looking to push the boundaries of human-AI interaction, or a roboticist interested in bringing robots to life with LLMs, this session will provide actionable insights to take your projects to the next level.
We'll cover two technology stacks:
Three.js, React, Ready Player Me, Blender, Azure GPT-4o Realtime API
Unity, SALSA LipSync Suite, Eleven Labs Text-to-Speech, Ready Player Me
Join us to discover how Azure OpenAI is shaping the future of real-time, interactive AI and robotic experiences!
Speakers
This event is part of the AI Agents Hack: Python Track Series.
Visit the series page to see all upcoming and on-demand events.