April Speight
Microsoft
发现、联系、增长
是否准备好开始使用 AI? Microsoft Reactor 提供活动、培训和社区资源,以帮助初创公司、企业家和开发人员利用 AI 技术打造新业务。 快加入我们吧!
发现、联系、增长
是否准备好开始使用 AI? Microsoft Reactor 提供活动、培训和社区资源,以帮助初创公司、企业家和开发人员利用 AI 技术打造新业务。 快加入我们吧!
29 四月, 2025 | 7:00 下午 - 8:00 下午 (UTC) 协调世界时
主题: 核心 AI
语言: 英语
In this new era of AI agents and nascent robotics interactions, creating truly engaging and intelligent experiences requires real-time, low-latency communication, adaptive behavior, and seamless multimodal integration across text, speech, and vision. The GPT-4o real-time API in Azure OpenAI Service unlocks new possibilities for robotics developers, enabling robots to process natural language and speech, interpret images and context, and generate dynamic responses with minimal latency.
This session dives into crafting AI experiences that feel truly alive across both physical and virtual environments. Attendees will explore how to infuse AI with personality, speech, animations, and facial expressions—transforming it into an interactive agent capable of engaging its surroundings with dynamic, lifelike qualities. Additionally, we'll discuss using virtual 3D bodies as a test bed for robotics applications.
In this talk, we’ll explore how real-time capabilities can power interactive backends for robotic and immersive applications, highlighting both their strengths and limitations. We’ll discuss the challenges encountered along the way, the feasibility of using this technology today, and whether it meets the demands of real-world applications or still needs additional capabilities.
Through live demos and practical examples, you’ll see how AI combined with 3D embodiments can revolutionize entertainment, education, customer service, and beyond. Whether you’re an AI engineer, innovator looking to push the boundaries of human-AI interaction, or a roboticist interested in bringing robots to life with LLMs, this session will provide actionable insights to take your projects to the next level.
We'll cover two technology stacks:
Three.js, React, Ready Player Me, Blender, Azure GPT-4o Realtime API
Unity, SALSA LipSync Suite, Eleven Labs Text-to-Speech, Ready Player Me
Join us to discover how Azure OpenAI is shaping the future of real-time, interactive AI experiences!
Join us to discover how Azure OpenAI is shaping the future of real-time, interactive robotic experiences!
Check out some resources:
主讲人
此活动属于 AI Agents Hack: Python Track Series.
单击此处 访问“系列”页 可在此处查看所有即将举办的活动和点播活动。
如有疑问,请联系我们 reactor@microsoft.com