April Speight
Microsoft
Discover, connect, grow
Ready to get started with AI? Microsoft Reactor provides events, training, and community resources to help startups, entrepreneurs, and developers build their next business on AI technology. Join us.
April 29, 2025 | 7:00 PM - 8:00 PM (UTC) Coordinated Universal Time
Topic: Core AI
Language: English
In this new era of AI agents and nascent robotics interactions, creating truly engaging and intelligent experiences requires real-time, low-latency communication, adaptive behavior, and seamless multimodal integration across text, speech, and vision. The GPT-4o real-time API in Azure OpenAI Service unlocks new possibilities for robotics developers, enabling robots to process natural language and speech, interpret images and context, and generate dynamic responses with minimal latency.
This session dives into crafting AI experiences that feel truly alive across both physical and virtual environments. Attendees will explore how to infuse AI with personality, speech, animations, and facial expressions—transforming it into an interactive agent capable of engaging its surroundings with dynamic, lifelike qualities. Additionally, we'll discuss using virtual 3D bodies as a test bed for robotics applications.
In this talk, we’ll explore how real-time capabilities can power interactive backends for robotic and immersive applications, highlighting both their strengths and limitations. We’ll discuss the challenges encountered along the way, the feasibility of using this technology today, and whether it meets the demands of real-world applications or still needs additional capabilities.
Through live demos and practical examples, you’ll see how AI combined with 3D embodiments can revolutionize entertainment, education, customer service, and beyond. Whether you’re an AI engineer, an innovator looking to push the boundaries of human-AI interaction, or a roboticist interested in bringing robots to life with LLMs, this session will provide actionable insights to take your projects to the next level.
We'll cover two technology stacks:
Three.js, React, Ready Player Me, Blender, Azure GPT-4o Realtime API
Unity, SALSA LipSync Suite, Eleven Labs Text-to-Speech, Ready Player Me
Join us to discover how Azure OpenAI is shaping the future of real-time, interactive AI and robotic experiences!
Speakers
This event is part of the AI Agents Hack: Python Track Series.
Visit the series page to see all upcoming and on-demand events.
If you have any questions, please contact reactor@microsoft.com