Microsoft Research AI
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. This research introduces a benchmark for evaluating spatially grounded long-horizon task planning in robotics.
Why it matters: This benchmark provides a framework to assess and improve the planning capabilities of AI systems in complex, real-world environments.
- Introduces a new benchmark for evaluating task planning in robotics.
- Focuses on spatially grounded, long-horizon tasks.
- Aims to improve decision-making in AI systems using vision-language models.
OpenAI Blog
OpenAI has developed Sora 2 and the Sora app to address safety challenges in video models and social creation platforms. The approach focuses on building safety into the foundation of these systems.
Why it matters: Ensuring safety in AI systems is crucial for their reliable deployment in creative and social contexts.
- Sora 2 addresses novel safety challenges in video models.
- The Sora app integrates safety measures from the ground up.
- Focuses on protecting users in creative and social AI applications.