Artificial intelligence ‘godmother’ calls for spatial intelligence

Dr. Fei-Fei Li argues the next leap in Artificial Intelligence will be spatially intelligent systems that grasp real-world physics. She says world models that build realistic 3D, physics-consistent representations are crucial to move from language to perception and action.

Dr. Fei-Fei Li published a new essay arguing that the next major advance in Artificial Intelligence will come from spatial intelligence: systems that can understand, reason about, and generate 3D, physics-consistent worlds. Li says large language models have mastered abstract knowledge but still lack the ability to perceive and act in space, including tasks like estimating distance and motion. She frames spatial understanding as the cognitive core of human intelligence and a necessary step to take Artificial Intelligence from language to real-world perception and action.

At the center of Li’s vision are world models that can create realistic 3D environments, interpret inputs such as images and actions, and predict how those environments evolve over time. She argues these capabilities will be essential for robotics and for applications across science, healthcare, and design. World models that understand object interactions and physics could one day help predict molecular reactions, model climate systems, or test materials. Li notes the technical challenge of teaching models real-world physics, but highlights momentum with her World Labs and efforts from companies including Google and Tencent to build spatially intelligent systems.

The newsletter places Li’s essay alongside other developments in Artificial Intelligence. Anthropic projects a major cost advantage over OpenAI by relying on a mix of chips from Amazon, Nvidia, and Google and expects to be cash flow positive by 2027. Microsoft Copilot Desktop’s Voice and Vision features can scan Google Sheets or Excel files, let users ask analysis questions by voice, and generate reports that highlight cells and explain calculations. Separately, GPT-5 became the first model to solve a full 9×9 Sudoku puzzle on Sakana AI’s Sudoku-Bench and achieved a 33 percent solve rate across puzzles, underlining progress in structured reasoning even as many puzzles remain unsolved.

58

Impact Score

ChatGPT Images adds thinking capability

OpenAI has upgraded ChatGPT Images with a new thinking mode that can search the internet, generate multiple images, and verify outputs before finalizing results. The update also improves text rendering, dense compositions, multilingual support, and style flexibility.

YouTube expands deepfake detection to Hollywood talent

YouTube is opening its likeness protection system to actors, athletes, musicians and creators beyond its own platform. The move gives public figures a way to flag and request removal of damaging Artificial Intelligence-generated replicas while YouTube weighs broader rules and possible future monetization.

Adobe plans outcome-based pricing for Artificial Intelligence agents

Adobe is positioning its Artificial Intelligence agents around performance-based pricing, charging only when the software completes useful work. The approach points to a more results-oriented model for selling generative Artificial Intelligence tools to business customers.

Tech firms commit billions to Artificial Intelligence infrastructure

Amazon, OpenAI, Nvidia, Meta, Google and others are signing increasingly large cloud, chip and data center agreements as demand for Artificial Intelligence infrastructure accelerates. The latest wave of deals spans investments, compute purchases, chip supply agreements and data center buildouts.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.