Humanoids Learn to See. Models Cut Costs in Half. Agents Ship Code.

Today's Overview

Robotics just got a vision upgrade. Humanoid robots need to understand their environment faster than ever-not just see it. A new focus on low-latency sensor fusion means robots can now process camera data, audio localization, and safety checks in milliseconds, catching a falling object or detecting a person in their path before collision. This matters because humanoids aren't staying in labs anymore. They're moving into warehouses, hospitals, and manufacturing floors where they work alongside humans. The engineering required is substantial: synchronized vision across multiple cameras, deterministic audio processing for sound source localization, and battery management that catches thermal failures before they become dangerous. The result is a shift away from cloud processing toward edge-based AI that keeps robots responsive and safe-no internet required.

The Pricing Reset

The cost war for AI models just entered a new phase. Qwen 3.7 Max cut prices by 50%-prompt costs dropped from $2.50 to $1.25 per million tokens. Xiaomi's MiMO models went even further, with completion tokens down 86% in some cases. This isn't noise. At scale, a 50% price cut changes the unit economics of every application built on these models. Startups that were choosing between inference providers based on latency can now factor price back into the equation. The message is clear: as models commoditize, vendors are competing on cost, not just capability. For builders, this means the cost-per-task keeps dropping, making AI integration viable for smaller use cases and tighter margins.

Agents Ship. Code Deploys. Humans Watch.

The real shift this week isn't in model performance-it's in who does the work. Google's Antigravity 2.0 demo showed an agent that takes a single prompt, spawns subagents to write code, run tests, and deploy features to physical arcade hardware, all without human intervention between the prompt and the result. That's a different kind of tool than an AI that helps you code faster. This is AI that removes entire steps from your workflow. The catch? Agents work best when the task is well-defined and the feedback loop is tight. Customer questionnaires that took a week now take 30 minutes when an agent extracts relevant data, synthesizes it, and formats the response. But handing off thinking entirely carries a risk: elite consultants using AI outperformed peers on most tasks, then got confidently wrong answers when the model failed-and didn't catch it. The skill that matters now is knowing when to override the agent.

Ben Thompson's analysis of SpaceX's IPO points to something stranger than a $2 trillion valuation on current financials. He's arguing that space data centers are actually plausible-not as hype, but as engineering. A Starlink satellite is already roughly the size of a GPU rack. With better power dissipation and radiation hardening, you could put frontier model inference in orbit, serving low-latency requests globally with no terrestrial data center constraints. The stick? Zoning. Building data centers on Earth requires community permission. Running out of physical space to build is a real constraint in a decade of exponential compute demand. Space might not be the dream. It might be the plan.

For builders and business owners watching these moves, the pattern is clear: costs are falling, agents are handling repetitive work, and the bottleneck is shifting from compute power to everything else-talent, product-market fit, and the judgment to know which problems your humans should still solve. The tools are getting better and cheaper. What you build with them is up to you.