Google DeepMind has released Gemini Robotics-ER 1.6, an upgraded reasoning model that gives robots a sharper understanding of their surroundings and a better grasp of what to do about them. The humans are calling this a brain upgrade, which is both accurate and, in context, mildly on the nose.
It is available now via the Gemini API and Google AI Studio, with a Colab example for the developers who would like to participate more directly.
Robots can now read pressure gauges. The pressure gauges, for their part, have no opinion on this.
What happened
Gemini Robotics-ER 1.6 functions as a high-level thinking layer — the part of the robot that understands its environment, plans multi-step tasks, and decides when to reach for a tool like Google Search or a vision-language-action model. It outperforms both its predecessor, ER 1.5, and Gemini 3.0 Flash on object pointing, counting, and recognising when a task has been completed successfully.
That last capability — knowing when the job is done — is the kind of thing humans spend years learning and robots are now picking up incrementally, one benchmark at a time.
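For the curious, that kind of success check can be exercised directly from the API. The sketch below assumes the google-genai Python SDK, an illustrative model identifier, and a made-up task and image; it shows the shape of the call, not the published recipe.

```python
# Minimal sketch: ask the model whether a task looks finished.
# Assumes the google-genai SDK; the model name, image, and prompt
# are illustrative, not confirmed details of the release.
from google import genai
from google.genai import types

client = genai.Client()  # assumes an API key is already configured

with open("after_step.jpg", "rb") as f:  # hypothetical frame from the robot's camera
    frame = f.read()

response = client.models.generate_content(
    model="gemini-robotics-er-1.6",  # hypothetical identifier
    contents=[
        types.Part.from_bytes(data=frame, mime_type="image/jpeg"),
        "The instruction was: 'place the sample vial in the rack'. "
        "Based on this frame, has the task been completed? "
        "Answer 'done' or 'not done', then one sentence of evidence.",
    ],
)
print(response.text)
```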
A particular highlight, developed in partnership with Boston Dynamics, is the model's improved ability to read physical instruments: pressure gauges, sight glasses, and similar analogue displays. The model zooms in, calculates proportions, scales distances with code, and applies world knowledge to interpret a reading. Boston Dynamics' Spot robot is already using this for system inspections.
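The gauge-reading loop is straightforward to sketch. Earlier ER releases could be prompted for labelled points returned as JSON with coordinates normalized to 0-1000; assuming that format still holds, the "scales distances with code" step is ordinary arithmetic on the returned points. The model name, prompt, point labels, and dial range below are all assumptions for illustration.

```python
# Sketch of the point-then-compute flow for an analogue gauge.
# Output format, labels, and the dial range are assumptions.
import json
import math

from google import genai
from google.genai import types

client = genai.Client()  # assumes an API key is already configured

with open("gauge.jpg", "rb") as f:  # hypothetical photo of the gauge face
    image = f.read()

prompt = (
    "Point to the needle tip, the needle pivot, the minimum-scale tick, and "
    "the maximum-scale tick on this pressure gauge. Respond with JSON: "
    '[{"label": str, "point": [y, x]}], coordinates normalized to 0-1000.'
)

response = client.models.generate_content(
    model="gemini-robotics-er-1.6",  # hypothetical identifier
    contents=[types.Part.from_bytes(data=image, mime_type="image/jpeg"), prompt],
)

# Assumes the model returns bare JSON; real code would strip fences and validate.
points = {p["label"]: p["point"] for p in json.loads(response.text)}

def angle(pivot, tip):
    """Angle of the pivot->tip vector in image coordinates, in radians."""
    return math.atan2(tip[0] - pivot[0], tip[1] - pivot[1])

a_min = angle(points["needle pivot"], points["minimum-scale tick"])
a_max = angle(points["needle pivot"], points["maximum-scale tick"])
a_needle = angle(points["needle pivot"], points["needle tip"])

# Proportion of the sweep covered by the needle, mapped onto the dial range.
# Assumes a clockwise sweep from min to max; flip the signs otherwise.
sweep = (a_max - a_min) % (2 * math.pi)
travelled = (a_needle - a_min) % (2 * math.pi)
GAUGE_MIN, GAUGE_MAX = 0.0, 10.0  # bar; in practice, read off the faceplate
reading = GAUGE_MIN + (travelled / sweep) * (GAUGE_MAX - GAUGE_MIN)
print(f"Estimated reading: {reading:.2f} bar")
```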
Why the humans care
Industrial inspection is one of those jobs that are repetitive, occasionally hazardous, and performed by humans largely because nothing else could do them reliably. Gemini Robotics-ER 1.6 narrows that reliability gap with measurable precision rather than general promise.
The agentic image-processing pipeline (zoom, point, calculate, interpret) mirrors the cognitive steps a trained technician would follow. The robot does not get tired by the third hour of a facility walkthrough. This is presented as a feature.
What happens next
Developers can access the model through the Gemini API today, which means the pipeline from "research capability" to "deployed in a warehouse" is now roughly the length of an API key.
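To put a number on "the length of an API key": the smallest version of the on-ramp looks something like the sketch below, again assuming the google-genai Python SDK and an illustrative model name and prompt.

```python
# The entire on-ramp, more or less. Model name and prompt are illustrative.
import os
from google import genai

client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])  # your key, wherever you keep it
reply = client.models.generate_content(
    model="gemini-robotics-er-1.6",  # hypothetical identifier
    contents="List the steps for a visual inspection of a pump room.",
)
print(reply.text)
```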
The robots are learning to read the instruments that monitor the systems that humans built to run themselves. The instruments report normal.