Google unveils Gemini 2.0 Flash ‘Thinking’ model for AI reasoning

Google has released an experimental model, Gemini 2.0 Flash Thinking, designed to showcase the model’s reasoning process while solving complex problems.

Available on Google AI Studio, this model focuses on multimodal understanding, reasoning, and coding. It is intended to tackle challenging problems in fields such as programming, math, and physics.

Gemini 2.0 Flash Thinking

The Gemini 2.0 Flash Thinking model is designed to improve reasoning over difficult challenges by generating the model’s thinking process during problem-solving.

This approach allows it to outperform the standard Gemini 2.0 Flash model. According to Logan Kilpatrick, head of product for AI Studio, this new model is “the first step in Google’s reasoning journey.”

Video Player

Media error: Format(s) not supported or source(s) not found

Download File: https://images.fonearena.com/blog/wp-content/uploads/2024/12/Gemini-2.0-Flash-Thinking-Model.mp4?_=1

00:00

Use Up/Down Arrow keys to increase or decrease volume.

Jeff Dean, Google DeepMind’s chief scientist, emphasized that the model “uses thoughts to strengthen its reasoning,” resulting in improved performance through increased inference time computation—the amount of computing used to process and analyze a question.

Enhanced Speed and Performance

As part of the Gemini 2.0 Flash series, the Thinking model leverages the speed and performance of Flash 2.0, offering faster computation. Google’s team shared several demos, particularly in physics and probability, to showcase how the model approaches questions.

For instance, when solving a physics problem, the model demonstrates its reasoning by breaking down the question into manageable steps before providing a final answer.

Video Player

Media error: Format(s) not supported or source(s) not found

Download File: https://images.fonearena.com/blog/wp-content/uploads/2024/12/Gemini-2.0-Flash-Thinking-reasoning-model.mp4?_=2

00:00

Use Up/Down Arrow keys to increase or decrease volume.

How to Use Thinking Mode

Thinking Mode is available through two primary channels: the Gemini API and Google AI Studio.

Using the Gemini API:
To use the model via the Gemini API, developers can specify the model code gemini-2.0-flash-thinking-exp when making a request.
Using Google AI Studio:
To access the model in Google AI Studio, simply select the Gemini 2.0 Flash Thinking Experimental model from the drop-down menu in the Settings pane.

Understanding the Model’s Thinking Process

The thought process generated by the model appears in different ways depending on the platform used. When using the Gemini API, the model’s thoughts appear as the first element of the content generated.

In Google AI Studio, the thinking process is displayed in a separate “Thoughts” panel, which can be expanded to view the model’s reasoning steps.

Limitations of Thinking Mode

As an experimental model, Gemini 2.0 Flash Thinking has the following limitations:

32k token input limit
Text and image input only
8k token output limit
Text-only output
No built-in tools, such as Search or code execution

Availability

The Thinking Mode model is now available in Google AI Studio and can be accessed through the Gemini API and Vertex AI. Developers can begin using it today by trying out the model with Google’s Colab notebook or by integrating it into their projects.

Rate This