Gemini 3 Has Launched

admin7, November 20, 2025

Gemini 3 Overview

Gemini 3 encompasses multiple versions: Gemini 3, Gemini 3 Pro Preview, and Gemini 3 Deep Think, showcasing a comprehensive integration across Google’s products and services.
It significantly outperforms previous models, with benchmarks indicating a notable leap in capabilities, especially in reasoning and knowledge tasks.
The model accepts text, images, audio, video, and code as input, with a context window of up to 1 million input tokens.
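To give a feel for what a 1 million token input limit means in practice, here is a minimal Python sketch that estimates whether a piece of text would fit. The 4-characters-per-token ratio is only a rough rule of thumb for English text and an assumption here, not a property of Gemini's tokenizer; the function names are made up for this example.

```python
# Rough check of whether a text fits in a 1M-token input window.
# CHARS_PER_TOKEN is an assumed heuristic, not a value from Gemini's tokenizer.
MAX_INPUT_TOKENS = 1_000_000
CHARS_PER_TOKEN = 4  # assumption: about 4 characters per token for English


def estimate_tokens(text: str) -> int:
    """Estimate the token count as ceil(len(text) / CHARS_PER_TOKEN)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(text: str, reserved_tokens: int = 0) -> bool:
    """Return True if the estimate plus any reserved tokens stays within the limit."""
    return estimate_tokens(text) + reserved_tokens <= MAX_INPUT_TOKENS


if __name__ == "__main__":
    sample = "word " * 500_000  # 2.5 million characters
    print(estimate_tokens(sample), fits_in_context(sample))
```

For a real application the count should come from the provider's own token counting tool rather than a character heuristic.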

Benchmark Performance

Gemini 3 achieved exceptional scores on multiple benchmarks, including a 37.5% score on Humanity’s Last Exam, compared to older models like Gemini 2.5 Pro at 21%.
On Vending Bench 2, Gemini 3 finished ahead of competing models with a net worth of $5,478.16, a result used to gauge long-term economic planning.
It also excelled in complex multi-step reasoning tasks, showcasing a 22-point performance increase over Gemini 2.5 Pro in enterprise use cases.

Deep Think Capabilities

The Gemini 3 Deep Think variant scores higher on reasoning benchmarks, reaching 41% on Humanity’s Last Exam.
On the GPQA scientific knowledge benchmark it took the leading position with a score of 93.8%.
On visual reasoning puzzles, Gemini 3 Deep Think scored 45.1%, which the video presents as evidence of stronger generalization.

Integration with Google Services

Gemini 3 is integrated into Google Search, offering dynamic user interfaces based on user queries, enhancing the search experience.
The model can extract insights from documents and perform multi-step logic tasks, making it suitable for enterprise applications, particularly in sectors like healthcare and finance.
Its capability to analyze video content frame by frame allows for comprehensive understanding and interaction with multimedia data, enhancing its utility in content creation.

Automation and Task Management

The newly introduced Gemini Agent feature allows the model to complete tasks on behalf of users, such as organizing emails and generating contextual responses.
This automation capability is designed to streamline workflows, providing users with dynamic views and actionable suggestions based on their needs.
Gemini 3’s ability to perform real-world tasks positions it as a powerful tool for enhancing productivity in various professional environments.

Gemini 3 Launch: Unmatched Benchmarks and Performance in AI Model Comparisons

Gemini 3 outperforms previous models in extensive benchmarks.

On Humanity’s Last Exam, Gemini 3 reached 37.5% with no tools and 45.8% with code execution.
On the vending benchmark, Gemini 3 finished with a net worth of $5,478.16, ahead of Claude Sonnet 4.5.

Gemini 3 Pro significantly outperforms Gemini 2.5 Pro in multi-step reasoning tasks.

Gemini 3 Pro shows a 22-point increase, scoring 85% compared with 63% for Gemini 2.5 Pro.
The benchmark evaluation by Box focused on complex task automation and multi-document analysis for enterprise workflows.
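The "22-point" figure is a difference in percentage points, which is not the same as a 22% relative improvement. The short Python sketch below works through both readings using the two scores quoted above; the function names are illustrative only.

```python
def point_gain(new_score: float, old_score: float) -> float:
    """Difference in percentage points between two scores given in percent."""
    return new_score - old_score


def relative_gain(new_score: float, old_score: float) -> float:
    """Improvement relative to the old score, expressed in percent."""
    return (new_score - old_score) / old_score * 100


if __name__ == "__main__":
    gemini_3_pro, gemini_25_pro = 85.0, 63.0  # scores quoted in the article
    print(f"{point_gain(gemini_3_pro, gemini_25_pro):.0f} points")       # 22 points
    print(f"{relative_gain(gemini_3_pro, gemini_25_pro):.1f}% relative")  # 34.9% relative
```

So the jump from 63% to 85% is 22 points, or roughly a 35% improvement relative to Gemini 2.5 Pro's score.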

Gemini 3 Deep Think outperforms rivals in key reasoning benchmarks.

Gemini 3 Deep Think scores 41% on Humanity’s Last Exam, surpassing Gemini 3 Pro’s 37.5%.
In visual reasoning tests, Gemini 3 Deep Think achieves 45.1%, well ahead of the GPT and Claude models.
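The Humanity’s Last Exam numbers are scattered across this summary, so a small Python sketch can line them up against the Gemini 2.5 Pro baseline. It uses only scores quoted in this article. The labels and the helper function are made up for the example, and the with-tools and no-tools results are not directly comparable.

```python
# Humanity's Last Exam scores as quoted in this article, in percent.
# Note: the code-execution result uses tools, so it is a different setting.
HLE_SCORES = {
    "Gemini 2.5 Pro": 21.0,
    "Gemini 3 Pro (no tools)": 37.5,
    "Gemini 3 Deep Think": 41.0,
    "Gemini 3 Pro (code execution)": 45.8,
}


def ranked_with_delta(scores: dict[str, float], baseline: str) -> list[tuple[str, float, float]]:
    """Return (name, score, points above baseline), sorted from highest score to lowest."""
    base = scores[baseline]
    rows = [(name, score, round(score - base, 1)) for name, score in scores.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    for name, score, delta in ranked_with_delta(HLE_SCORES, "Gemini 2.5 Pro"):
        print(f"{name:<32}{score:>6.1f}%  {delta:+.1f} pts")
```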

Gemini 3 shows significant advancements in multi-modal understanding, especially video processing.

The video cites a 10x improvement over Gemini 2.5 Pro in this area, along with tighter integration of the different input types.
Gemini 3 uniquely analyzes video frame by frame, allowing for in-depth understanding and interaction with visual content.

Gemini 3 introduces dynamic user interfaces in Google Search.

The presenter walks through the AI Mode in Google Search that launched with Gemini 3.
Interfaces are generated dynamically from the user's query, changing how search results are displayed and interacted with.

Gemini 3 excels in custom search and long-term planning capabilities.

The newly launched Antigravity coding platform, a VS Code fork, supports development with multiple AI models.
Gemini 3’s performance on the Vending Bench 2 demonstrates its proficiency in managing real-world economic tasks over long time periods.

Gemini 3 shows strong growth and innovative task management features.

Gemini 3 outperforms competitors by keeping its net worth rising over one year, while other models' net worth declines.
The Gemini Agent capability completes tasks automatically, such as email management, and presents results through dynamic UIs.

Gemini 3 introduces advanced email handling and new model capabilities.

Users can accept or reject the agent's suggested email actions and have contextual replies generated automatically.
Gemini 3 is a new foundation model with broad multimodal input support, built on Google’s custom TPU hardware.