Gemini 2.5: AI Browser Interaction Model
Description
Tune in to explore Google’s latest advancement in artificial intelligence: the Gemini 2.5 Computer Use model. This new AI model is designed with the unique capability to navigate and interact with the web just like a human user.
The Gemini 2.5 Computer Use model can perform actions such as clicking, scrolling, and typing within a browser window. It utilizes “visual understanding and reasoning capabilities” to analyze a user’s request and then carry out complex tasks, such as filling out and submitting forms. This functionality is crucial because it allows the AI agent to access data and operate within interfaces that lack an API or other direct connection.
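For a concrete picture of how such a model drives a browser, the sketch below shows the typical perceive-and-act loop: capture a screenshot, ask the model for the next action, then execute that action in the browser. It assumes Playwright for the browser side; the plan_next_action helper standing in for the model call and the simple action dictionary format are hypothetical placeholders, not Google’s documented schema.

```python
# Minimal sketch of a computer-use agent loop, assuming Playwright for the
# browser and a hypothetical plan_next_action() wrapper around the model call.
from playwright.sync_api import sync_playwright


def plan_next_action(screenshot_png: bytes, goal: str) -> dict:
    """Hypothetical stand-in for the Gemini 2.5 Computer Use model call.

    A real agent would send the screenshot and goal to the model and return
    its predicted action, e.g. {"type": "click", "x": 200, "y": 340}.
    """
    raise NotImplementedError


def run_agent(goal: str, start_url: str, max_steps: int = 20) -> None:
    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.goto(start_url)
        for _ in range(max_steps):
            # Perceive: screenshot the current state; Plan: ask the model.
            action = plan_next_action(page.screenshot(), goal)
            # Act: translate the model's action into a browser operation.
            if action["type"] == "click":
                page.mouse.click(action["x"], action["y"])
            elif action["type"] == "type":
                page.keyboard.type(action["text"])
            elif action["type"] == "scroll":
                page.mouse.wheel(0, action["delta_y"])
            elif action["type"] == "done":
                break
        browser.close()
```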
Google’s new model currently supports 13 distinct actions, including opening a web browser, typing text, and dragging and dropping elements. It can be used for tasks like UI testing or for navigating interfaces built for people. Earlier versions of this capability have powered research prototypes such as Project Mariner, which executes tasks in a browser, for example adding items to a shopping cart based on a list of ingredients. Developers can access the Gemini 2.5 Computer Use model through Google AI Studio and Vertex AI.
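As a rough illustration of developer access through the Gemini API, the snippet below requests a computer-use response via the google-genai Python SDK. The model name, tool configuration, and enum names are assumptions based on the preview documentation and may differ in your SDK version; treat this as a hedged sketch rather than a definitive integration.

```python
# Hedged sketch: enabling the computer-use tool with the google-genai SDK.
# The model id and ComputerUse/Environment names are assumptions from the
# preview documentation and may not match every SDK release.
from google import genai
from google.genai import types

client = genai.Client()  # reads the API key from the environment

config = types.GenerateContentConfig(
    tools=[
        types.Tool(
            computer_use=types.ComputerUse(
                environment=types.Environment.ENVIRONMENT_BROWSER
            )
        )
    ]
)

response = client.models.generate_content(
    model="gemini-2.5-computer-use-preview-10-2025",  # assumed preview model id
    contents=["Open example.com and add the first item to the cart."],
    config=config,
)

# The response is expected to carry a function call describing the next UI
# action (click, type, scroll, ...), which the calling agent then executes.
print(response.candidates[0].content.parts)
```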
This announcement follows other industry moves, such as OpenAI’s work on its ChatGPT Agent feature and Anthropic’s release of a version of its Claude AI with similar capabilities, but Google notes a key distinction: unlike those leading alternatives, its new model is currently restricted to a browser environment rather than an entire desktop operating system. Despite this, Google asserts that the Gemini 2.5 Computer Use model “outperforms leading alternatives on multiple web and mobile benchmarks.”