Google launches Project Mariner, an AI agent that can understand and reason about information on the browser screen to help complete tasks
Last night, Google announced Gemini 2.0, the latest model from its AI team, with multimodal support capable of understanding content ranging from images to video.
Using this model, Google has built an AI agent called Project Mariner, an early research prototype based on Gemini 2.0 that aims to explore the future of human-computer interaction, starting with the browser.
The agent can understand and reason about the information on the browser screen, including pixels, text, code, images, and web elements such as forms, and then uses this information to complete tasks through an experimental Chrome extension.
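Google has not published Project Mariner's internals, but the ingredients named here (pixels, text, forms, an extension) map onto Chrome's standard extension APIs. Purely as an illustration, a background service worker might gather that screen state along the following lines; the permissions, the `captureBrowserState` function, and the shape of the returned state object are all assumptions of mine:

```typescript
// Hypothetical sketch of a Manifest V3 background service worker.
// Assumes "tabs", "scripting", and "activeTab" permissions in manifest.json.

interface BrowserState {
  screenshot: string;    // data: URL of the visible viewport (the pixels)
  text: string;          // visible page text
  formActions: string[]; // submit targets of any forms on the page
}

async function captureBrowserState(tabId: number): Promise<BrowserState> {
  // Screenshot of the visible tab: the raw pixels the model reasons over.
  const screenshot = await chrome.tabs.captureVisibleTab({ format: "png" });

  // Pull text and form metadata out of the page itself.
  const [injection] = await chrome.scripting.executeScript({
    target: { tabId },
    func: () => ({
      text: document.body.innerText,
      formActions: Array.from(document.forms).map((f) => f.action),
    }),
  });

  const page = injection.result as { text: string; formActions: string[] };
  return { screenshot, text: page.text, formActions: page.formActions };
}
```

Treating the screenshot as the primary observation keeps an approach like this general: it still works on pages whose DOM structure is hard to interpret.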
Take, for instance, a webpage full of data that needs to be copied and organized into a spreadsheet: this is exactly where the agent comes into play.
Once you give Project Mariner an instruction, the agent interacts with the browser on its own, organizing the data and entering it into the designated areas of the page according to your requirements.
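How an instruction turns into browser activity is not described in the announcement, but agent systems of this kind typically run an observe-decide-act loop. A minimal sketch under that assumption, where `chooseNextAction` stands in for the model call and every helper here is a placeholder rather than a documented API:

```typescript
// Hypothetical observe-decide-act loop. captureBrowserState is sketched
// above; chooseNextAction would call a Gemini-style model; performAction
// is sketched in the next snippet.
type AgentStep =
  | { kind: "act"; description: string }
  | { kind: "done" };

declare function captureBrowserState(tabId: number): Promise<unknown>;
declare function chooseNextAction(instruction: string, state: unknown): Promise<AgentStep>;
declare function performAction(tabId: number, step: AgentStep): Promise<void>;

async function runTask(tabId: number, instruction: string): Promise<void> {
  const MAX_STEPS = 50; // cap iterations so a stuck agent cannot loop forever
  for (let i = 0; i < MAX_STEPS; i++) {
    const state = await captureBrowserState(tabId);          // observe
    const step = await chooseNextAction(instruction, state); // decide
    if (step.kind === "done") return;                        // task complete
    await performAction(tabId, step);                        // act
  }
}
```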
As an early prototype, Project Mariner is currently limited to typing, scrolling, and clicking in the active browser tab (the page you have open), and it asks for user confirmation before taking sensitive actions such as making purchases or payments.
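A plausible, and again entirely assumed, shape for that action layer: a small vocabulary covering exactly typing, scrolling, and clicking, with a confirmation prompt gating anything flagged as sensitive before it runs in the active tab. The `AgentAction` type and the `askUserToConfirm` helper are hypothetical:

```typescript
// Hypothetical action vocabulary: the three things the prototype is said
// to do, plus a flag for steps that need user sign-off.
type AgentAction =
  | { kind: "type"; selector: string; text: string }
  | { kind: "scroll"; deltaY: number }
  | { kind: "click"; selector: string; sensitive?: boolean };

// Assumed UI helper, e.g. a confirmation dialog rendered by the extension.
declare function askUserToConfirm(message: string): Promise<boolean>;

async function dispatch(tabId: number, action: AgentAction): Promise<void> {
  // Sensitive steps (say, a "Place order" button) need explicit approval.
  if (action.kind === "click" && action.sensitive) {
    const approved = await askUserToConfirm(
      `Allow the agent to click ${action.selector}?`
    );
    if (!approved) return;
  }

  // Run the action inside the active tab only.
  await chrome.scripting.executeScript({
    target: { tabId },
    args: [action],
    func: (a: AgentAction) => {
      switch (a.kind) {
        case "type": {
          const input = document.querySelector<HTMLInputElement>(a.selector);
          if (input) input.value = a.text;
          break;
        }
        case "scroll":
          window.scrollBy(0, a.deltaY);
          break;
        case "click":
          document.querySelector<HTMLElement>(a.selector)?.click();
          break;
      }
    },
  });
}
```

Keeping the confirmation check outside the injected function matters in a design like this: the page being automated should never be able to approve its own sensitive actions.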
Evaluated on WebVoyager, a benchmark designed to measure end-to-end AI agent performance on real-world web tasks, Project Mariner achieved a state-of-the-art result of 83.5% working as a single agent.
Google is initially making Project Mariner available to a small group of trusted developers for testing, and plans to gradually expand the pool so that more developers, and eventually the general public, can try this form of human-computer interaction.