Project Jarvis: Google’s AI That Will Browse the Web for You

Share
With Google looking to debut Project Jarvis by the end of this year, it won’t be long until agentic AI is available to the masses
Google has let slip a preview of an upcoming product Project Jarvis, which will send an AI Agent to browse and interact with the web for you

With AI fervour going wild, tech giants are continuously pushing the boundaries of what's possible in order to offer new applications and gain the competitive edge. 

AI has already transformed many aspects of our digital lives, from voice assistants to content recommendation systems and healthcare industry assistance. 

Now, the focus is shifting towards creating AI agents capable of autonomously navigating the web and performing complex tasks on behalf of users.

AI agents and Project Jarvis

Unlike traditional AI models that execute predefined tasks, AI agents  are autonomous, goal-oriented agents that can operate more like human employees, understanding context, setting appropriate goals, and adapting their actions based on changing conditions. 

Google’s Project Jarvis represents these capabilities applied to browsing the web. 

Youtube Placeholder

An accidental leak on the Chrome extension store gave a sneak peak, with a description that labelled it a “helpful companion that surfs the web with you”, before it was rescinded. 

Project Jarvis is said to be able to take control of a web browser to complete tasks such as research and shopping. 

What makes Project Jarvis unique to other AI systems?
  • Browser control: Designed to take over web browsers to complete tasks
  • Task automation: Aims to automate complex tasks like research and shopping
  • Direct computer interaction: Interacts directly with a user's computer or browser
  • Integration with Gemini: To be demonstrated alongside Google's new large language model
  • Autonomous web navigation: Pushes towards more independent internet use than current AI assistants

This project is set to be demonstrated alongside Google's latest large language model, Gemini 2.0, which is expected to rival or surpass existing models like GPT-4.

Google is not alone in this pursuit. Microsoft-backed OpenAI is working on a "computer-using agent" (CUA) that can take actions based on its findings while conducting web-based research. Anthropic, another AI research company, is also reportedly developing similar technology.

Implications and potential impact

Bill Gates, former CEO of Microsoft, believes that advancements in AI are turning such autonomous agents into a real possibility. 

"In the near future, anyone who's online will be able to have a personal assistant powered by AI that's far beyond today's technology,” he stated.

The development of autonomous AI agents could revolutionise how users interact with the internet, potentially enabling comprehensive research across multiple sources, complex online shopping tasks, including finding the best deals and simplified user interactions through natural language commands.

Recent research has shown promising results in autonomous web navigation. 

Agent-E, a state-of-the-art web agent, achieved a 73.2% success rate on the WebVoyager benchmark, representing a 20% improvement over previous text-only models and a 16% improvement over multi-modal web agents.

Agent-Q, another autonomous web agent, demonstrated a 95.4% success rate on real-world booking tasks, a 340% improvement over Llama 3's baseline zero-shot performance.

These advancements highlight the rapid progress being made in the field of autonomous web agents and their potential to transform our digital interactions.

With titans of their respective industry Accenture and Nvidia joining to usher in agentic AI for enterprises, it may not be long until we see greater use cases for these autonomous AI agents outside of web browsing. 

Yet, with Google looking to debut Project Jarvis by the end of this year, it won’t be long until agentic AI is available to the masses to test in one form or another.

******

Make sure you check out the latest edition of AI Magazine and also sign up to our global conference series - Tech & AI LIVE 2024

******

AI Magazine is a BizClik brand

Share

Featured Articles

Responsibility in the Age of AI: O’Reilly President Examines

O’Reilly President Laura Baldwin discusses the legal challenges unmitigated and unobserved use of Gen AI may present to enterprises

Schneider Electric Enhances AI Data Centre Operations

Schneider Electric teams with Nvidia to advance AI data centres, whilst emphasising global sustainability in energy management

How Can AI Firms Pay Publishers? Perplexity Has a Plan

AI search firm Perplexity extends its content licensing programme to 14 new media partners, offering revenue share and API access for publisher content

PwC and AWS Join Forces on Enterprise AI Controls System

AI Strategy

How Amazon Nova is Redefining AI for Enterprise Solutions

AI Strategy

MHP Study: AI Reshapes Global Auto Industry Trust Landscape

AI Strategy