Skip to main content

Project Mariner: A Guide With Five Practical Examples

Learn about Google’s Project Mariner through real-world examples and a breakdown of its strengths and weaknesses.
Jun 19, 2025  · 8 min read

Project Mariner is Google DeepMind’s experimental AI browser agent. Mariner uses Gemini’s powerful multimodal capabilities to see your screen, understand what’s displayed, plan tasks, and execute them autonomously—clicking, typing, scrolling, and filling forms on your behalf.

In this blog post, I share my experience of trying it on the following five real-world tasks:

  • Book movie tickets
  • Find flights
  • Book a plumber to fix my kitchen sink
  • Browse the web to fill in a spreadsheet
  • Solve a LeetCode problem

We keep our readers updated on the latest in AI by sending out The Median, our free Friday newsletter that breaks down the week’s key stories. Subscribe and stay sharp in just a few minutes a week:

How Does Project Mariner Work?

Project Mariner uses a prompt-based UI that lets us describe what we want it to do. It then spins up a browser and attempts to complete the task based on that input.

Project Mariner's UI

For now, it’s limited to browser-related tasks—it can’t access or control your computer. Behind the scenes, it runs in a browser and relies on Gemini’s multi-modal capabilities to get the job done. It can see and understand the window’s content, perform clicks, and type text.

I evaluated Project Mariner by testing it on several tasks. For each one, I recorded the process and will show you the results. Since Mariner is quite slow at completing tasks, I edited the videos to speed up the slower sections.

How to Access Project Mariner?

Project Mariner is currently available to U.S. users subscribing to the Google AI Ultra plan at $249.99 per month.

This plan offers many other tools (like Veo 3), advanced Gemini models and features, and data storage. However, it is currently the only plan giving you access to Project Mariner, making it a very expensive tool.

Let’s explore a few examples.

Example 1: Booking Movie Tickets

The first thing I tried was booking movie tickets. I used the following prompt:

"Find a horror movie playing in the cinema this Friday night in New York and help me book a ticket."

Here's a breakdown of how it solved the task:

  1. Performed a Google search to find horror movies playing on Friday
  2. Out of the options, it selected the first
  3. Then it identified the button to book the tickets and tried the first option, Alamo Drafthouse Cinema.
  4. This didn't work, so it opted for Fandango.
  5. Eventually, it asked for my intervention in selecting the seat.
  6. Then it also asked me to confirm before moving to the payment page.
  7. At this point, it marked the task as completed and let me continue manually.

I think overall this was a success. Since I didn't select the seat, I feel asking me to proceed with seat selection was unnecessary. It could also have reached the payment page by itself without asking for my consent. After that, I think it's a positive that the interaction isn't automated, as I wouldn't want to spend money without a manual confirmation.

Example 2: Finding the Best Flights

I need to travel from Portugal to Taiwan, passing through Belgium, so I thought I'd try to see how Mariner would perform in finding the best flights for me. Here's the prompt I used (I didn't provide information about the dates to see what would happen):

"I want to travel from Portugal to Taiwan, passing by Belgium for three days. Help me find the best flights."

This one was very disappointing. The only thing it did was navigate to Google Flights and fill in the form for me. Moreover, Mariner is quite slow at identifying where to click when navigating the website, so this process took much longer than it would have taken me.

I also tried to ask it to book the tickets, but it entered a loop, and I eventually gave up.

Example 3: Finding and Booking a Plumber

Next, I was curious to see if it could help me find a plumber. For this example, I used a random address in New York to see if it would be able to find someone near that location and go through with the booking.

"I live at 33 Wilson Ave. Hamburg, NY 14075. Find and book a plumber for tomorrow to fix a leak in my kitchen sink."

This task worked quite well. It was able to find a plumbing company in New York and book an appointment. When it needed information such as my name and email, it stopped and asked for it, then filled in the form automatically.

Example 4: Filling Data in a Google Sheet

A common yet very tedious task people need to do is gather some data from several sources and place it into a spreadsheet. To test Mariner's ability to solve such a task, I created a simple sheet of AI companies.

Example spreedsheet

Then I gave it access to the tab and asked it to search the web for the contact emails of the companies and fill in the Email column.

As you probably noticed, everything went well except for one case. When searching for the email of Google DeepMind, it entered a loop, so I eventually stopped it and asked it to try again—this time, it succeeded.

Example 5: Solving a LeetCode Problem

For the final example, I wanted to create something more complex, so I asked the following:

"Go to LeetCode and solve a medium difficulty problem of your choice using Python 3"

It first tried using filters to find a medium-difficulty problem, but for some reason, that didn't work. Then, it gave up and found a problem without needing my intervention.

Then, it wrote the code in the chat instead of using the LeetCode interface. I asked it to enter the answer directly into the interface, but it showed up with very strange formatting that it couldn’t fix.

Then I asked it to write the solution in the chat again, and I set it manually. I was curious to see if the answer was correct. However, I couldn't log in to LeetCode because of Cloudflare. This is a problem I saw happening a lot when testing Mariner.

I did submit the solution later with my LeetCode account, and it was correct. The fastest step during this execution was solving the problem. As soon as the problem was opened, a solution was shown in the chat. This was very impressive to me.

Teaching a Task on Project Mariner

Project Mariner comes with a Chrome extension we can use to teach it how to perform tasks. The way it works is that it records your screen while you perform and explain the task to it (yes, you need to speak while performing the task).

From that data, it extracts a list of actions. I tried teaching it how to shop on Walmart, and this is what I got:

Task training output

I don't know if it collects other data as well, but I felt it was a lot of effort to perform and explain the task just to get this list, which I could have typed in a few seconds.

I tried executing the task, but it didn't work. It feels like it is missing steps, like asking what the items on the shopping list are. You can manually edit a task before creating it, but I didn't find any way to edit it later on.

As of now, I feel that this feature is still underdeveloped and not ready for use. Moreover, I found it hard to find a use case because if I had just asked it to add items from a spreadsheet shopping list into a Walmart cart, it would have done a better job.

Pros & Cons of Project Mariner

The good parts

Good at browsing the web

All the tasks that involved browsing the web and finding information on a website seem to work well. The movie ticket and plumber use cases are good examples of this.

Work with multiple tabs

Mariner can solve tasks that require using multiple tabs and performing sequential Google searches.

Visual understanding

Despite being slow, it is good at understanding the content of the website and its elements. It can interact with elements, fill forms, and click buttons to control the website.

The bad parts

Here are, in my opinion, the main problems with the current iteration of Project Mariner.

Obstacles to automating the web

Navigating the web is a complex process, littered with various challenges for automated tools like Project Mariner. One significant hurdle is the resistance from websites through tools like Cloudflare and reCAPTCHA, designed to block bots and protect against malicious activities.

During my experiments, I got blocked very often. Even when trying one of their suggested examples, the website blocked me:

This resistance complicates Mariner's task execution, as it struggles to bypass these barriers meant for human verification. Additionally, due to regulations, websites often feature cookie banners and pop-ups, forcing users to continuously make decisions about their data preferences.

Mariner does not automatically handle cookie selections, likely because users should make these choices based on their privacy preferences. There may also be regulations in place that prevent automation of these decisions. As a result, the numerous decision points and necessary interactions escalate the complexity and friction for tools like Mariner to function efficiently.

Ideally, we aim to automate tasks to reduce manual effort, but the current state of these tools still requires significant oversight, making it clear that we're far from realizing a truly seamless and helpful automation experience.

Slow to process the UI

Another problem that Mariner faces at the moment is speed. It takes a long time to process the content on the screen. Filling out even the simplest of forms can take minutes. 

Not enough reasoning

During my experiments, it seemed that whenever a choice had to be made—like which platform to buy movie tickets from or which plumbing company to use—the agent evaluated the options one by one, in the order they appeared.

That isn't a good strategy, and it isn't how humans usually make their choices. I feel that the agent is missing a reasoning and research step when making decisions.

Integration with the browser

I struggled to share tabs with it. It would often log me out and lose access to the information displayed in the tab I shared. The workaround was to log in manually inside the browser it uses. This is cumbersome and makes me not want to use it.

I think it would be better if it were a Chrome extension that lived side by side with the current open tab. That way, you could ask it at any time to take control of the tab and perform actions directly.

I assume there are security concerns with this approach, but I don't see it being very useful when using its browser that is detached from ours, because most tasks will then require a long setup, such as logins, to be completed.

Loops

It often happens that it gets into loops where it asks me if something is okay and then keeps asking without moving further. Here's an example:

Example of Mariner looping

Privacy and security concerns

Another concerning aspect regards privacy and security. Here’s the official privacy notice

In short, Google's Project Mariner collects your conversations, browsing data, and feature usage to improve its services and AI, following Google's Privacy Policy. Some data may be reviewed by real people. Therefore, avoid sharing confidential information. 

Conclusion

Project Mariner is still very much an experimental product—slow, limited in scope, and prone to running into roadblocks like CAPTCHA or login issues. In its current state, I don’t think it’s worth subscribing just to have access.

What I appreciate most about Google’s approach is their willingness to release unpolished prototypes early on. This kind of transparency and iteration lets them learn quickly and build better products over time.

In contrast, if you look at Apple—which prefers only to release perfectly polished products—they seem to be missing out in the fast-evolving AI race. AI thrives on data and real-world usage, so exposing these early-stage tools, even if unfinished, is a smart way to iterate and improve rapidly.

Introduction to AI Agents

Learn the fundamentals of AI agents, their components, and real-world use—no coding required.
Explore Course

François Aubry's photo
Author
François Aubry
LinkedIn
Full-stack engineer & founder at CheapGPT. Teaching has always been my passion. From my early days as a student, I eagerly sought out opportunities to tutor and assist other students. This passion led me to pursue a PhD, where I also served as a teaching assistant to support my academic endeavors. During those years, I found immense fulfillment in the traditional classroom setting, fostering connections and facilitating learning. However, with the advent of online learning platforms, I recognized the transformative potential of digital education. In fact, I was actively involved in the development of one such platform at our university. I am deeply committed to integrating traditional teaching principles with innovative digital methodologies. My passion is to create courses that are not only engaging and informative but also accessible to learners in this digital age.
Topics

Learn AI with these courses!

Track

AI Fundamentals

0 min
Discover the fundamentals of AI, dive into models like ChatGPT, and decode generative AI secrets to navigate the dynamic AI landscape.
See DetailsRight Arrow
Start Course
See MoreRight Arrow
Related

Tutorial

Google Jules: A Guide With 3 Practical Examples

Learn what Google Jules is and how to use it to automate real-world development tasks for your GitHub repository.
Aashi Dutt's photo

Aashi Dutt

12 min

Tutorial

Perplexity Labs: A Guide With 5 Practical Examples

Learn about Perplexity Labs, its features for creating reports, dashboards, and web applications, and how to use it through practical examples.
Bex Tuychiev's photo

Bex Tuychiev

8 min

Tutorial

Mistral Agents API: A Guide With Demo Project

Learn how to build AI agents using Mistral's Agents API, and explore key concepts like tool usage, connectors, handoffs, and more.
Aashi Dutt's photo

Aashi Dutt

12 min

Tutorial

OpenAI's O3: A Guide With Five Practical Examples

Explore five practical examples of how to use OpenAI's o3 model within the ChatGPT application.
Marie Fayard's photo

Marie Fayard

5 min

Tutorial

Gemini CLI: A Guide With Practical Examples

Learn how to install the Gemini CLI on your local machine, how to start a project from scratch or work on an existing one, and more.
Aashi Dutt's photo

Aashi Dutt

8 min

Replit Agent: from idea to app

Tutorial

Replit Agent: A Guide With Practical Examples

Learn how to set up Replit Agent and discover how to use it through an example walkthrough and 10 real-world use cases.
Dr Ana Rojo-Echeburúa's photo

Dr Ana Rojo-Echeburúa

10 min

See MoreSee More