© Vantage_DS/Shutterstock.com
Gemini, Google’s generative AI, continues to evolve. Today, it already rivals ChatGPT and is able to understand many formats, in addition to text, thanks to its multimodal capabilities. But the next step in Gemini's development will see Google offer “AI agents” that will save Chrome users even more time.
In a press release published this week, Google presents Gemini 2.0. Available as a test version, it is still in development. And one of its main new features is that it will allow the firm to offer AI agents that will be able to perform actions on the internet in place of the user.
Subscribe to Presse-citron
Building on the new capabilities of Gemini 2.0, Google has developed a prototype called Mariner, which is able to take control of Google Chrome to perform online actions for the user, in order to save time. In the video below, Google gives Mariner a list of companies and asks the AI to search for their contacts. Mariner then takes control of Google Chrome and performs the searches on behalf of the user, ultimately producing the requested list of contacts. Throughout the process, Mariner details in real time the actions it performs as well as its reasoning.
“Agents”, the future of artificial intelligence
Another video posted by Google suggests that, thanks to the AI agent, it will be possible to entrust a search in Gemini, for a specific type of product, and then add the product you are looking for to an e-commerce site.
But for the moment, it is not yet a finished product. “Project Mariner is an early research prototype built with Gemini 2.0 that explores the future of human-agent interaction, starting with your browser. As a research prototype, it can understand and reason through information on your browser screen, including pixels and web elements like text, code, images, and forms, and then use that information through an experimental Chrome extension to perform tasks for you,” the company says.
200% Deposit Bonus up to €3,000 180% First Deposit Bonus up to $20,000It does admit that, for now, the AI may produce incorrect results or take too long to complete certain tasks. But Google is confident that its AI’s performance will improve over time. For now, the Mariner project is only available to a group of testers, via a Chrome extension. Otherwise, Google has also developed similar agents for developers to automate tasks in the workflow of coders.
Google continues to tease the Astra project
Google is also continuing to develop the Astra project, which it presented at its I/O conference in May. As a reminder, Astra is an assistant that uses the camera of a smartphone or smart glasses to observe the user's environment and answer questions related to this environment. The new version of this assistant is based on Gemini 2.0, which improves performance. For example, thanks to Gemini 2.0, Astra improves its language skills.
The future assistant can also work in tandem with Google Search, Google Lens or Google Maps, to be even more useful. And thanks to the performance of Gemini 2.0, it also has lower latency. Google explains that it continues to develop Astra in order to offer this visual assistant on the Gemini mobile application, and on connected glasses.
You can test Gemini 2.0 Flash now today
It’s unclear at this point when Google will let us use its AI agents. And it doesn’t say when Project Astra will be available. However, if you want a taste of the company’s next-generation AI, you can start using Gemini 2.0 Flash today. Just go to Google’s chatbot and select this new template.
“In addition to supporting multimodal inputs such as images, video, and audio, Flash 2.0 now supports multimodal outputs such as natively generated images mixed with text and multilingual, steerable text-to-speech (TTS) audio. It can also natively call tools like Google Search, code execution, and third-party user-defined functions,” the firm says.
- Google Unveils Gemini 2.0, the Next Evolution of Its Generative AI
- With its new capabilities, this AI allows Google to develop agents that don't just answer questions, but can also take control of Google Chrome to perform actions
- Gemini 2.0 also allows Google to improve the Astra project
📍 To not miss any Presse-citron news, follow us on Google News and WhatsApp.
M162.5 reviews
[ ]