Published on: March 4, 2025 / Updated on: March 4, 2025 - Author: Konrad Wolfenstein
Google Gemini AI with live video analysis and screen sharing functionality - Mobile World Congress (MWC) 2025 - Image: Xpert.digital
Multimodal interaction: The future of the Google AI assistant
New AI functions: What the Gemini Boost means for users
At the Mobile World Congress (MWC) 2025 in Barcelona, Google presented significant extensions to its AI assistant Gemini that are intended to improve the user experience through new visual functions. The main innovations are live video analysis and screen sharing, which will be available to subscribers of the Google One AI Premium plan at the end of March. These developments mark an important milestone in Google's strategy to integrate artificial intelligence more deeply into everyday life and to improve multimodal interaction.
The new visual functions for Gemini
Live video analysis
One of the outstanding innovations presented at MWC 2025 is the live video function for Gemini. It lets users show the AI in real time what their smartphone camera captures and hold a natural conversation about it. The function was first shown in May 2024 at the Google I/O conference and is now ready for rollout. In a demonstration video from Google, a potter points the camera at a collection of ceramic pieces and asks Gemini for advice on the color of her next vase. The AI assistant analyzes the existing colors and gives a reasoned recommendation for a suitable glaze.
This function uses Gemini's multimodal capabilities to process visual information in real time and interpret it in the context of a natural conversation. It is part of a larger Google effort known as “Project Astra” and represents significant progress in the development of AI assistants that can interact with the real world.
Screen sharing functionality
The second important visual addition is the screen sharing function, which lets users share their smartphone screen with Gemini. When the Gemini interface is opened on Android, a new button labeled “Share screen with Live” appears, through which users can share their screen with the AI assistant in real time. The function is accompanied by a new notification in the style of a phone call, which is intended to integrate it seamlessly into the user interface.
In practice, Gemini can help with online shopping, for example. In one demonstration, Google shows a user asking Gemini what would go well with a pair of jeans displayed on the screen. Gemini can then make recommendations based on what it sees and guide the user through the purchase process.
Technical details and availability
Timetable for the introduction
Google plans to roll out the new live video and screen sharing functions to Gemini Advanced subscribers in March 2025. The functions are offered as part of the Google One AI Premium plan, which costs 21.99 euros per month. Initially, the extensions will only be available on Android devices, with Pixel and Samsung devices among the first to be supported.
Integration in Gemini Live
The new visual functions are integrated into Gemini Live, Google's conversational AI assistant, which enables real-time conversations. Gemini Live was updated with Gemini 2.0 Flash, a version of the multimodal model that was specially optimized for fast, mobile use. Screen sharing is accompanied by a new notification in the style of a phone call, which is intended to integrate the feature seamlessly into the user experience.
Technological basis
The new functions are based on Project Astra, Google's project for a universal multimodal AI assistant. The aim of this project is to develop an assistant that can process text, video and audio data in real time and retain it in a conversation context of up to ten minutes. The technology should also be able to draw on Google Search, Lens and Maps to offer a comprehensive assistant experience.
Gemini in the context of the AI assistant market
Competitive position
With the new visual functions, Google is positioning itself strategically against its main competitor OpenAI and its ChatGPT. ChatGPT's Advanced Voice Mode has supported live video and screen sharing since December 2024. By integrating these functions into Gemini Live, Google ensures that its AI assistant remains competitive and offers comparable capabilities.
Meaning for the smartphone industry
The introduction of advanced AI functions such as those in Gemini could have an important impact on the smartphone industry. After two years of declining sales, during which many consumers kept their devices longer, AI assistants with expanded capabilities could create new incentives to buy. In Germany, according to Bitkom, only one in three people has a device that is less than a year old; in 2023 the figure was still 55 percent.
Smartphone manufacturers are using the new AI functions as a differentiating feature, since the devices are very similar both externally and technologically. For example, Samsung shows how an agent can carry out tasks across several apps on its new Galaxy S25 smartphone, while Oppo demonstrates the visual capabilities of artificial intelligence in image processing.
More updates for Gemini
Extended language support
In addition to the visual functions, Google has also expanded Gemini's language skills. The AI assistant can now understand and speak 45 languages. A particularly innovative feature is the ability to switch languages mid-sentence without changing the phone's language settings: "Gemini Live will understand and answer".
New widgets for iPhone users
Although the visual functions are initially only available on Android devices, Google has also announced updates for iPhone users. Version 1.2025.0762303 of the Gemini app introduces six different lock screen widgets that provide faster access to the AI assistant. These widgets include options such as "Enter", "Talk to Gemini live", "Open the microphone", "Use camera", "Share image" and "Share file". They can be placed both on the lock screen and in the Control Center of the iPhone, which makes Gemini easier to reach.
Some observers see this development as an attempt to lure iPhone and iPad users away from Apple's voice assistant Siri. Apple is reportedly making slow progress in developing a more powerful version of Siri that can compete with the leading AI platforms.
Conclusion: meaning and outlook
The updates for Gemini that Google presented at MWC 2025 mark an important step in the evolution of AI assistants. The new visual functions, live video analysis and screen sharing, enable more intuitive and context-aware interaction between users and artificial intelligence. They are part of a broader development towards multimodal assistants that can increasingly interact with the real world.
The integration of these functions could have far-reaching effects in several areas. For the smartphone industry, they could create new incentives to buy and help to revive the stagnating market. For users, they open up new opportunities to use AI in everyday life, whether for shopping, creative projects or finding information.
At the same time, these developments illustrate the ongoing competition between the large technology companies in the field of AI assistants. Google, OpenAI, Apple and others are continuously working to improve their assistants and equip them with new functions. This is driving innovation and could lead to even more powerful and intuitive AI assistants in the coming years.
With Project Astra and the new functions for Gemini, Google shows its long-term vision for AI assistants: they should be universal, multimodal and deeply integrated into everyday life. The updates presented at MWC 2025 are an important step in this direction and offer a glimpse into the future of human-machine interaction.