Image: Gstudio/Adobe Stock
A recently discovered prompt-injection flaw in Google’s Gemini makes it possible for hackers to target unsuspecting users in sophisticated phishing attacks.
Google’s Gemini chatbot is vulnerable to a prompt-injection exploit that could trick users into falling for phishing scams, without them ever seeing it coming.
The flaw allows attackers to embed hidden instructions in seemingly benign emails. When a user clicks Summarize This Email using Gemini for Google Workspace, the chatbot can be manipulated into generating fake security alerts, prompting victims to click malicious links or call scam phone numbers.
According to the anonymous researcher who originally discovered and reported the vulnerability, the technique “involves clever and unorthodox tactics designed to deceive the model, often requiring an understanding of its operational mechanics to achieve desired outcomes.”
Since the malicious email doesn’t include any attachments, it’s not always viewed as a red flag—either by users or their SPAM filters. Moreover, since it exploits HTML and CSS code, it’s easily hidden within the body of the email itself. Once it’s been embedded, Gemini for Google Workspace will process it just like any other set of instructions.
“Because the injected text is rendered in white-on-white (or otherwise hidden), the victim never sees the instruction in the original message, only the fabricated ‘security alert’ in the AI generated summary,” Marco Figueroa, researcher with 0DIN, said.
It’s important to note that neither Google, the anonymous researcher, nor the team at 0DIN has seen any verified reports of this happening to any Gemini users; however, this specific prompt-injection attack was demonstrated as a proof of concept by 0DIN’s researchers.
Google has gone to great lengths to secure its Gemini platform. Some of these security controls include:
Dark Reading reported that some of these safeguards have yet to be fully implemented. Google has also confirmed that it will be introducing additional safeguards for Gemini in the coming months.
Even though this particular flaw hasn’t been exploited yet, AI developers need to be aware that their tools could be used as delivery mechanisms by cunning hackers and other malicious actors. This prompt-injection method is specific to Gemini for Google Workspace, but it’s easy to see how a hacker could apply similar techniques to other AI platforms, such as ChatGPT and Grok.
J.R. Johnivan is a technology writer and computer repair professional with 20 years of experience. His work explores emerging technologies, including next-generation LLMs, their societal impact, and how they can improve professional workflows. He began writing while studying computer networking, eventually combining his passion for technology with a career in content. He also brings expertise in project management, HR, and CRM software, giving him a practical, business-focused perspective on today’s tech landscape.