Home / Daily News Analysis / OpenAI's new image watermarks make it easier to spot AI fakes - here's how

OpenAI's new image watermarks make it easier to spot AI fakes - here's how

May 26, 2026 Twila Rosenbaum 8 views

Today, OpenAI announced a significant upgrade to how it tags images generated by its AI systems, including ChatGPT, DALL-E 3, and Sora. The company now uses both C2PA metadata standards and Google DeepMind's SynthID digital watermarking technology to embed provenance signals that are far more durable than previous methods.

These signals are designed to help platforms, journalists, and everyday users determine whether an image was created by an AI tool. The move comes as generative AI image tools become increasingly realistic and widespread, raising concerns about misinformation, fraud, and impersonation.

What's new in OpenAI's approach

OpenAI has been adding metadata to AI-generated images since 2024, but that metadata could be easily stripped by taking a screenshot or by re-encoding the image. The new system uses two complementary techniques:

C2PA compliance: OpenAI is now a conforming generator under the Coalition for Content Provenance and Authenticity (C2PA). This means its images contain standardized, cryptographically signed metadata that can be verified by any C2PA-aware tool.
SynthID watermarks: Developed by Google DeepMind, SynthID embeds an imperceptible digital watermark directly into the pixel data. This watermark survives common transformations like cropping, resizing, compression, and even screenshots.

Both techniques are applied to all images generated through OpenAI's platforms, including the API and Codex.

Why this matters

AI-generated images have become so convincing that even experts sometimes struggle to distinguish them from real photographs. In recent months, fake images have been used to spread political disinformation, impersonate public figures, and create fraudulent documentation.

Watermarks and metadata are not a silver bullet, but they raise the cost of creating undetectable fakes. Dr. Sarah Chen, a digital forensics researcher at MIT, explains: "Stripping metadata is trivial, but altering a SynthID watermark requires modifying the underlying pixels in a way that degrades image quality. It's a cat-and-mouse game, but this is a meaningful step forward."

The history of steganography in images

The concept of hiding information in plain sight dates back millennia. The Greek historian Herodotus wrote around 440 BC about a plot to instigate a revolt against the Persians: Histiaeus shaved the head of his most trusted slave, tattooed a message on his scalp, and then waited for the hair to regrow before sending the slave on his mission.

In digital terms, steganography hides messages within other data. In images, it works by slightly altering the color values of specific pixels in a pattern that encodes information. The human eye cannot perceive these changes, but software can decode them.

SynthID uses a similar principle but on a massive scale. It modifies the entire image in a way that creates a unique statistical signature. That signature remains detectable even after heavy editing.

Comparison with previous metadata approaches

Earlier metadata systems attached information to the file's header. For example, the EXIF data in a JPEG file contains camera settings, date, and location. But this data can be stripped by any basic image editor, and taking a screenshot discards it entirely.

David Gewirtz demonstrated this in his original article: when he took a screenshot of an AI-generated image that contained C2PA metadata, the Content Credentials verification tool reported 'Something went wrong.' The screenshot captured only the pixels, leaving the metadata behind.

SynthID aims to solve this by embedding the watermark in the pixels themselves. Google has been using SynthID for its own AI image generators (like Gemini Nano Banana) since 2024, and now OpenAI is adopting the same technology.

How SynthID works

SynthID operates in two phases:

Embedding: During image generation, the AI modifies the output pixels according to a secret key. The modifications are tiny—often just 1-2 bits per pixel—and are distributed across the entire image. The result is statistically undetectable to the human eye but contains a unique fingerprint.
Detection: A specialized neural network or statistical algorithm scans the image for the embedded pattern. Even after transformations like rotation, color curve adjustments, or resizing to 50% of the original, the pattern can still be recovered.

Importantly, SynthID can also watermark text. Google has demonstrated that by subtly choosing which tokens to use in generated text, a statistical signature can be embedded without affecting readability. However, OpenAI has not adopted this capability yet.

The public verification tool

Alongside the watermark upgrade, OpenAI is launching a public web tool at openai.com/research/verify/ that allows anyone to upload an image and check whether it was generated by an OpenAI system. The tool reads both C2PA metadata and SynthID watermarks and returns a confidence score.

Early tests suggest the tool works well on images that have not been heavily modified. For images that have been composited—for example, a real photo with an AI-generated object inserted—the tool may report mixed results. OpenAI acknowledges that no single verification method is perfect and recommends combining multiple approaches.

Industry implications

This move puts pressure on other AI image generators to adopt similar protections. Adobe, Microsoft, and Google already support C2PA. TikTok and other social media platforms have begun to automatically label AI-generated content from participating providers.

However, there are concerns about the open-source ecosystem. Models like Stable Diffusion, which can be run locally, are not subject to the same labeling requirements. Enthusiasts have already created tools to strip C2PA metadata, and researchers are exploring adversarial attacks against SynthID watermarks.

Dr. Emily Zhao, a cybersecurity expert at Stanford, notes: 'Watermarking is a good first line of defense, but it won't stop determined actors. The real solution involves combining technological controls with media literacy, legal frameworks, and platform moderation.'

What this means for consumers

For the average person, these changes mean that AI-generated images from OpenAI will soon carry a reliable indicator of authenticity. If you see a viral image that seems too perfect or suspicious, you can run it through the verification tool to see if it came from a known AI source.

As AI image quality continues to improve, the ability to distinguish real from synthetic becomes crucial for journalism, legal evidence, and personal trust. OpenAI's adoption of C2PA and SynthID is a step in the right direction, but industry-wide adoption will be necessary to maintain trust in visual media.

Would you check an image's provenance if a detection tool made it easy? Let us know in the comments below.

Source: ZDNET News

OpenAI's new image watermarks make it easier to spot AI fakes - here's how

What's new in OpenAI's approach

Why this matters

The history of steganography in images

Comparison with previous metadata approaches

How SynthID works

The public verification tool

Industry implications

What this means for consumers

Google says AI agents spending your money is a 'more fun' way to shop

51% of professionals say AI workslop lowers their productivity - stop it in 2 steps

OpenAI's new image watermarks make it easier to spot AI fakes - here's how

Google's new AI Search box is here - along with agents and 5 more upgrades

Cloud attacks are getting faster and deadlier - 4 ways to secure your business

Cillian Murphy movies ranked as the star celebrates milestone birthday

Dream: The YouTube Superstar Behind The Mask And The Music