As AI-generated content proliferates, a recent study of mainstream AI watermarking technologies has sparked intense debate: the "invisible watermarks" designed to mark AI outputs can be removed with simple tools. The finding not only exposes technical shortcomings but has also ignited fierce discussion about fake news, content abuse, and regulation. An AI safety expert's post on the X platform quickly garnered 200,000 interactions, becoming a focal point of public discourse.
Event Overview: The Alarm Bell of Watermark Failure
In October 2024, researchers released a report stating that AI watermarking technologies from tech giants such as OpenAI and Google saw their detection accuracy plummet to below 10% when confronted with common image editing tools or added noise. The finding spread quickly through tech circles. Expert @AI_Safety_Guru posted on X: "AI watermarks are just paper tigers and placebos; the real cat-and-mouse game has just begun." The post was shared over 50,000 times, with the comments debating the merits of watermarking versus detection-based approaches and highlighting divisions within the industry.
Background: The Rise and Original Intent of AI Watermarking Technology
AI watermarking technology emerged from concerns about the proliferation of generative AI. With the popularization of tools like ChatGPT and Midjourney, AI-generated text, images, and videos have flooded the internet and become difficult to distinguish from human creations. Starting in 2023, OpenAI embedded invisible watermarks in DALL·E 3, using algorithms that implant imperceptible markers at the pixel level; Google's SynthID targeted audio and images, promising "robust" detection.
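To give a sense of how pixel-level marking works in principle, here is a minimal, illustrative sketch of a least-significant-bit (LSB) scheme: a key-derived pseudorandom bit pattern is written into the lowest bit of each pixel, and detection measures how well those bits match the pattern. This is a toy under stated assumptions, not the actual DALL·E 3 or SynthID algorithm (both are proprietary and far more sophisticated); the function names, the key, and the image sizes are invented for illustration.

```python
# Illustrative only: a toy least-significant-bit (LSB) watermark.
# The key, function names, and parameters are assumptions; real systems
# such as DALL-E 3's watermark or Google's SynthID work differently.
import numpy as np

def embed_watermark(image: np.ndarray, key: int = 42) -> np.ndarray:
    """Hide a key-derived pseudorandom bit pattern in the pixel LSBs."""
    rng = np.random.default_rng(key)
    pattern = rng.integers(0, 2, size=image.shape, dtype=np.uint8)
    # Clear each pixel's lowest bit, then write the pattern bit into it.
    return (image & 0xFE) | pattern

def detect_watermark(image: np.ndarray, key: int = 42) -> float:
    """Return the fraction of pixel LSBs that match the key's pattern."""
    rng = np.random.default_rng(key)
    pattern = rng.integers(0, 2, size=image.shape, dtype=np.uint8)
    return float(np.mean((image & 1) == pattern))

# A watermarked image scores near 1.0; an unmarked image scores near 0.5 (chance).
img = np.random.randint(0, 256, size=(256, 256), dtype=np.uint8)
marked = embed_watermark(img)
print(detect_watermark(marked))  # ~1.0
print(detect_watermark(img))     # ~0.5
```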
The original intent of these technologies was "traceability": watermarks serve as digital ID cards, helping platforms and users identify AI content to prevent deepfakes and fake news. For instance, the U.S. Federal Election Commission warned that AI-generated fake videos could interfere with the 2024 election. Watermarks were seen as a low-cost solution to avoid a blanket ban on AI-generated content.
Core Content: Research Reveals Watermark Vulnerability
The controversy stems from a paper titled "AI Watermarking Under Scrutiny," published by researchers from UC Berkeley. They tested 10 mainstream watermarking schemes, including OpenAI's text watermarking (which leaves a mark by biasing token probabilities) and Stability AI's image watermarking. The results showed the following (a toy illustration of this fragility appears after the list):
- Applying a mild blur filter in Adobe Photoshop reduced watermark detection rates to 5%.
- Adding 5% Gaussian noise could completely disable watermarks.
- Even free online tools like Remove.bg could strip 80% of markers.
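To see how little it can take to defeat a statistical mark, here is a toy continuation of the LSB sketch from the background section: adding a small amount of Gaussian noise scrambles the pixels' lowest bits and the detection score collapses to chance. The noise level is an arbitrary assumption, the example reuses the hypothetical embed_watermark and detect_watermark functions defined earlier, and it does not reproduce the paper's actual attacks or the vendors' real schemes.

```python
# Illustrative only: a light perturbation wipes out the toy LSB watermark
# defined in the earlier sketch (embed_watermark / detect_watermark).
# The noise level (sigma) is an assumption chosen for demonstration.
import numpy as np

def add_gaussian_noise(image: np.ndarray, sigma: float = 2.0) -> np.ndarray:
    """Add small Gaussian noise and clip back to valid 8-bit pixel values."""
    noise = np.random.default_rng(0).normal(0.0, sigma, image.shape)
    noisy = image.astype(np.float64) + noise
    return np.clip(np.rint(noisy), 0, 255).astype(np.uint8)

img = np.random.randint(0, 256, size=(256, 256), dtype=np.uint8)
marked = embed_watermark(img)          # from the earlier sketch
attacked = add_gaussian_noise(marked)

print(detect_watermark(marked))    # ~1.0: watermark clearly present
print(detect_watermark(attacked))  # ~0.5: detection reduced to chance
```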
The researchers noted that watermarks rely on statistical patterns (such as pixel correlations), which attackers can defeat with optimized countermeasures such as "adversarial examples." X platform data show the paper link was shared over 100,000 times, with #AIWatermarkFail trending.
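The same "statistical pattern" fragility applies to text. The sketch below shows the general idea behind probability-bias watermarking as described in the published green-list literature (e.g., Kirchenbauer et al., 2023): a keyed hash of the previous token splits the vocabulary into "green" and "red" halves, green tokens receive a small logit boost during generation, and a detector counts how often tokens land in their green list. Whether OpenAI's internal scheme works this way is an assumption, and all constants and names here are illustrative; paraphrasing or regenerating the text reshuffles token choices and washes the signal out.

```python
# A minimal sketch of "lexical probability bias" watermarking, modeled on
# the published green-list approach (Kirchenbauer et al., 2023). Whether
# any vendor's internal scheme matches this is an assumption; the vocab
# size, bias strength, and key are illustrative values.
import hashlib
import numpy as np

VOCAB_SIZE = 50_000
GREEN_FRACTION = 0.5
BIAS = 2.0  # logit boost added to "green" tokens

def green_list(prev_token: int, key: str = "secret") -> np.ndarray:
    """Derive a keyed pseudorandom green/red vocabulary split from the previous token."""
    digest = hashlib.sha256(f"{key}:{prev_token}".encode()).digest()
    rng = np.random.default_rng(int.from_bytes(digest[:8], "big"))
    return rng.random(VOCAB_SIZE) < GREEN_FRACTION

def biased_sample(logits: np.ndarray, prev_token: int) -> int:
    """Sample the next token after nudging green-token logits upward."""
    boosted = logits + BIAS * green_list(prev_token)
    probs = np.exp(boosted - boosted.max())
    probs /= probs.sum()
    return int(np.random.default_rng().choice(VOCAB_SIZE, p=probs))

def green_rate(tokens: list[int]) -> float:
    """Detection statistic: fraction of tokens that fall in their green list."""
    hits = [green_list(prev)[tok] for prev, tok in zip(tokens, tokens[1:])]
    # ~0.5 for ordinary text; noticeably higher for text generated with biased_sample.
    return float(np.mean(hits))
```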
Various Perspectives: Watermarking Advocates vs. Detection Advocates
Industry opinion is polarized. Supporters of watermarking see it as a necessary starting point. OpenAI researcher Noam Brown responded on X: "Watermarks aren't foolproof, but they're evolving. We're developing multi-layer nested schemes." The Google DeepMind team also posted, emphasizing that SynthID had withstood initial attack tests.
"Watermarking is the first line of defense, removing it is like disarming yourself."—OpenAI Chief Scientist Ilya Sutskever (quoted from X post)
Opponents point directly to its "arms race" nature. AI safety expert Timnit Gebru commented on X: "Watermarking vs. de-watermarking is a cat-and-mouse game, bound to fail eventually. We should shift to source detection, like model fingerprinting." Another post, from independent researcher @DeepfakeHunter, drew 30,000 likes: "The detection arms race is already lost; watermarks are easily faked; regulation is the key."
Chinese scholars have been equally vocal. Zhang Yaqin, director of Tsinghua University's AI Lab, stated: "Watermarking technology needs to be combined with blockchain to form an immutable chain." In Chinese-language X circles, @AI_ChinaWatch posted: "Domestic watermarks from Baidu and Alibaba have been optimized, but the lack of international standards is the pain point."
Impact Analysis: From Fake News to Global Regulation
Watermark failure amplifies the risk of AI abuse. First, the barrier to fake news is lowered: attackers can generate "traceless" AI videos to spread rumors. In recent Indian elections, AI-faked videos misled millions; similar concerns are mounting ahead of the U.S. midterm elections. Second, the content ecosystem takes a hit: social platforms such as X and TikTok face enormous moderation pressure, while advertisers steer clear of AI content.
The economic impact cannot be ignored. McKinsey reports predict that, without effective traceability, the AI content market could lose over $50 billion by 2025. The stakes in the security field are even higher: military reports warn that AI-forged intelligence could upend battlefield decision-making.
Calls for regulation are rising accordingly. The EU's AI Act already requires that AI-generated content be marked as such, U.S. senators plan to introduce a "Content Authenticity Act," and China's Cyberspace Administration emphasizes "generative AI registration plus watermarking," though experts worry about implementation difficulties. The #RegulateAI tag on X has drawn over 500,000 interactions, with public opinion leaning toward a dual approach of "technology plus regulation."
Conclusion: Technological Iteration and Shared Responsibility
The AI watermarking controversy reveals the double-edged nature of generative AI: rapid innovation paired with rapidly growing risk. In the short term, watermarks need greater robustness, for example by combining them with cryptographic techniques such as zero-knowledge proofs; in the long term, industry consensus and global standards are urgently needed. Developers, platforms, and regulators must work together to avoid the vicious cycle of "technical failure → widespread abuse." As @AI_Safety_Guru concluded: "Don't let watermarks become a laughingstock; take action!" Whether a safe harbor for AI content can be built remains to be seen.