In the current media landscape, it’s easy to feel as if AI-generated content is everywhere, and that concern is not without precedent.
What started as psychedelic dog-like face images and robotic, nonsensical text monologues have morphed into the modern, intelligent, and increasingly difficult-to-detect AI content that we encounter every day.
Every article consumed, image viewed and video watched now triggers a small paranoid question in any responsible editor or consumer’s mind: “Did a human make this?” The reality? We’ve reviewed several independent studies which suggest humans can’t detect AI content.
From helpful editing and re-wording to full-on farmed AI-generated content with the purpose of gaming Google’s search engine optimization algorithm, AI content is present in Google’s search results, and it’s here to stay.
In response, we set out to give a data-backed, statistical count of AI’s presence in Google ratings to answer the question: how much AI content is present in Google?
By sampling the top 20 search results from 500 popular keywords from the beginning of 2019 to the present day, we’re tracking the saturation of AI in search results, all the way back to the release of GPT-2.
Our latest findings, reviewed on January 21, 2025, found that the amount of AI content in Google Search Results is at an all-time high of 19.10%.
This marks a notable increase from the 18.07% of AI content found in Google Search Results in November 2024.
We'll continue monitoring the presence of AI content in Google, so check back on this live dashboard to stay up to date!
In general, over the course of our study, we’ve found a continuously increasing presence of AI content.
Before GPT-2 was released to the public, AI content was detected in only 2.3% of our sampled websites.
Around five years and three Open AI GPT models later, that percentage increased almost threefold to 10.18% in March 2024.
Now, in 2025 AI content is at an all-time high of 19.10%.
We found that even after Google’s implementation of its helpful content policy, first introduced in August 2022, the machine-content saturation still increased.
It is however notable that with the introduction of AI to the general public via ChatGPT, we did expect to see a spike in AI content in Google, which our data does not confirm.
This suggests that Google’s helpful content and spam policies have been at least partially successful at keeping AI Spam at bay.
To learn more about Google’s interactions with AI content, check out our study on the March 2024 Google update, and its repercussions.
Then, get more insight into AI detection accuracy and review a meta-analysis of third-party studies on AI detection.
Curious about Google Reviews? There’s AI content in those too. Check out our AI Google Reviews Study for more information.
The increasing presence of AI in Google is not to be understated. Learning Language Model (LLM) based AI tools are trained on very large human content datasets, often sourced from the internet via an automated scraping/crawling process. If not curated carefully, as the internet becomes more and more saturated with AI-generated content, these training datasets will as well. A study from May 2023 suggests that through multiple iterations of LLM training on datasets containing machine-generated content, models generate more generic and predictable results, and become more likely to mis-perceive their learning task over time. To offset this, the authors stress the importance of the continuing availability of non-machine generated training materials for future model learning. With future models of GPT and other AI models well on their way, and the continuously increasing rate of AI content in websites, said datasets will become more and more difficult to source.
Image: An AI model becoming more uniform over time (Ilia Shumailov et al.)
The data collection phase of this project was designed with the goal of generating a representative sampling of the average results one would see on Google after searching an informational keyword. We automated each step, allowing us to analyze large quantities of data. The process is novel to us, and we believe as well to the general search engine community.
To seed the data points for our study, 500 Google Search keywords were chosen, with the resulting set having the following properties:
For each keyword, we used a search engine optimization (SEO) tool to find their respective top 20 search results. We repeated this process every second month, from January 2019 to present day, resulting in a list of 10,000 websites for each two month period.
For each list of websites/time period, we checked the Internet Archive to see if a website snapshots was available within its respective period. If a snapshot was available, we used a port of the Arc90 Readability Algorithm to extract the main article text from the Archive snapshot.
All scraped text was run through a data cleaning process, fixing extraneous white space and other punctuation artifacts, removing most non-article text such as citations and footnotes and ensuring that all text was sufficiently long enough to be run through the Originality.AI detector. Text from websites that were not article based, like Youtube and Reddit, was removed.
The text was then run through the Originality.AI detector, and its score was recorded. As the originality score represents the detector’s confidence that a text contains AI from 0 to 1, we consider a score of 0.5 or above to be a positive AI detection.
Image: Text from an Archive snapshot gets scraped and run through the AI detector
Two things are clear: AI content is becoming more present and Google is increasing its efforts to keep unhelpful AI content at bay. Using tools like Originality.ai can help ensure that the content on your website is original, helpful, and plays well within Google’s rules for search engine results.
AI In Reddit Writing Communities
Does Google Penalize AI Content?
How Does AI Content Detection Work?
Ilia Shumailov et al. (2023). The Curse of Recursion: Training on Generated Data Makes Models Forget. Retrieved from arXiv:2305.17493
Save up to 23% on our Pro and Enterprise subscriptions
See Our Pricing