Try the Most Accurate AI Detector on the Market
Our patented AI checker is the most accurate detector on the market! Don't believe us? Try it for yourself!
Try for FREE Here!
AI Studies

Is Qwen2.5-Turbo Content Detectable?

Using our proprietary Originality.ai AI detection tool, we analyzed Alibaba’s Qwen2.5-Turbo AI model to find out if it is detectable. These are our findings.

In November 2024, Alibaba developed a model, Qwen2.5-Turbo, capable of handling longer contexts efficiently and affordably while maintaining high performance across various use cases.

As AI-generated content becomes more advanced, the question arises — can we still detect it? 

This study examines 1,000 samples from Qwen2.5-Turbo to assess their detectability using our Turbo and Lite AI content detectors, alongside GPTZero. 

Is Qwen2.5-Turbo AI Content Detectable?

  1. Yes — Qwen2.5-Turbo text is detectable by Originality.ai with 99.3% accuracy using our 3.0.1 Turbo model and 97.7% accuracy with our Lite 1.0.0 model. 
  2. Turbo 3.0.1 and Lite 1.0.0, developed by Originality.ai, outperformed GPTZero, which achieved 93.4% accuracy in detecting content generated by the Qwen2.5-Turbo model.

Try our AI Detector here.

Then, read our study on Qwen2.5-Max.

Dataset

In order to evaluate the detectability of Qwen2.5-Turbo, we prepared a dataset of 1000 Qwen2.5-Turbo-generated text samples.

AI-Generated Text Data

For AI-text generation, we used Qwen2.5-Turbo based on three approaches given below:

  1. Rewrite prompts: Generating the content by providing the model with a customized prompt along with some articles (probably generated by LLMs) as a reference to rewrite from. (450 Samples)
  2. Rewrite human-written text: Generating the content considering the provided prompt to bypass the AI Detection tool by rewriting the human-written text which we fetched from an open-source dataset (350 Samples)
    1. One-Class Learning for AI-Generated Essay Detection
      1. Paper: https://www.mdpi.com/2076-3417/13/13/7901
      2. Dataset: https://github.com/rcorizzo/one-class-essay-detection
  3. Write articles from scratch: Generating the articles from scratch based on the given topics ranging from fictional and non-fictional diverse domains such as history, medicine, mental health, content marketing, social media, literature, robots, future etc. (200 Samples)

Evaluation

To evaluate the efficacy we used the Open Source AI Detection Efficacy tool that we have released:

Originality.ai has two models namely Model 3.0.1 Turbo and Model 1.0.0 Lite for the purpose of AI Text Detection.

  • Use Version 3.0.1 Turbo - If your risk tolerance for AI is ZERO! It is designed to identify any use of AI even light AI
  • Version 1.0.0 Lite - If you are okay with slight use of AI (i.e. AI editing)

The open-source testing tool returns a variety of metrics for each detector you test, each of which reports on a different aspect of that detector’s performance, including:

  • Sensitivity (True Positive Rate): The percentage of the time the detector identifies AI correctly.
  • Specificity (True Negative Rate): The percentage of the time the detector identifies human-written content correctly.
  • Accuracy: The percentage of the detector’s predictions that were correct.
  • F1: The harmonic mean of Specificity and Precision, often used as an agglomerating metric when ranking the performance of multiple detectors.

If you'd like a detailed discussion of these metrics, what they mean, how they're calculated, and why we chose them, check out our blog post on AI detector evaluation. For a succinct snapshot, though, we think the confusion matrix is an excellent representation of a model's performance.

Below is an evaluation of both the models on the above dataset. 

Confusion Matrix:

Confusion Matrix:
Figure 1. Confusion Matrix on AI only dataset with Originality.ai Model 1.0.0 Lite
Confusion Matrix:
Figure 2. Confusion Matrix on AI only dataset with Originality.ai Model 3.0.1 Turbo
Confusion Matrix:
Figure 3. Confusion Matrix on AI only dataset with GPTZero

Evaluation Metrics:

For this smaller test to be able to identify the ability of Originality.ai’s AI detector to identify Qwen2.5-Turbo content we look at True Positive Rate or the % of the time that the model correctly identified AI text as AI out of a 1000 Qwen2.5-Turbo content. 

Model 1.0.0 Lite:

  • Recall (True Positive Rate) = 97.7%

Model 3.0.1 Turbo:

  • Recall (True Positive Rate) =  99.3%

GPTZero:

  • Recall (True Positive Rate) =  93.4%

Conclusion

Our study confirms that Qwen2.5-Turbo AI-generated text is highly detectable using our AI content detectors

Model 3.0.1 Turbo achieved an outstanding 99.3% recall, while Model 1.0.0 Lite followed closely with 97.7% recall — both Originality.ai AI detection models surpassed GPTZero (93.4%). 

These results demonstrate the superior accuracy and reliability of our AI detection models in identifying AI-generated content, reinforcing their effectiveness as industry-leading tools for AI content detection.

To learn more about AI detection and its efficacy read our AI detection accuracy study and a meta-analysis of AI detection studies conducted by third parties.

Jonathan Gillham

Jonathan Gillham

Founder / CEO of Originality.ai I have been involved in the SEO and Content Marketing world for over a decade. My career started with a portfolio of content sites, recently I sold 2 content marketing agencies and I am the Co-Founder of MotionInvest.com, the leading place to buy and sell content websites. Through these experiences I understand what web publishers need when it comes to verifying content is original. I am not For or Against AI content, I think it has a place in everyones content strategy. However, I believe you as the publisher should be the one making the decision on when to use AI content. Our Originality checking tool has been built with serious web publishers in mind!

More From The Blog

Al Content Detector & Plagiarism Checker for Marketers and Writers

Use our leading tools to ensure you can hit publish with integrity!