Anthropic has launched Claude 3.5 Sonnet, its latest AI model and the first release in the upcoming Claude 3.5 model series. Anthropic claims that the new model outperforms competitors such as OpenAI’s GPT-4o, Google’s Gemini-1.5 Pro, and Meta’s Llama-400b, as well as its own earlier models, Claude 3 Haiku and Claude 3 Opus.
So, we put the Originality.ai AI detector to the test to determine its accuracy in detecting AI-generated text created by Claude 3.5 Sonnet.
To establish the AI Checker’s accuracy, this brief study generated 1000 text samples with Claude 3.5 Sonnet and then ran them through the Originality.ai AI checker.
Yes — Claude 3.5 Sonnet text is detectable with 98.5% accuracy for the Originality.ai Model 2.0 Standard and 99.3% accuracy for our 3.0 Turbo model.
Try our AI Detector and learn about the newly released Version 2.0.1 Standard (BETA) and Lite 1.0.0.
To evaluate the detectability of Claude 3.5 Sonnet, we prepared a dataset of 1000 Claude 3.5 Sonnet generated text samples.
For AI-text generation, we used Claude 3.5 Sonnet based on three approaches given below:
To evaluate the efficacy, we used our Open Source AI Detection Efficacy tool:
Originality.ai offers two models for AI text detection: Model 3.0 Turbo and Model 2.0 Standard.
The open-source testing tool returns a variety of metrics for each detector tested, each of which reports on a different aspect of that detector’s performance, including:
If you'd like a detailed discussion of these metrics, what they mean, how they're calculated, and why we chose them, check out our blog post on AI detector evaluation. For a succinct snapshot, the confusion matrix is an excellent representation of a model's performance.
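As a rough illustration of what a confusion matrix summarizes, here is a minimal Python sketch that tallies the four cells from a list of true labels and a list of detector predictions. The function name, the label strings, and the toy data are assumptions made for this example; they are not taken from the open-source efficacy tool or from the study's dataset.

```python
from collections import Counter


def confusion_matrix(labels, predictions, positive="ai"):
    """Count TP, FN, FP and TN, treating `positive` as the AI class."""
    counts = Counter()
    for actual, predicted in zip(labels, predictions):
        if actual == positive:
            # AI text: either caught (TP) or missed (FN).
            counts["tp" if predicted == positive else "fn"] += 1
        else:
            # Human text: either wrongly flagged (FP) or correctly passed (TN).
            counts["fp" if predicted == positive else "tn"] += 1
    return {cell: counts[cell] for cell in ("tp", "fn", "fp", "tn")}


# Toy data invented for illustration only (not from the study).
labels = ["ai", "ai", "ai", "human", "human"]
predictions = ["ai", "ai", "human", "human", "ai"]

matrix = confusion_matrix(labels, predictions)
print(matrix)
```

Each metric a detector is scored on can be derived from these four counts, which is why the matrix works well as a compact snapshot.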
Below is an evaluation of both the models on the above dataset.
Because this small test is meant to reflect the Originality.ai detector’s ability to identify Claude 3.5 Sonnet content, we looked at the True Positive Rate: the percentage of the 1000 Claude 3.5 Sonnet samples that the model correctly identified as AI.
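The arithmetic behind the True Positive Rate can be sketched in a few lines of Python. The flagged counts below are not published figures; they are assumptions back-calculated from the reported rates (98.5% and 99.3% of 1000 samples), and the function and variable names are hypothetical.

```python
def true_positive_rate(true_positives, false_negatives):
    """Share of AI-written samples that the detector flagged as AI."""
    total_ai_samples = true_positives + false_negatives
    if total_ai_samples == 0:
        raise ValueError("the dataset contains no AI-written samples")
    return true_positives / total_ai_samples


SAMPLES = 1000  # every sample in this study is AI-generated

# Assumed counts, implied by the reported rates rather than published directly.
flagged_standard = 985  # Model 2.0 Standard, 98.5%
flagged_turbo = 993     # Model 3.0 Turbo, 99.3%

tpr_standard = true_positive_rate(flagged_standard, SAMPLES - flagged_standard)
tpr_turbo = true_positive_rate(flagged_turbo, SAMPLES - flagged_turbo)

print(f"Model 2.0 Standard: {tpr_standard:.1%}")
print(f"Model 3.0 Turbo:    {tpr_turbo:.1%}")
```

Since the dataset contains only AI-generated text, there are no true negatives or false positives, so the True Positive Rate and overall accuracy are the same number here.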
Model 2.0 Standard:
Model 3.0 Turbo:
Our study confirms that text generated by Claude 3.5 Sonnet is highly detectable with our AI detector. Model 2.0 Standard scored 98.5% accuracy, while Model 3.0 Turbo reached 99.3%. These results demonstrate the effectiveness of our AI detectors in identifying AI-generated content across the text generation approaches tested. The Claude 3.5 Sonnet study was conducted with Version 2.0.0 Standard and 3.0 Turbo. In July 2024 we're releasing Version 2.0.1 Standard (BETA) and Lite 1.0.0. Learn more here.