AI Writing

Breaking Barriers: Originality.AI's Multilanguage Release Supports 15 Languages

Originality.AI is thrilled to introduce a step towards a more inclusive platform with the launch of our Multilanguage Release. This release supports 15 languages in total. The decision for this release aims to address some of the existing gaps that have stood between diverse languages and robust AI Detection Technology.

Originality.AI is thrilled to introduce a step towards a more inclusive platform with the launch of our Multilanguage Release. This release supports 15 languages in total. The decision for this release aims to address some of the existing gaps that have stood between diverse languages and robust AI Detection Technology.

In this blog post we dive into how our Multilanguage Release was built, how we tested it and what future work we will be doing to enhance this feature. 

Key Takeaways: 

  • Our first version of the multilingual AI detector has surpassed 90% accuracy across all languages
  • Accuracy is around 95% and false positive is approximately 4.5%
  • Future work will include expanding our dataset as well as exploring different deep learning structures for better outcomes 
  • Our core English language Accuracy still outperforms the current multilingual model

  1. The Dataset

The foundation of our Multilingual AI Detector lies within its extensive dataset which boasts millions of data samples that were curated for our analysis. The dataset is split into two categories. 

1. Content penned by humans 

2. Content generated by some of the most prominent AI language models, including text-davinci (GPT3), GPT 3.5-turbo, chat-GPT, and the state-of-the-art GPT-4. 

Spanning fifteen different languages, this dataset presented a unique challenge in multilingual AI detection.

2. Experiments: Evaluating Performance

In order to conduct a thorough evaluation of the Multilingual AI Detector’s performance, we did some experiments using a benchmark dataset of up to millions of samples. The results represented below in the confusion matrix represent reliable scores

  • True Positive - AI detector correctly identified AI content as AI
  • False Negative - AI detector incorrectly identified AI content as Human
  • False Positive - AI detector incorrectly identified Human content as AI
  • True Negative - AI detector correctly identified Human content as Human

3. Results Per Language

The performance of the Multilingual AI Detector performs very well across all languages, consistently surpassing a 90% accuracy rate.  Below is a graph of the accuracy achieved for each language:

4. Conclusion: A Win For Languages 

After lots of experimentation and testing,our Multilingual AI Detector - Version 1.0 emerges with its accuracy consistently above 90%. The accuracy rate is approximately 95%, while the false positive rate hovers around 4.5%. There is still room for improvement, but we believe this is a step in the right direction, bringing together different languages, and smart detection methods. 

5. How to Access the Model 

With no change to the existing Originality.AI workflow and no action required by the user, the Multilanguage AI Detection model automatically detects if the English language is not being used and will toggle automatically to the Multilinguage Model to deliver a seamless user experience. 

6. A Roadmap Ahead

Our development plans don’t stop here for our Multilingual AI Detector. Here's what's planned ahead:

  1. We will work to make GPT-3 Detection Better: We'll add more data samples to improve how well we can spot AI content from GPT-3.
  2. We will try new approaches:  By adapting and experimenting with alternative models, we can uncover new insights and achieve improved results. 
  3. We will explore improving Prompt Engineer Technique: We'll work more on the tricky Prompt Engineer technique to get even better at detecting AI Generated content
  4. We’ll work to get More efficient: We'll figure out how to make the detector work faster and smoother using special techniques.

We hope you enjoy our latest release!

Jonathan Gillham

Founder / CEO of Originality.AI I have been involved in the SEO and Content Marketing world for over a decade. My career started with a portfolio of content sites, recently I sold 2 content marketing agencies and I am the Co-Founder of MotionInvest.com, the leading place to buy and sell content websites. Through these experiences I understand what web publishers need when it comes to verifying content is original. I am not For or Against AI content, I think it has a place in everyones content strategy. However, I believe you as the publisher should be the one making the decision on when to use AI content. Our Originality checking tool has been built with serious web publishers in mind!

More From The Blog

AI Content Detector & Plagiarism Checker for Serious Content Publishers

Improve your content quality by accurately detecting duplicate content and artificially generated text.