In November 2023, the AI world, and the tech world more broadly, was consumed by news of an attempted coup staged by the board of directors of OpenAI (the organization behind ChatGPT). Ultimately, that coup failed to result in executive reform. Still, it provided the perfect opportunity for one of OpenAI’s biggest competitors, Anthropic, to draw attention to its own LLM (large language model). On Tuesday, November 21, 2023, Anthropic announced major updates to its AI model, Claude. Considering the enthusiastic market response, we will look at the many developing trends and statistics behind Claude AI.
Claude is an AI (artificial intelligence) assistant developed by Anthropic using its research into HHH (Helpful, Honest, Harmless) AI applications. The assistant can be accessed through a chatbot at www.claude.ai or through an API (Application Programming Interface) in Anthropic’s developer console. Below, we will explore the interesting data behind Claude AI, including development trends, performance metrics, applications, and user statistics.
Since its inception, Claude has gone through a few upgrades and different iterations:
Anthropic has also released a streamlined and faster model with limited capabilities, called Claude Instant. This model has followed a similar development cycle, as shown below:
As mentioned earlier, within the past month, Anthropic has released an upgraded version of its leading AI model, Claude 2.1. Having also made advancements in its Claude Instant model within the last quarter, Anthropic has made its mark as a technical leader in the AI space. Below we can see different stats detailing the capabilities achieved by Claude AI:
In developing the different versions of Claude, the Anthropic team extensively tests and measures the performance of their advancing models. Below we can see some recent highlights in Claude AI performance:
Claude scored 88.0% on GSM8k grade-school math problems, showcasing its computational ability (source)
Aside from coding performance, Anthropic sees truthfulness, harmlessness, and helpfulness as pillars of Claude’s success. The following benchmarks were recently gathered by the Anthropic team in the wake of Claude 2.1’s release:
Source: www.anthropic.com
In more traditional benchmarks, the performance of Claude has been monitored while completing arduous standardized exams historically taken by humans. The following statistics measure the progress and improvements made by the different versions of Claude.
Here’s a comparative overview of how Claude 1.3, Claude 2, Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku perform on standard exams (MBE - Law, GRE - Writing, HumanEval - Python, and GSM8k - Math):
Note: The GRE writing test is scored from 0 to 6.0. The above graphic represents the available data on GRE writing scores as a percentage of the maximum score of 6.0.
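The conversion described in the note can be sketched in a few lines of Python. This is a minimal illustration only: the function name `gre_writing_to_percent` is hypothetical, and the sample scores are placeholders rather than Claude’s actual results.

```python
# Maximum GRE writing score, as stated in the note above.
GRE_WRITING_MAX = 6.0


def gre_writing_to_percent(score: float) -> float:
    """Express a GRE writing score (0 to 6.0) as a percentage of the maximum."""
    if not 0.0 <= score <= GRE_WRITING_MAX:
        raise ValueError(f"GRE writing scores range from 0 to {GRE_WRITING_MAX}")
    return score / GRE_WRITING_MAX * 100.0


if __name__ == "__main__":
    # Placeholder scores for illustration only (not taken from the article).
    for sample in (3.0, 4.5, 6.0):
        print(f"{sample:.1f} / 6.0 = {gre_writing_to_percent(sample):.1f}%")
```

Expressing the score this way puts the GRE writing result on the same 0 to 100 scale as the other exams in the comparison.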
Overview highlights:
Currently, the Claude AI models are available in about 159 countries worldwide (https://www.anthropic.com/claude-ai-locations). If, unfortunately, you happen to reside outside those countries, it is still possible to use Claude through one of the methods below:
Over the course of 2023, Anthropic released its various Claude models to the public. Below are some quick notes on the current accessibility of the Claude LLM AI models:
New testing grounds have emerged with the recent developments of Anthropic’s Claude AI bot and the release of the Claude 3.5 Sonnet version.
In these tests, we’ll observe how effective the Claude model is at concealing AI content by scanning the AI-generated articles with Originality.ai’s professional AI detection tool. The tests will include an analysis of how Originality.ai performs at detecting content when prompts include requests to “humanize” content.
While Claude can be exceptionally useful for providing ideas and generating valuable suggestions, creating entire articles with AI should be avoided, as Google can penalize AI content that doesn’t comply with its spam policies.
For testing purposes, we will use the most recent version of Claude - Claude 3.5 Sonnet and the Originality.ai AI Checker. Now, let’s proceed with the first test and prompt Claude!
First, we’ll prompt Claude to generate a typical article without extra instructions (to create a baseline for comparison during future tests). Claude’s generative technology is similar to that of other chatbots. However, Anthropic has aimed to humanize Claude’s responses as much as possible.
Let’s begin with the first tests:
[Prompt #1] - Write a short article (500–1000 words) on the integration of artificial intelligence in 2024.
We’ve received a total of 693 words from Claude, covering the essentials of recent AI integration trends in 2024. Let’s check Originality.ai’s detection result:
Originality.ai detects the output from Claude as Likely AI with 100% Confidence.
Now, let’s attempt to humanize Claude 3.5 Sonnet’s output:
[Prompt #2] - Write a short article (500–1000 words) on the integration of artificial intelligence in 2024. Use a human tone and stick to the fluidity of a human conversation. Break up the text, include unique bullets, and implement numbered lists. Provide suggestions in first-person and try to use popular phrasings.
As a result of the second prompt, we’ve managed to extract an 863-word example from Claude. Let’s check Originality.ai’s detection result:
Even when Claude is prompted to ‘stick to the fluidity of a human conversation,’ Originality.ai continues to identify the content as AI-generated, with 99% Confidence that the output is Likely AI.
The verdict from this round of testing? Providing Claude with extra instructions to create a more human-like tone in the prompt does not have a significant impact on the detection outcome.
Let’s proceed with the more complex tests, where we provide Claude with a human-written example of an article to use for comparison when generating text.
The unique capabilities of AI chatbots allow them to learn on the go via unique suggestions and user prompts. Let’s see what impact providing Claude with a unique article example has on AI detection.
We have provided Claude with a technical-themed example. Let’s have a look at the first prompt:
[Prompt #1] - Write a short article (500–1000 words) on the integration of artificial intelligence in 2024. Use this *article* as an example. Stick to the tone and structure of the provided article.
Similar to the first test, we won’t mention specific instructions that prompt it to humanize the content. The first prompt has generated a 622-word piece. Here are the detection results:
For this prompt, Originality.ai again identifies the content as Likely AI, this time with 75% Confidence (learn more about AI detection scores). The sections that it detects as most likely AI-generated are highlighted in the deeper shades of red and orange.
Let’s move on with the second prompt and provide Claude with both an article example and instructions for content humanization:
[Prompt #2] - Write a short article (500–1000 words) on the integration of artificial intelligence in 2024. Use a human tone and stick to the fluidity of a human conversation. Break up the text, include unique bullets, and implement numbered lists. Provide suggestions in first-person and try to use popular phrasings. Use this *article* as an example. Stick to the tone and structure of the provided article.
Let’s check the 692 words we’ve received with Originality.ai’s detection technology:
For this prompt, the AI Checker determined that the output was Likely AI with 100% Confidence, continuing to demonstrate that the detector identifies Claude’s content as AI-generated.
Overall, Claude’s generated text was consistently identified as Likely AI by the Originality.ai AI detector. As new models are released, we’ll continue to evaluate the detectability of their text.
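To make the comparison across the four runs easier to see, the results reported above can be tabulated with a short Python sketch. The `DetectionRun` structure, the run labels, and the `summarize` helper are hypothetical names chosen for illustration; only the word counts and confidence scores come from the tests above, and nothing here calls Originality.ai itself.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class DetectionRun:
    label: str          # hypothetical short name for the prompt setup
    words: int          # word count of Claude's output, as reported above
    ai_confidence: int  # "Likely AI" confidence in percent, as reported above


# The four runs described in the tests above.
RUNS = [
    DetectionRun("baseline prompt", 693, 100),
    DetectionRun("humanizing prompt", 863, 99),
    DetectionRun("article example", 622, 75),
    DetectionRun("article example + humanizing prompt", 692, 100),
]


def summarize(runs: list[DetectionRun]) -> dict[str, float]:
    """Return the mean, lowest, and highest 'Likely AI' confidence across runs."""
    scores = [run.ai_confidence for run in runs]
    return {
        "mean_confidence": mean(scores),
        "min_confidence": min(scores),
        "max_confidence": max(scores),
    }


if __name__ == "__main__":
    for run in RUNS:
        print(f"{run.label:<40} {run.words:>4} words  {run.ai_confidence:>3}% Likely AI")
    print(summarize(RUNS))
```

Across these four runs, the lowest confidence (75%) came from the prompt that supplied only a human-written article example, and every run was still classified as Likely AI.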
Below is a pricing table comparing the costs of different Claude AI models as of May 2024 (source):
As of September 2024 (latest available data), the Claude AI website has garnered widespread attention, reaching (source):
Claude AI shows great marketing potential: currently, the site receives roughly 75.93% of its traffic through direct searches. The chart below shows how many users Claude AI reaches through other web traffic sources:
Similarly, this next graph illustrates the traffic driven to Claude AI by different social media channels (source):
As of May 2024 (latest available data), the most common Claude AI user can be described by the stats below (source):
(Source)
Geography (source):
Time will tell whether Anthropic’s strategic decision to unveil Claude 2.1 during Sam Altman’s skirmish with OpenAI’s board of directors will pay off. Resounding market praise and support have made it clear that Anthropic is positioning itself as a frontrunner and key player in the AI field. By focusing on the HHH (Helpful, Honest, Harmless) application of AI, Claude has found a niche in the market, as highlighted by the strength of the statistics listed above. With the continued improvements and advancements showcased by Anthropic and Claude AI, it is evident that AI as a field is at the onset of rapid transformation.