DeepSeek, a Chinese artificial intelligence lab, has made waves with the release of its new large language model, DeepSeek-V3. Launched in late December 2024, the model has emerged as a formidable competitor in the AI landscape, outperforming prominent models such as OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet in third-party benchmark tests. Remarkably, DeepSeek-V3 was reportedly trained in just two months at a cost of about $5.58 million, a feat of engineering efficiency compared with its Silicon Valley counterparts.
In benchmark tests, DeepSeek-V3 has also demonstrated an edge over other established models, including Meta's Llama 3.1 and Alibaba's Qwen2.5, excelling at tasks such as problem-solving, coding, and mathematics. The DeepSeek team managed to train this potent model using significantly fewer graphics processing units (GPUs) than its competitors. And although its training data remains undisclosed, DeepSeek-V3 is an "open-weight" model: its trained parameters are published, so anyone can download, inspect, and fine-tune it.
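To make "open-weight" concrete, here is a minimal sketch of how published weights can be loaded for inspection or fine-tuning with the Hugging Face transformers library. The hub ID below, and the practicality of loading the full checkpoint locally, are assumptions for illustration: the complete V3 model is far too large for most personal machines.

```python
# A minimal sketch of what "open-weight" means in practice: the trained
# parameters are published and can be downloaded, inspected, and fine-tuned.
# The hub ID below is assumed for illustration; the full DeepSeek-V3
# checkpoint requires substantial hardware to run.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-V3"  # assumed published checkpoint ID

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# With the weights in hand, anyone can generate text, probe the layers,
# or fine-tune on their own data, none of which requires access to the
# undisclosed original training corpus.
prompt = "Explain what an open-weight model is in one sentence."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```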
DeepSeek furthered these advances by releasing DeepSeek-R1 on January 20. The newer model has already surpassed OpenAI's o1, the latest model underlying ChatGPT, on several tests while being roughly 27 times cheaper to run. The accomplishment highlights DeepSeek's strategic efforts to navigate U.S. export controls that limit Chinese companies' access to state-of-the-art AI computing chips. Both DeepSeek-V3 and R1 employ "chain of thought" reasoning, generating intermediate steps before settling on a final answer, which lets them tackle more complex tasks with greater precision.
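As a rough illustration of the idea (not of how R1 is implemented internally), the toy function below works through a multi-step word problem by emitting each intermediate step before the final answer, which is what chain-of-thought reasoning asks a model to do.

```python
# A toy sketch of "chain of thought": instead of jumping straight to an
# answer, the solution is decomposed into explicit intermediate steps.
# This is an illustration of the concept only, not DeepSeek's code.
def solve_with_chain_of_thought(apples: int, eaten: int, friends: int) -> float:
    print(f"Step 1: start with {apples} apples.")
    remaining = apples - eaten
    print(f"Step 2: after eating {eaten}, {remaining} remain.")
    share = remaining / friends
    print(f"Step 3: split {remaining} among {friends} friends -> {share} each.")
    return share

# Each printed step can be checked on its own, which is why stepwise
# reasoning tends to be more precise on complex tasks.
solve_with_chain_of_thought(apples=10, eaten=2, friends=4)
```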
The development of these models also offers insight into how AI systems learn from human-generated training data: by estimating the probabilities of different patterns within their datasets, they produce outputs that mimic human-like reasoning. OpenAI's GPT-3.5, for example, was trained on approximately 570 GB of text from Common Crawl, equating to about 300 billion words. Where DeepSeek stands out is efficiency: its engineers report achieving comparable results with only about 2,000 Nvidia GPUs, whereas ChatGPT's underlying models reportedly required some 10,000 Nvidia GPUs to process their training data.
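The sketch below illustrates that pattern-probability idea at toy scale: it counts how often each word follows another in a tiny corpus, then samples continuations from those frequencies. Production models such as GPT-3.5 and DeepSeek-V3 do this with transformer networks over hundreds of billions of tokens, but the underlying principle of predicting the next token from learned probabilities is the same.

```python
import random
from collections import Counter, defaultdict

# A tiny corpus standing in for hundreds of billions of words.
corpus = ("the model predicts the next word and "
          "the model learns the patterns in the data").split()

# Bigram statistics: how often each word follows each other word.
follows: defaultdict[str, Counter] = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def next_word(prev: str) -> str:
    """Sample the next word in proportion to its observed frequency."""
    counts = follows[prev]
    if not counts:  # unseen context: fall back to a uniform draw
        return random.choice(corpus)
    words, weights = zip(*counts.items())
    return random.choices(words, weights=weights)[0]

# Generate a short continuation, one probabilistic step at a time.
word = "the"
generated = [word]
for _ in range(6):
    word = next_word(word)
    generated.append(word)
print(" ".join(generated))
```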
The AI community is now urged to pay close attention to these technological strides emerging from China. Microsoft CEO Satya Nadella emphasized the importance of this development:
"We should take the developments out of China very, very seriously," – Satya Nadella, the CEO of Microsoft.