Home News The affordability of DeepSeek is a myth: The revolutionary AI actually cost $1.6 billion to develop

The affordability of DeepSeek is a myth: The revolutionary AI actually cost $1.6 billion to develop

by Isaac Mar 16,2025

DeepSeek's new chatbot boasts an impressive introduction: "Hi, I was created so you can ask anything and get an answer that might even surprise you." This AI, a product of the Chinese startup DeepSeek, has quickly become a major player, even causing significant drops in NVIDIA's stock price.

DeepSeek TestImage: ensigame.com

DeepSeek's success stems from its innovative architecture and training methods. Key technologies include:

  • Multi-token Prediction (MTP): Instead of predicting words one by one, MTP forecasts multiple words simultaneously, boosting accuracy and efficiency.
  • Mixture of Experts (MoE): This architecture utilizes 256 neural networks in DeepSeek V3, activating eight for each token processing task, significantly accelerating training and improving performance.
  • Multi-head Latent Attention (MLA): MLA repeatedly extracts key details from text fragments, ensuring crucial information isn't missed, leading to a more nuanced understanding of input data.
DeepSeek V3Image: ensigame.com

While DeepSeek initially claimed a remarkably low training cost of $6 million for DeepSeek V3 using 2048 GPUs, SemiAnalysis revealed a far more substantial infrastructure: approximately 50,000 Nvidia Hopper GPUs (including 10,000 H800s, 10,000 H100s, and additional H20s) spread across multiple data centers. This represents a total server investment of roughly $1.6 billion, with operational expenses estimated at $944 million.

DeepSeek, a subsidiary of the Chinese hedge fund High-Flyer, owns its data centers, granting unparalleled control over optimization and innovation implementation. This self-funded approach enhances flexibility and decision-making speed. Furthermore, the company attracts top talent, with some researchers earning over $1.3 million annually, primarily recruiting from leading Chinese universities.

DeepSeekImage: ensigame.com

DeepSeek's $6 million training cost claim is misleading; it only reflects pre-training GPU usage, excluding research, refinement, data processing, and infrastructure. The company's actual investment in AI development exceeds $500 million. However, its lean structure allows for efficient innovation implementation compared to larger, more bureaucratic organizations.

DeepSeekImage: ensigame.com

DeepSeek's story demonstrates a well-funded independent AI company's ability to compete with giants. Its success, however, is undeniably linked to billions in investment, technological breakthroughs, and a strong team. The "revolutionary budget" narrative is a significant oversimplification. Nevertheless, DeepSeek's costs remain significantly lower than competitors. For example, DeepSeek spent $5 million on R1, while ChatGPT4 cost $100 million.