Home > News > DeepSeek's $1.6B Development Cost Debunked

DeepSeek's $1.6B Development Cost Debunked

Mar 13,25(1 months ago)
DeepSeek's $1.6B Development Cost Debunked

DeepSeek's new chatbot boasts an impressive introduction: "Hi, I was created so you can ask anything and get an answer that might even surprise you." This AI, a product of the Chinese startup DeepSeek, has rapidly become a major market competitor, even contributing to a significant drop in NVIDIA's stock price. Its success stems from a unique architecture and training methodology incorporating several innovative technologies.

Multi-token Prediction (MTP): Unlike traditional word-by-word prediction, MTP forecasts multiple words simultaneously, analyzing different sentence parts for enhanced accuracy and efficiency.

Mixture of Experts (MoE): This architecture leverages multiple neural networks to process input data, accelerating AI training and improving performance. DeepSeek V3 utilizes 256 neural networks, activating eight for each token processing task.

Multi-head Latent Attention (MLA): This mechanism focuses on crucial sentence parts, repeatedly extracting key details from text fragments to minimize information loss and capture subtle nuances.

DeepSeek initially claimed to have trained its powerful DeepSeek V3 neural network for a mere $6 million using only 2048 GPUs. However, SemiAnalysis revealed a far more substantial infrastructure: approximately 50,000 Nvidia Hopper GPUs, including 10,000 H800s, 10,000 H100s, and additional H20 GPUs, distributed across multiple data centers. This translates to a server investment of roughly $1.6 billion and operational expenses estimated at $944 million.

DeepSeek, a subsidiary of the Chinese hedge fund High-Flyer, owns its data centers, granting complete control over AI model optimization and faster innovation implementation. This self-funded approach enhances flexibility and decision-making speed. Furthermore, the company attracts top talent, with some researchers earning over $1.3 million annually, primarily from Chinese universities.

While DeepSeek's initial $6 million training cost claim appears unrealistic—referring only to pre-training GPU usage and excluding research, refinement, data processing, and infrastructure—the company has invested over $500 million in AI development. Its compact structure facilitates efficient innovation implementation compared to larger, more bureaucratic competitors.

DeepSeek's example showcases a well-funded independent AI company successfully competing with industry giants. However, its success is undeniably linked to substantial investments, technical breakthroughs, and a strong team, making the "revolutionary budget" claim somewhat misleading. Nevertheless, the company’s costs remain significantly lower than competitors; for example, DeepSeek spent $5 million on R1, while ChatGPT4 cost $100 million. This cost difference, even considering DeepSeek's actual expenditure, highlights a significant competitive advantage.

DeepSeek TestDeepSeek V3DeepSeekDeepSeek

Discover
  • Wedding Hairstyles on photo
    Wedding Hairstyles on photo
    Discover your dream wedding hairstyle with the Wedding Hairstyles Photo Editor! Welcome to the world of Wedding Hairstyles, where you can transform your bridal photos into stunning masterpieces with just a few taps. Our app is a versatile bridal photo editor that lets you add a variety of beautiful
  • Whack Whack War
    Whack Whack War
    Get ready for an exhilarating new adventure with **Whack Whack War**, a game that's not only wildly addictive but also incredibly easy to dive into with its adorable graphics and intuitive one-tap controls. Step into the thrilling arena, where you'll take command of your hero and embark on a mission
  • Army Bomb Games 3D Nuclear War
    Army Bomb Games 3D Nuclear War
    Nuclear Bomb Simulator and Bomb Defuse 3D: Bomb Blast & Nuclear Bomb Games War. Let's enjoy Bomb Defusing Nuclear Bomb Games 3D Offline Multiplayer, introduced with a Bomb Defuse Squad in an amazing Nuke Bomb Games. Download the Bomb Defuse Game and be careful about the attack of the Atomic Bomb to
  • One Lab - Artful Photo Editor
    One Lab - Artful Photo Editor
    Unleash your creativity with OneLab - Artful Photo Editor, a revolutionary app that offers a wealth of graphic possibilities at your fingertips. From simple photo editing to mind-bending glitch art, image distortions, procedural generation, and 3D manipulation, this app is a treasure trove for artis
  • LEGO DUPLO WORLD
    LEGO DUPLO WORLD
    LEGO DUPLO WORLD is not just a regular game; it is an engaging and educational platform designed specifically for children. With a vast world to explore filled with colorful animals, buildings, vehicles, and trains made out of LEGO pieces, kids are in for an interactive and stimulating experience. T
  • Doppelgangers - find your twin
    Doppelgangers - find your twin
    Unleash the fun of finding your perfect lookalike with our Doppelgangers - find your twin app! Begin your journey by downloading the app and signing in effortlessly with your preferred method. Once you're in, snap a clear selfie, making sure it's all about you—no distractions needed. Our cutting-edg