


545%! DeepSeek discloses its cost-profit margin for the first time. Expert: in the United States, it would already be a company worth more than US$10 billion
Mar 12, 2025, 01:30 PM
DeepSeek, a Chinese AI startup, has been open-sourcing a series of projects recently. Last Saturday (March 1) brought a bigger surprise: it gave a full overview of the DeepSeek-V3/R1 inference system, disclosing not only the system's core optimizations but also, for the first time, key figures such as its cost-profit margin, which sent shockwaves through the industry.
DeepSeek published its first article on the Zhihu platform last Saturday, detailing the costs and profits of model inference. If all tokens are billed at DeepSeek-R1 pricing, the theoretical total revenue is about US$560,000 per day and the cost-profit margin is 545%. The figure sets a new profit ceiling for large AI models worldwide.
According to DeepSeek's official disclosure, all DeepSeek-V3 and R1 services run on H800 GPUs at the same precision used in training: matrix multiplication and dispatch transmission use the FP8 format, while core attention computation and combine transmission use BF16, preserving service quality as far as possible.
Over a 24-hour statistical window (12:00 on February 27, 2025 to 12:00 on February 28), assuming a GPU rental cost of US$2 per GPU-hour, the average daily cost is US$87,072. If all input and output tokens are billed at R1 prices (input 1 yuan per million tokens, output 16 yuan per million tokens), daily revenue would reach US$562,027 (about NT$18.65 million), for a cost-profit margin of 545%.
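The arithmetic behind these figures can be checked in a few lines. The sketch below assumes that "cost-profit margin" means (revenue − cost) / cost, and takes the daily revenue as US$562,027, the value from DeepSeek's disclosure that is consistent with a 545% margin. The function names are illustrative only.

```python
def cost_profit_margin(daily_revenue_usd: float, daily_cost_usd: float) -> float:
    """Return (revenue - cost) / cost as a fraction (assumed definition)."""
    return (daily_revenue_usd - daily_cost_usd) / daily_cost_usd


def implied_gpu_count(daily_cost_usd: float, usd_per_gpu_hour: float,
                      hours: float = 24.0) -> float:
    """Average number of GPUs implied by a daily rental bill."""
    return daily_cost_usd / (usd_per_gpu_hour * hours)


DAILY_COST_USD = 87_072       # from the disclosure
DAILY_REVENUE_USD = 562_027   # theoretical revenue at R1 pricing
USD_PER_GPU_HOUR = 2.0        # assumed rental price from the disclosure

margin = cost_profit_margin(DAILY_REVENUE_USD, DAILY_COST_USD)
gpus = implied_gpu_count(DAILY_COST_USD, USD_PER_GPU_HOUR)

print(f"cost-profit margin: {margin:.0%}")   # cost-profit margin: 545%
print(f"implied average GPUs: {gpus:.0f}")   # implied average GPUs: 1814
```

In other words, the daily bill of US$87,072 corresponds to an average of about 1,814 H800 GPUs rented around the clock at US$2 per hour.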
After seeing these figures, Menlo Ventures investor Deedy commented that a business with a profit margin above 500% would, in the United States, be a company worth more than US$10 billion.
Yuan Jinhui, founder of the Chinese company SiliconFlow, also reacted immediately: "DeepSeek's official disclosure of the costs and returns of large-scale deployment has once again overturned many people's assumptions."
DeepSeek's high margin comes from the innovative design of its inference system, which rests on three technical pillars: large-scale cross-node expert parallelism (EP), computation-communication overlap, and load balancing. EP improves both throughput and response speed. Because the model is sparse (only 8 of 256 experts are activated per layer), the EP strategy enlarges the overall batch size so that each expert receives enough work, which significantly improves GPU utilization. The deployment unit is sized per stage (for example, 4 nodes for the Prefill stage and 18 nodes for the Decode stage) to balance resource allocation against workload.
In short, EP works like a team collaboration: the model's "experts" are spread across many GPUs, which greatly increases the batch size and squeezes more out of each GPU, while spreading the experts out also reduces memory pressure per GPU and speeds up responses.
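To see why sparsity pushes toward larger batches, the following minimal sketch estimates how many tokens each expert receives per layer. It assumes tokens are routed to 8 of 256 experts uniformly at random, which is a simplification (a real router is learned and the load is not uniform); all function names are hypothetical.

```python
import random
from collections import Counter

NUM_EXPERTS = 256  # experts per layer, from the article
TOP_K = 8          # experts activated per token, from the article


def avg_tokens_per_expert(batch_tokens: int,
                          num_experts: int = NUM_EXPERTS,
                          top_k: int = TOP_K) -> float:
    """Average number of tokens an expert sees for one batch."""
    return batch_tokens * top_k / num_experts


def simulate_routing(batch_tokens: int, seed: int = 0) -> Counter:
    """Route each token to TOP_K distinct experts chosen uniformly at random
    (an assumption for illustration, not the model's real router)."""
    rng = random.Random(seed)
    load = Counter()
    for _ in range(batch_tokens):
        load.update(rng.sample(range(NUM_EXPERTS), TOP_K))
    return load


for batch in (32, 1024, 8192):
    load = simulate_routing(batch)
    idle = NUM_EXPERTS - len(load)  # experts that received no token
    print(f"batch={batch:5d}  avg/expert={avg_tokens_per_expert(batch):6.1f}  "
          f"idle experts={idle}")
```

With a batch of 32 tokens each expert sees one token on average and many sit idle, while a batch of 1,024 tokens gives each expert about 32 tokens. Pooling requests across many GPUs is one way to reach batch sizes of that order.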
DeepSeek cuts costs further at the engineering level. Resources are allocated by time of day: during daytime peaks all nodes serve inference, and at night idle nodes are reassigned to research and training, maximizing hardware utilization. An on-disk KV cache reduces repeated computation: of all input tokens, 342 billion (56.3%) hit the cache directly, which greatly reduces compute consumption.
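The idea of a cache hit rate over input tokens can be illustrated with a toy prefix cache. This is only a sketch under stated assumptions: the article does not describe how DeepSeek's on-disk KV cache is implemented, so the in-memory `PrefixCache` class, its block size, and its method names are all hypothetical. It counts how many leading tokens of a request match a previously seen prefix, in whole blocks.

```python
class PrefixCache:
    """Toy prefix cache: remembers block-aligned token prefixes it has seen."""

    def __init__(self, block_size: int = 4) -> None:
        self.block_size = block_size
        self.seen = set()       # block-aligned prefixes, stored as tuples
        self.hit_tokens = 0     # input tokens served from the cache
        self.total_tokens = 0   # all input tokens processed

    def process(self, tokens: list) -> int:
        """Record a request and return how many leading tokens were cached."""
        hits = 0
        full = len(tokens) - len(tokens) % self.block_size
        for end in range(self.block_size, full + 1, self.block_size):
            key = tuple(tokens[:end])
            if key in self.seen:
                hits = end          # this whole prefix can be reused
            else:
                self.seen.add(key)  # would be computed, then cached
        self.hit_tokens += hits
        self.total_tokens += len(tokens)
        return hits

    @property
    def hit_rate(self) -> float:
        return self.hit_tokens / self.total_tokens if self.total_tokens else 0.0


cache = PrefixCache(block_size=4)
shared_prompt = list(range(8))  # e.g. a shared system prompt
first = cache.process(shared_prompt + [100, 101, 102, 103])
second = cache.process(shared_prompt + [200, 201, 202, 203])
print(first, second, round(cache.hit_rate, 3))  # 0 8 0.333
```

The second request reuses the 8 tokens it shares with the first, so 8 of 24 input tokens are cache hits. A production system would store the attention keys and values for those prefixes rather than just remembering that they occurred.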
Some analysts say the data disclosed by DeepSeek not only validate the commercial viability of its technical approach but also set a benchmark for efficient profitability in the industry. Its model training cost is only 1% to 5% of that of comparable products: the previously released DeepSeek-V3 cost only US$5.576 million to train, far less than models from giants such as OpenAI. On inference pricing, DeepSeek-R1's API costs only about one-seventh to one-half as much as OpenAI's o3-mini, and this low-cost strategy is accelerating market penetration.
Other analysts pointed out that DeepSeek's transparent disclosure not only demonstrates its technical strength and business potential but also sends a clear signal to the industry: profitability for large AI models has moved from aspiration to reality, marking a key turning point as AI technology moves from the laboratory to industrial use.
However, DeepSeek itself acknowledged that its actual revenue is considerably lower, because V3 is priced below R1, paid services account for only part of the usage, and discounts apply at night.
CITIC Securities believes that DeepSeek's best practices in cutting model training costs should encourage technology giants to explore and research frontier models in more economical ways, while also unlocking and enabling a large number of AI applications. Increasing returns to scale from algorithm training, together with the Jevons paradox triggered by lower unit costs of computing power, both suggest that in the short to medium term technology giants will keep investing in AI computing power, and that large-scale investment remains a high-certainty event.

