Nvidia rival claims DeepSeek world record as it delivers industry-first performance with 95% fewer chips
By Wayne Williams, published 20 February 2025
SambaNova recorded 198 tokens per second using just 16 custom-built chips

- SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
- The SN40L RDU chip is reportedly 3X faster, 5X more efficient than GPUs
- 5X speed boost is promised soon, with 100X capacity by year-end on cloud
Chinese AI upstart DeepSeek has quickly made a name for itself in 2025 with R1, its large-scale open source language model built for advanced reasoning tasks, which performs on par with the industry’s top models while being more cost-efficient.
SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.
The company says it has achieved 198 tokens per second, per user, using just 16 custom-built chips in place of the 40 racks (320 Nvidia GPUs in total) that would typically be required.
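The headline's "95% fewer chips" figure can be checked with a quick calculation. This is a minimal sketch that uses only the chip counts reported here; the function and variable names are illustrative, and it assumes the 320 GPUs are the total across all 40 racks.

```python
# Chip counts as reported in the article.
GPU_COUNT = 320   # Nvidia GPUs, assumed to be the total across 40 racks
GPU_RACKS = 40
RDU_COUNT = 16    # SambaNova SN40L RDU chips


def percent_reduction(before: int, after: int) -> float:
    """Return the percentage decrease going from `before` to `after`."""
    return (before - after) * 100 / before


reduction = percent_reduction(GPU_COUNT, RDU_COUNT)
gpus_per_rack = GPU_COUNT / GPU_RACKS

print(f"{reduction:.0f}% fewer chips")
print(f"{gpus_per_rack:.0f} GPUs per rack in the GPU setup")
```

Under that assumption, the GPU configuration works out to eight GPUs per rack, which is consistent with the 95% figure in the headline.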
Independently verified
“Powered by the SN40L RDU chip, SambaNova is the fastest platform running DeepSeek,” said Rodrigo Liang, CEO and co-founder of SambaNova. “This will increase to 5X faster than the latest GPU speed on a single rack – and by year-end, we will offer 100X capacity for DeepSeek-R1.”
While Nvidia’s GPUs have traditionally powered large AI workloads, SambaNova argues that its reconfigurable dataflow architecture offers a more efficient solution. The company claims its hardware delivers three times the speed and five times the efficiency of leading GPUs while maintaining the full reasoning power of DeepSeek-R1.
“DeepSeek-R1 is one of the most advanced frontier AI models available, but its full potential has been limited by the inefficiency of GPUs,” said Liang. “That changes today. We’re bringing the next major breakthrough – collapsing inference costs and reducing hardware requirements from 40 racks to just one – to offer DeepSeek-R1 at the fastest speeds, efficiently.”
George Cameron, co-founder of AI evaluation firm Artificial Analysis, said his company had “independently benchmarked SambaNova’s cloud deployment of the full 671 billion parameter DeepSeek-R1 Mixture of Experts model at over 195 output tokens/s, the fastest output speed we have ever measured for DeepSeek-R1. High output speeds are particularly important for reasoning models, as these models use reasoning output tokens to improve the quality of their responses. SambaNova’s high output speeds will support the use of reasoning models in latency-sensitive use cases.”
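To see why output speed matters for reasoning models, consider how long a response takes to generate at the benchmarked rate. The sketch below is a rough illustration only: the 195 tokens/s figure is the lower bound quoted above, while the response lengths are hypothetical values chosen for the example, not measurements. It also ignores time to first token and network overhead.

```python
def generation_time_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Time to stream `output_tokens` at a steady output speed."""
    return output_tokens / tokens_per_second


# Lower bound reported by Artificial Analysis ("over 195 output tokens/s").
MEASURED_SPEED = 195.0

# Hypothetical response lengths (reasoning tokens plus final answer).
for tokens in (1_000, 5_000, 10_000):
    seconds = generation_time_seconds(tokens, MEASURED_SPEED)
    print(f"{tokens:>6} output tokens: about {seconds:.1f} s")
```

Because generation time scales linearly with token count, a model that emits long chains of reasoning tokens gains proportionally more from each increase in output speed.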
DeepSeek-R1 671B is now available on SambaNova Cloud, with API access offered to select users. The company is scaling capacity rapidly, and says it hopes to reach 20,000 tokens per second of total rack throughput “in the near future”.
Wayne Williams is a freelancer writing news for TechRadar Pro. He has been writing about computers, technology, and the web for 30 years. In that time he wrote for most of the UK’s PC magazines, and launched, edited and published a number of them too.