DeepSeek released an updated version of its DeepSeek-V3 model on Watch Sukeban Deka: The Movie OnlineMarch 24. The new version, DeepSeek-V3-0324, has 685 billion parameters, a slight increase from the original V3 model’s 671 billion. The company has not yet released a system card for the updated model. DeepSeek has also changed the model’s open-source license to an MIT license, aligning it with the DeepSeek-R1 model.
The original DeepSeek-V3 gained worldwide attention for its cost-effectiveness. In multiple benchmark tests, it outperformed other open-source models such as Qwen2.5-72B and Llama-3.1-405B, while delivering performance comparable to top proprietary models like GPT-4o and Claude-3.5-Sonnet. DeepSeek investor High-Flyer Quant has emphasized in a published paper that the model was trained at exceptionally low costs. By optimizing algorithms, frameworks, and hardware, the total training cost of DeepSeek-V3 was just $5.576 million – assuming an H800 GPU rental price of $2 per GPU per hour. [Cailian, in Chinese]
(Editor: {typename type="name"/})
Pairing CPUs and GPUs: PC Upgrades and Bottlenecking
Everything Apple announced at the iPhone 14 event
Stephen King creates a Twitter play to brutally mock Donald Trump
Apple Watch Series 8 introduces new ovulation and fertility tracking
NYT Connections hints and answers for May 18: Tips to solve 'Connections' #707.
Pornhub traffic took a hit with Apple Watch Ultra reveal at iPhone 14 event
Mariah Carey continues to be an absolute icon with her 10
Best Apple Watch deal: Save $100 on the Apple Watch Series 7 at Walmart
Elon Musk says Mars ship could make first flights in 2019
This massive fried rice prank turned into a great Photoshop battle
Creator job opportunities grew 7x in recent years [April 2025]
Ryan Reynolds' birthday message to Betty White is as wonderful as you'd expect
接受PR>=1、BR>=1,流量相当,内容相关类链接。