Deepseek Architecture

Tech Times

DeepSeek V4 Architecture: How Sparse Attention Cuts Inference Costs, What NIST Found

DeepSeek V4 architecture uses sparse attention to cut inference costs 73% at one-million-token contexts, but a NIST ...

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

3don MSN

DeepSeek's DSpark just made Nvidia's most important new bet harder to close

DeepSeek just released DSpark, an inference module that makes its AI models 60% to 85% faster without new hardware. Nvidia is ...

VentureBeat

Clever architecture over raw compute: DeepSeek shatters the ‘bigger is better’ approach to AI development

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now The AI narrative has reached a critical ...

Tech Times

DeepSeek Releases DSpark: Speculative Decoding Makes V4 Up to 85 Percent Faster

DeepSeek speculative decoding framework DSpark went live June 27 on V4-Flash and V4-Pro, reporting up to 85 percent faster ...

Mint

DeepSeek debuts new AI model as ‘intermediate step’ towards next generation

China's DeepSeek has launched a new AI model called DeepSeek-V3.2-Exp. The new model comes with the new DeepSeek Sparse Attention technology, which the company says is "designed to explore and ...

Geeky Gadgets

Deepseek R2 Crushes Costs by 97% : Fast Hybrid AI Architecture Performance

What if you could access innovative AI technology for a fraction of the cost—without sacrificing performance? Enter Deepseek R2, the open source model that’s turning the industry on its head. At an ...

Yahoo News Singapore

DeepSeek proposes shift in AI model development with 'mHC' architecture to upgrade ResNet

Add Yahoo as a preferred source to see more of our stories on Google. DeepSeek's latest technical paper, co-authored by the firm's founder and CEO Liang Wenfeng, has been cited as a potential game ...

Gizmochina

DeepSeek kicks off 2026 with new AI architecture aimed at more efficient model training

Training large AI models has become one of the biggest challenges in modern computing—not just because of complexity, but because of cost, power use, and wasted resources. A new research paper from ...

Geeky Gadgets

How DeepSeek 3.1 is Outperforming Industry Giants Like OpenAI

What if the next big leap in artificial intelligence wasn’t a flashy, headline-grabbing overhaul, but a so-called “minor update” that quietly redefined what’s possible? Enter DeepSeek V3.1, a release ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results