DeepSeek has published a technical paper co-authored by founder Liang Wenfeng proposing a rethink of its core deep learning ...
DeepSeek, the Chinese artificial intelligence (AI) startup, that took the Silicon Valley by storm in November 2024 with its R1 AI model has now revealed a new architecture that can help bring down the ...
Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025) ...
DeepSeek has released new research showing that a promising but fragile neural network design can be stabilised at scale, delivering measurable performance gains in large language models without ...
DeepSeek has expanded its R1 whitepaper by 60 pages to disclose training secrets, clearing the path for a rumored V4 coding ...
DeepSeek researchers have developed a technology called Manifold-Constrained Hyper-Connections, or mHC, that can improve the performance of artificial intelligence models. The Chinese AI lab debuted ...
The paper comes at a time when most AI start-ups have been focusing on turning AI capabilities in LLMs into agents and other products DeepSeek's latest technical paper, co-authored by the firm's ...