DeepSeek Stirs Anticipation with Rumored R2 Model Breakthrough

Chinese start-up DeepSeek fuels intense online discussion as rumors circulate about the launch and capabilities of its next open-source Artificial Intelligence model, R2.

Chinese start-up DeepSeek is at the center of mounting online speculation as social media buzz grows over the impending release of its next open-source artificial intelligence model, DeepSeek-R2. The company, well-known for its cost-efficient technology, has not officially confirmed details about R2’s launch, but discussions online suggest the new model could introduce significant advances in performance and cost savings, raising anticipation throughout the tech sector against the backdrop of the ongoing US-China tech rivalry.

Interest in DeepSeek surged after the company’s rapid emergence in late 2024 and early 2025, when it introduced two advanced open-source artificial intelligence models, V3 and R1. Both models drew attention for being developed at a fraction of the cost and computing resources compared to those required by global tech giants for similar large language model (LLM) projects. Such LLMs underpin generative artificial intelligence applications like ChatGPT, which have become central to both industry and consumer use cases.

According to recent posts circulating on Chinese stock trading social platforms, the upcoming R2 model is reportedly based on a hybrid mixture-of-experts (MoE) architecture, featuring a massive 1.2 trillion parameters. This architecture divides models into specialized sub-networks that handle different aspects of data processing, resulting in substantially reduced computation needs during pre-training and faster inference times. Notably, R2 is claimed to be up to 97.3 per cent less expensive to build than OpenAI’s GPT-4o. These rumors, if substantiated, could position DeepSeek R2 as a transformative player in global artificial intelligence competition, particularly as Chinese start-ups vie to lessen dependence on Western technologies amid ongoing international tech tensions.

81

Impact Score

IBM and AMD partner on quantum-centric supercomputing

IBM and AMD announced plans to develop quantum-centric supercomputing architectures that combine quantum computers with high-performance computing to create scalable, open-source platforms. The collaboration leverages IBM´s work on quantum computers and software and AMD´s expertise in high-performance computing and Artificial Intelligence accelerators.

Qualcomm launches Dragonwing Q-6690 with integrated RFID and Artificial Intelligence

Qualcomm announced the Dragonwing Q-6690, billed as the world’s first enterprise mobile processor with fully integrated UHF RFID and built-in 5G, Wi-Fi 7, Bluetooth 6.0, ultra-wideband and Artificial Intelligence capabilities. The platform is aimed at rugged handhelds, point-of-sale systems and smart kiosks and offers software-configurable feature packs that can be upgraded over the air.

Recent books from the MIT community

A roundup of new titles from the MIT community, including Empire of Artificial Intelligence, a critical look at Sam Altman’s OpenAI, and Data, Systems, and Society, a textbook on harnessing Artificial Intelligence for societal good.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.