Meta and Nvidia ramp up battle for DeepSeek artificial intelligence talent

Meta and Nvidia are intensifying efforts to attract DeepSeek’s top artificial intelligence experts as the global race for innovation accelerates.

The competition to attract elite artificial intelligence talent has reached new heights, with Meta and Nvidia targeting key hires from Chinese startup DeepSeek. According to a report from the South China Morning Post, DeepSeek recently launched an aggressive recruitment campaign, showcasing a wave of open roles on Microsoft´s LinkedIn platform. The company is actively seeking candidates for ten positions, including specialist internships focusing on large language models, as well as roles for deep learning researchers, core systems engineers, and software developers located in Hangzhou and Beijing.

This surge in hiring by DeepSeek comes as Meta accelerates its own efforts to strengthen artificial intelligence capabilities, particularly in the domain of artificial general intelligence, or AGI. Meta continues to hunt for senior research and engineering talent, looking to outpace rivals not only in model development but also in foundational research. Nvidia, a global leader in artificial intelligence hardware, is also making notable moves; in late June, the company appointed two prominent Chinese artificial intelligence scientists, Zhu Bangguo and Jiao Jiantao, to positions of influence, signaling a deeper engagement with the Chinese AI ecosystem.

DeepSeek’s job postings, all written in Chinese, highlight the company´s substantial technological resources, including high-end GPU clusters and an emphasis on fast iteration and experimentation. The company is aiming to attract overseas Chinese talent, particularly those working or studying in the U.S., leveraging its reputation for large-scale, cost-effective model innovation. Since its emergence in January, DeepSeek has disrupted markets with low-cost but competitive artificial intelligence models. Its recent releases include updates to its R1 reasoning model and foundational V3 system, which have kept major players in the sector on alert. As technology giants like Meta and Nvidia respond, the competitive landscape in artificial intelligence talent acquisition only grows fiercer, underscoring the global stakes in next-generation machine intelligence.

74

Impact Score

Anthropic launches Claude Mythos for Project Glasswing

Anthropic has introduced Claude Mythos Preview, a new frontier Artificial Intelligence model positioned as a major advance in cybersecurity capability. The model is being used to power Project Glasswing, a coalition effort to secure critical software before similar capabilities spread more widely.

Artificial Intelligence speeds quantum encryption threat timeline

Research from Google and Oratomic suggests quantum computers capable of breaking core internet encryption may arrive sooner than expected. Artificial Intelligence played a key role in improving one of the new algorithms, raising fresh urgency around post-quantum security.

New methods aim to improve Large Language Model reasoning

A new study on arXiv outlines algorithmic techniques designed to strengthen Large Language Model reasoning and reduce hallucinations. The work reports better logical consistency and stronger performance on mathematical and coding benchmarks.

Nvidia acquisition of SchedMD raises Slurm neutrality concerns

Nvidia’s purchase of SchedMD has given it control of Slurm, an open-source scheduler that sits at the center of many supercomputing and large-model training systems. Researchers and engineers are watching for signs that support could tilt toward Nvidia hardware over AMD and Intel alternatives.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.