Kimi-Dev-72B sets new open-source benchmark for coding large language models

Moonshot AI´s Kimi-Dev-72B achieves state-of-the-art software engineering performance, using open-source large-scale reinforcement learning for code issue resolution.

Moonshot AI has unveiled Kimi-Dev-72B, a powerful open-source coding large language model (LLM) focused on software engineering and automated issue resolution. With a reported 60.4% performance on the SWE-bench Verified benchmark, Kimi-Dev-72B now leads among open-source models, setting a new state-of-the-art and outperforming previous contenders on practical software engineering tasks.

Kimi-Dev-72B distinguishes itself by leveraging large-scale reinforcement learning. The model is trained to autonomously patch real-world code repositories inside Docker containers, rewarding itself only when the complete test suite for the relevant software passes. This unique approach ensures that Kimi-Dev-72B´s code solutions are not only syntactically correct but also functionally robust, aligning closely with professional software development standards. The deployment and learning setup mimics end-to-end development workflows, making the model´s outputs directly applicable to real-world programming challenges.

This model is available to the wider community through both Hugging Face and GitHub, inviting developers, researchers, and organizations to use, test, and contribute further improvements. The quick start guide showcases how to integrate and interact with the model in Python using the widely adopted transformers library. The Kimi-Dev team also highlights their commitment to open science, with a forthcoming technical report and active community contributions encouraged. Released under the permissive MIT license, Kimi-Dev-72B underscores a collaborative approach to advancing coding-centric artificial intelligence tools and benchmarks.

78

Impact Score

Google Vids opens free video generation to all Google users

Google has made Google Vids available to anyone with a Google account, adding free access to video generation with its latest models. The move expands Google’s end-to-end video workflow and increases pressure on rivals that charge for similar tools.

Court warns against chatbot legal advice in Heppner case

A federal court found that chats with a publicly available generative Artificial Intelligence tool were not protected by attorney-client privilege or the work-product doctrine. The ruling highlights litigation risks when executives or employees use chatbots for legal guidance without lawyer supervision.

Newsom orders California to weigh Artificial Intelligence harms in contract rules

Gov. Gavin Newsom has signed an executive order directing California agencies to account for potential Artificial Intelligence harms in state contracting while expanding approved use of generative tools across government. The move follows a dispute involving Anthropic and reflects a broader split between California and the Trump administration on Artificial Intelligence oversight.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.