Kimi-Dev-72B sets new open-source benchmark for coding large language models

Moonshot AI´s Kimi-Dev-72B achieves state-of-the-art software engineering performance, using open-source large-scale reinforcement learning for code issue resolution.

Moonshot AI has unveiled Kimi-Dev-72B, a powerful open-source coding large language model (LLM) focused on software engineering and automated issue resolution. With a reported 60.4% performance on the SWE-bench Verified benchmark, Kimi-Dev-72B now leads among open-source models, setting a new state-of-the-art and outperforming previous contenders on practical software engineering tasks.

Kimi-Dev-72B distinguishes itself by leveraging large-scale reinforcement learning. The model is trained to autonomously patch real-world code repositories inside Docker containers, rewarding itself only when the complete test suite for the relevant software passes. This unique approach ensures that Kimi-Dev-72B´s code solutions are not only syntactically correct but also functionally robust, aligning closely with professional software development standards. The deployment and learning setup mimics end-to-end development workflows, making the model´s outputs directly applicable to real-world programming challenges.

This model is available to the wider community through both Hugging Face and GitHub, inviting developers, researchers, and organizations to use, test, and contribute further improvements. The quick start guide showcases how to integrate and interact with the model in Python using the widely adopted transformers library. The Kimi-Dev team also highlights their commitment to open science, with a forthcoming technical report and active community contributions encouraged. Released under the permissive MIT license, Kimi-Dev-72B underscores a collaborative approach to advancing coding-centric artificial intelligence tools and benchmarks.

78

Impact Score

Industry 5.0 shifts focus to human centric value and sustainability

Industry 5.0 reframes industrial transformation around collaboration between humans and machines, emphasizing growth, resilience, and sustainability over narrow efficiency gains. Many organizations still underinvest in human centric and sustainable use cases despite evidence that they create higher value.

Best artificial intelligence video generators for every creator

Leading artificial intelligence video tools like Sora, Veo 3, Adobe Firefly, Runway and Midjourney target different needs, from free social clips to commercially safe productions, but all come with legal and ethical tradeoffs. Choosing the right platform means balancing price, creative control, output quality and how each service handles your data and copyrights.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.