Kimi-Dev-72B sets new open-source benchmark for coding large language models

Moonshot AI´s Kimi-Dev-72B achieves state-of-the-art software engineering performance, using open-source large-scale reinforcement learning for code issue resolution.

Moonshot AI has unveiled Kimi-Dev-72B, a powerful open-source coding large language model (LLM) focused on software engineering and automated issue resolution. With a reported 60.4% performance on the SWE-bench Verified benchmark, Kimi-Dev-72B now leads among open-source models, setting a new state-of-the-art and outperforming previous contenders on practical software engineering tasks.

Kimi-Dev-72B distinguishes itself by leveraging large-scale reinforcement learning. The model is trained to autonomously patch real-world code repositories inside Docker containers, rewarding itself only when the complete test suite for the relevant software passes. This unique approach ensures that Kimi-Dev-72B´s code solutions are not only syntactically correct but also functionally robust, aligning closely with professional software development standards. The deployment and learning setup mimics end-to-end development workflows, making the model´s outputs directly applicable to real-world programming challenges.

This model is available to the wider community through both Hugging Face and GitHub, inviting developers, researchers, and organizations to use, test, and contribute further improvements. The quick start guide showcases how to integrate and interact with the model in Python using the widely adopted transformers library. The Kimi-Dev team also highlights their commitment to open science, with a forthcoming technical report and active community contributions encouraged. Released under the permissive MIT license, Kimi-Dev-72B underscores a collaborative approach to advancing coding-centric artificial intelligence tools and benchmarks.

78

Impact Score

Moltbot and the case for human agency as the core Artificial Intelligence guardrail

Moltbot’s viral rise highlights both the appeal of deeply personalized Artificial Intelligence agents and the rising need for users to assert their own agency, security practices, and governance. Human decision making and responsibility emerge as the decisive safeguard as open source agentic Artificial Intelligence systems gain system level powers.

Artificial Intelligence reshapes business visibility and accountability

Artificial Intelligence has shifted from a back-office productivity tool to a front-door interface that controls how organisations are discovered, interpreted, and trusted, creating new governance and accountability pressures. As search and decision-making move inside Artificial Intelligence systems, businesses must treat visibility, accuracy, and oversight as board-level issues rather than marketing concerns.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.