Verl: Reinforcement Learning Framework for Post-Training Large Language Models

Verl streamlines reinforcement learning post-training for large language models and integrates seamlessly with major machine learning tools, optimizing throughput and scalability for Artificial Intelligence applications.

Verl is an open source, flexible reinforcement learning (RL) training framework specifically designed for post-training large language models (LLMs). Built as an implementation of the HybridFlow architecture, verl offers a user-friendly platform for constructing and executing sophisticated RL dataflows, enabling researchers and practitioners to efficiently extend and experiment with diverse RL algorithms. Its modular APIs decouple computation and data dependencies, allowing seamless integration with existing LLM infrastructures such as PyTorch FSDP, Megatron-LM, and vLLM, as well as with HuggingFace models.

The framework is engineered for performance and scalability, supporting flexible device mapping and parallelism to optimize GPU utilization across various cluster sizes. Verls´s adoption of the 3D-HybridEngine technology ensures efficient actor model resharding, which eliminates memory redundancies and reduces communication overhead during training and generation transitions. The architecture leverages the strengths of both single-controller and multi-controller paradigms, streamlining the execution of complex post-training workflows using a concise and extensible codebase.

Comprehensive documentation and a suite of practical guides—covering installation, backend choices, multi-node training, programming with the HybridFlow model, data preparation, configuration management, and performance tuning—enables rapid onboarding and experimentation. Verls supports community collaboration under Apache License 2.0, with contributions encouraged via GitHub, Slack, or WeChat. The framework adopts modern code quality practices, employing ´ruff´ for linting/formatting and ´pre-commit´ for code management. Continuous integration guidance and open project roadmaps further bolster community engagement, positioning verl as a robust resource for advancing RL-based post-training in cutting-edge Artificial Intelligence systems.

71

Impact Score

Google Vids opens free video generation to all Google users

Google has made Google Vids available to anyone with a Google account, adding free access to video generation with its latest models. The move expands Google’s end-to-end video workflow and increases pressure on rivals that charge for similar tools.

Court warns against chatbot legal advice in Heppner case

A federal court found that chats with a publicly available generative Artificial Intelligence tool were not protected by attorney-client privilege or the work-product doctrine. The ruling highlights litigation risks when executives or employees use chatbots for legal guidance without lawyer supervision.

Newsom orders California to weigh Artificial Intelligence harms in contract rules

Gov. Gavin Newsom has signed an executive order directing California agencies to account for potential Artificial Intelligence harms in state contracting while expanding approved use of generative tools across government. The move follows a dispute involving Anthropic and reflects a broader split between California and the Trump administration on Artificial Intelligence oversight.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.