Alibaba Unveils Qwen3: New Standard in Open-Source Large Language Models

Alibaba launches Qwen3, a groundbreaking open-source large language model family, pushing Artificial Intelligence innovation with hybrid reasoning and multilingual support.

Alibaba has introduced Qwen3, its latest generation of open-sourced large language models, establishing a new benchmark in Artificial Intelligence innovation. Qwen3 comprises six dense models and two Mixture-of-Experts (MoE) models, with parameter scales ranging from 0.6 billion to 235 billion, now freely accessible worldwide. Developers can leverage these models for diverse applications spanning mobile devices, smart glasses, autonomous vehicles, and robotics. All Qwen3 models are open sourced and available on platforms such as Hugging Face, GitHub, and ModelScope, ensuring broad developer access and fostering global collaboration.

Qwen3 pioneers Alibaba´s debut in hybrid reasoning models, uniting traditional large language model capabilities with advanced dynamic reasoning. The models are engineered to switch flexibly between ´thinking´ mode for complex, multi-step tasks—such as mathematics, coding, and logical deduction—and ´non-thinking´ mode for fast, general-purpose outputs. For API users, Qwen3 provides granular control over the duration of its reasoning (up to 38,000 tokens), optimizing performance while containing computational costs. The flagship model, Qwen3-235B-A22B MoE, notably reduces operational expenses compared to other state-of-the-art models, reaffirming Alibaba´s commitment to affordable, high-performance Artificial Intelligence.

The Qwen3 suite is trained on an expansive dataset of 36 trillion tokens, twice that of its predecessor, resulting in significant advancements in reasoning, instruction following, tool use, and multilingual tasks. Key features include superior support for 119 languages and dialects, robust agent-task integration through native Model Context Protocol and function-calling, leading benchmark scores in mathematics and coding, and enhanced human alignment for natural dialogue and creative applications. The models achieved top-tier results across industry benchmarks including AIME25, LiveCodeBench, BFCL, and Arena-Hard, driven by a complex four-stage training process focused on reinforcement learning and reasoning fusion.

Open access is central to Qwen3´s release, as Alibaba aims to accelerate Artificial Intelligence innovation across industries. Since inception, the Qwen model family has recorded over 300 million downloads worldwide, with more than 100,000 derivative models created by the developer community. Qwen3 already underpins Alibaba´s Artificial Intelligence super assistant app, Quark, and will soon be available via its Model Studio platform. This comprehensive open-source approach signals Alibaba´s ambition to redefine the global landscape of large language models and hybrid Artificial Intelligence solutions.

79

Impact Score

IBM, Red Hat, and Google donate llm-d to CNCF

IBM Research, Red Hat, and Google Cloud have donated llm-d, an open-source Kubernetes framework for large language model inference, to the CNCF as a sandbox project. The move aims to create a vendor-neutral blueprint for deploying scalable inference across models, accelerators, and clouds.

AAMU named regional lead for Amazon Web Services machine learning university

Alabama A&M University has been named a regional lead institution for Amazon Web Services Machine Learning University, expanding its role in Artificial Intelligence and machine learning education, research, and workforce development. The designation follows the university’s recent national HBCU summit on Artificial Intelligence and sets up new curriculum, faculty training, and student career pathways across the Southeast.

EDPB backs global privacy statement on Artificial Intelligence-generated imagery

The European Data Protection Board has endorsed a joint Global Privacy Assembly statement warning that Artificial Intelligence-generated images and videos can seriously harm privacy, dignity, and safety. The statement calls for stronger safeguards, transparency, and protections for children and other vulnerable groups.

Intel unveils Arc Pro B70 and B65 workstation GPUs

Intel has introduced the Arc Pro B70 and Arc Pro B65 for workstation-class Artificial Intelligence compute and professional visualization. The Arc Pro B70 is the fullest expression yet of the Xe2 Battlemage discrete GPU design in this lineup.

Intel launches Xeon 600 workstation processors

Intel has launched its Xeon 600 series Granite Rapids-WS processors for workstations and high-end desktops, with a focus on Artificial Intelligence development, AVX-512, and large PCIe connectivity. The lineup scales up to 86 Redwood Cove performance cores and supports high memory and I/O capacity.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.