Nvidia Blackwell Ultra targets agentic artificial intelligence with higher efficiency

Nvidia's Blackwell Ultra platform is being positioned to power a surge in agentic artificial intelligence and coding assistants, with new data indicating major gains in throughput and cost efficiency over Hopper. Inference providers are adopting the architecture to handle rising software development workloads that demand low latency and long context processing.

The Nvidia Blackwell platform has seen wide adoption among inference providers such as Baseten, DeepInfra, Fireworks Artificial Intelligence and Together Artificial Intelligence, where it is used to reduce cost per token by up to 10x. Building on this deployment, the Nvidia Blackwell Ultra platform is aimed at accelerating agentic Artificial Intelligence, particularly for coding assistants and autonomous agents that must manage complex, multistep tasks. These workloads span entire codebases and require both very low latency and the ability to maintain long context to keep interactions responsive and coherent.

According to OpenRouter’s State of Inference report, Artificial Intelligence agents and coding assistants are driving rapid growth in software-programming-related Artificial Intelligence queries, which increased from 11% to about 50% last year. This shift underscores how much inference demand is shifting toward interactive development tools and automated software agents. These applications place significant pressure on infrastructure to deliver real-time responsiveness while scaling to large numbers of concurrent requests and extended conversations.

New SemiAnalysis InferenceX performance data shows that the combination of Nvidia’s software optimizations and the next-generation Nvidia Blackwell Ultra platform has delivered advances in both performance and efficiency. Nvidia GB300 NVL72 systems now deliver up to 50x higher throughput per megawatt, resulting in 35x lower cost per token compared with the Nvidia Hopper platform. By coordinating innovation across chips, system architecture and software, Nvidia is using an extreme codesign approach to accelerate performance across Artificial Intelligence workloads ranging from agentic coding tools to interactive coding assistants, while continuing to drive down inference costs at scale.

70

Impact Score

Google Vids opens free video generation to all Google users

Google has made Google Vids available to anyone with a Google account, adding free access to video generation with its latest models. The move expands Google’s end-to-end video workflow and increases pressure on rivals that charge for similar tools.

Court warns against chatbot legal advice in Heppner case

A federal court found that chats with a publicly available generative Artificial Intelligence tool were not protected by attorney-client privilege or the work-product doctrine. The ruling highlights litigation risks when executives or employees use chatbots for legal guidance without lawyer supervision.

Newsom orders California to weigh Artificial Intelligence harms in contract rules

Gov. Gavin Newsom has signed an executive order directing California agencies to account for potential Artificial Intelligence harms in state contracting while expanding approved use of generative tools across government. The move follows a dispute involving Anthropic and reflects a broader split between California and the Trump administration on Artificial Intelligence oversight.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.