NVIDIA NeMo Microservices Enable Fast, Scalable AI Agent Deployment for the Enterprise

NVIDIA NeMo microservices are now available to help enterprises build and optimize Artificial Intelligence teammates with advanced data flywheels, boosting employee productivity.

NVIDIA has announced the general availability of its NeMo microservices, an integrated developer platform designed to streamline the creation and deployment of agentic Artificial Intelligence systems within enterprise IT environments. These microservices allow organizations to build and manage AI-powered digital teammates that leverage user interactions, inference data, and business intelligence to continuously enhance model performance through orchestrated data flywheels—systems that curate, retrain, and evaluate models with minimal human input.

Key components of the NeMo suite, including NeMo Customizer, NeMo Evaluator, NeMo Guardrails, NeMo Retriever, and NeMo Curator, provide tools for fine-tuning large language models, benchmarking against both custom and industry standards, and enforcing robust safety and compliance protocols. For example, enterprises can accelerate large language model training throughput by up to 1.8x and improve compliance protection by 1.4x. NeMo microservices are deployable on NVIDIA AI Enterprise and operate seamlessly on any accelerated computing infrastructure, offering enterprise-grade security and support either on-premises or in the cloud.

Industry leaders such as AT&T, BlackRock, Cisco’s Outshift team, and Nasdaq are leveraging NeMo microservices to build advanced AI agents that deliver measurable gains: AT&T used the platform to increase agent accuracy by 40% for core business functions such as personalized service and fraud prevention, while Nasdaq improved its platform’s response time and accuracy by up to 30%. The platform supports a broad array of foundation models, including Meta’s Llama, Microsoft Phi, Google Gemma, Mistral, and the Llama Nemotron Ultra, enabling versatile development across diverse enterprise needs.

NeMo microservices are now integrated by a growing ecosystem of technology partners and solution providers, from major storage and infrastructure companies like Dell Technologies, HPE, IBM, NetApp, and Pure Storage, to software platforms and consulting leaders, including Accenture, Deloitte, and EY. Developers can access NeMo microservices via the NVIDIA NGC catalog, with full API support for popular frameworks and guarantee of stability, proactive security updates, and support through NVIDIA AI Enterprise’s extended-life software branches.

76

Impact Score

Mustafa Suleyman says Artificial Intelligence compute growth is still accelerating

Mustafa Suleyman argues that Artificial Intelligence development is being propelled by simultaneous advances in chips, memory, networking, and software efficiency rather than nearing a hard limit. He contends that rising compute capacity and falling deployment costs will push systems beyond chatbots toward more capable agents.

China and the US are leading different Artificial Intelligence races

The US leads in large language models and advanced chips, while China has built a major advantage in robotics and humanoid manufacturing. That balance is shifting as Chinese developers narrow the gap in model performance and both countries push to combine software and machines.

Congress weighs Artificial Intelligence transparency rules

Bipartisan lawmakers are pushing a federal transparency standard for the largest Artificial Intelligence models as Congress works on a broader national framework. The proposal aims to increase public trust while avoiding stricter state-by-state requirements and heavier regulation.

Report finds California creative job losses are not driven by Artificial Intelligence

New research from Otis College of Art and Design finds California’s recent creative industry job losses stem from cost pressures and structural shifts, not direct worker displacement by generative Artificial Intelligence. The technology is changing workflows and expectations, but it is largely replacing tasks rather than entire jobs.

U.S. senators propose broader chip tool export ban for Chinese firms

A bipartisan proposal in the U.S. Senate would shift semiconductor equipment controls from specific fabs to targeted Chinese companies and their affiliates. The measure is aimed at cutting off access to advanced lithography and other wafer fabrication tools for firms such as Huawei, SMIC, YMTC, CXMT, and Hua Hong.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.