IBM Cloud Becomes First to Deploy Intel Gaudi 3 AI Accelerators

IBM Cloud leads in enterprise Artificial Intelligence with the deployment of Intel Gaudi 3, promising significant cost and performance advantages over GPU competitors.

IBM Cloud has become the first cloud service provider to make Intel Gaudi 3 artificial intelligence accelerators available to customers, marking a significant step toward expanding access to high-performance artificial intelligence infrastructure. The integration is designed to address the high costs typically associated with deploying specialized artificial intelligence hardware and to provide enterprise customers with more choices for cost-effective generative artificial intelligence (GenAI) solutions.

This deployment represents Intel´s first major commercial rollout of Gaudi 3, signaling the introduction of competitive options to a market historically dominated by other GPU providers. The companies aim to help clients across industries—including financial services, healthcare, life sciences, and the public sector—test, innovate, and deploy artificial intelligence solutions at a lower total cost. IBM Cloud is extending Gaudi 3 availability in key regions such as Frankfurt, Washington D.C., and Dallas, and the accelerators will be accessible through IBM Virtual Private Cloud and Virtual Servers, with support for Red Hat OpenShift and IBM´s watsonx artificial intelligence platform expected soon.

Backed by recent Signal65 benchmark tests commissioned by Intel, Gaudi 3 has demonstrated marked cost efficiency and performance gains. When running Meta´s Llama-3.1-405B-Instruct-FP8 model with large context sizes, Gaudi 3 was found to be 92% more cost efficient in terms of performance per dollar compared to competitors. On smaller workloads with the IBM Granite-3.1-8B-Instruct model, it delivered 43% more tokens per second, and on large context tasks, achieved a 36% increase in throughput over alternative solutions. These metrics are especially relevant for enterprises scaling up artificial intelligence adoption as they seek lower operation costs and higher output in their workloads.

Enterprise users in banking, insurance, healthcare, and retail rely on IBM Cloud for problems such as fraud detection, personalized service, drug discovery, diagnostics, and inventory management. With Gaudi 3, these industries gain accelerated artificial intelligence model training, fine-tuning, and inferencing capabilities. This move allows customers to modernize technology stacks securely while reaping the benefits of the latest advancements in artificial intelligence hardware platforms.

68

Impact Score

Artificial Intelligence speeds quantum encryption threat timeline

Research from Google and Oratomic suggests quantum computers capable of breaking core internet encryption may arrive sooner than expected. Artificial Intelligence played a key role in improving one of the new algorithms, raising fresh urgency around post-quantum security.

New methods aim to improve Large Language Model reasoning

A new study on arXiv outlines algorithmic techniques designed to strengthen Large Language Model reasoning and reduce hallucinations. The work reports better logical consistency and stronger performance on mathematical and coding benchmarks.

Nvidia acquisition of SchedMD raises Slurm neutrality concerns

Nvidia’s purchase of SchedMD has given it control of Slurm, an open-source scheduler that sits at the center of many supercomputing and large-model training systems. Researchers and engineers are watching for signs that support could tilt toward Nvidia hardware over AMD and Intel alternatives.

Mustafa Suleyman says Artificial Intelligence compute growth is still accelerating

Mustafa Suleyman argues that Artificial Intelligence development is being propelled by simultaneous advances in chips, memory, networking, and software efficiency rather than nearing a hard limit. He contends that rising compute capacity and falling deployment costs will push systems beyond chatbots toward more capable agents.

China and the US are leading different Artificial Intelligence races

The US leads in large language models and advanced chips, while China has built a major advantage in robotics and humanoid manufacturing. That balance is shifting as Chinese developers narrow the gap in model performance and both countries push to combine software and machines.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.