Rdma for s3-compatible storage accelerates Artificial Intelligence workloads

Rdma for S3-compatible storage uses remote direct memory access to speed S3-API object storage access for Artificial Intelligence workloads, reducing latency, lowering CPU use and improving throughput. Nvidia and multiple storage vendors are integrating client and server libraries to enable faster, portable data access across on premises and cloud environments.

Enterprises are generating vast volumes of unstructured data and Artificial Intelligence workloads are becoming increasingly data-intensive. The article frames object storage as a cost-effective option that historically served archives, backups and data lakes but has lacked the performance needed for fast-paced Artificial Intelligence training and inference. The need for scalable, portable storage between on premises infrastructure and the cloud is driving exploration of new approaches to object storage performance.

Remote direct memory access, or RDMA, for S3-compatible storage is presented as a solution that accelerates the S3 application programming interface-based storage protocol. By offloading data transfers from the host CPU and using RDMA-enabled networking, the approach promises higher throughput per terabyte, improved throughput per watt, lower cost per terabyte and much lower latency than traditional TCP-based transports. Nvidia has developed RDMA client and server libraries; storage partners have incorporated the server libraries into their products and client libraries run on GPU compute nodes to enable faster data access for Artificial Intelligence workloads and better GPU utilization. The article notes that initial libraries are optimized for Nvidia GPUs and networking while the architecture remains open for other vendors and contributors.

Several leading object storage vendors are adopting the technology. Cloudian, Dell Technologies and HPE are integrating RDMA for S3-compatible libraries into HyperStore, ObjectScale and Alletra Storage MP X10000 respectively. Executives quoted in the piece emphasize scalability, portability and reduced total cost of ownership for large-scale Artificial Intelligence deployments and AI factories. Nvidia’s libraries are available to select partners now and are expected to be generally available via the Nvidia CUDA Toolkit in January, alongside information about a new Nvidia object storage certification as part of the Nvidia-Certified Storage program.

68

Impact Score

Huawei chip design raises pressure on Nvidia, AMD, and Intel

Huawei has outlined a new chip design framework that it says can improve efficiency and reduce dependence on leading-edge manufacturing tools. The move adds pressure on US chipmakers as China builds a domestic Artificial Intelligence semiconductor ecosystem under export restrictions.

UK and EU seek simpler medical device rules

The UK and EU are advancing medical device regulatory changes aimed at improving predictability, reducing bottlenecks and supporting market access. Manufacturers of Artificial Intelligence-enabled devices in Europe will still need to navigate overlapping rules even as compliance timelines are extended.

LLMSurgeon targets foundation model data auditing

LLMSurgeon introduces a way to infer the domain mix of large language model pretraining data using only generated text. The framework is designed to improve transparency around foundation models whose training corpora remain largely undisclosed.

Databricks model units target lower inference costs

Databricks is positioning model units as a new way to manage large language model inference, aiming to cut GPU spending while improving reliability under enterprise-scale demand. The approach reflects growing pressure on platforms to balance cost, latency, and resilience as agentic Artificial Intelligence workloads expand.

Texas arrests man over Artificial Intelligence-generated child abuse images

Texas authorities arrested a Carrizo Springs man accused of creating hundreds of pornographic images and videos involving children by using Artificial Intelligence tools to manipulate photos taken from public school-affiliated pages. Investigators said the case also uncovered non-Artificial Intelligence-generated child sexual abuse images and identified approximately 30 victims.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.