AMD Instinct MI355X drives competitive performance for optimized LLM workloads

Artificial Intelligence training workloads are the focus of AMD's ROCm 7.0 and v25.9 Training Dockers, which promise optimized scaling for JAX and PyTorch LLM training.

Artificial Intelligence training workloads are increasingly pushing modern GPU architectures, and AMD is positioning its software stack to meet that demand. The company highlights ROCm 7.0 software as a foundation for optimized support across the JAX and PyTorch frameworks, while the v25.9 Training Dockers are presented as demonstrating exceptional scaling efficiency in both single-node and multi-node setups. AMD frames these updates as enabling researchers and developers to scale model sizes and complexity further than before.

The announcement emphasizes integration with Primus, a unified and flexible LLM training framework, to streamline PyTorch-based development on AMD Instinct GPUs. Primus now supports both the TorchTitan and Megatron-LM backends, offering flexibility for different large model training approaches. In addition, Primus-Turbo is described as accelerating Transformer models to further boost training throughput specifically on AMD Instinct MI355X GPUs, addressing throughput and efficiency goals important to high-performance model training.

Practically, the combination of ROCm 7.0, the v25.9 Training Dockers, and the Primus toolchain is presented as an end-to-end push to make AMD Instinct hardware more competitive for LLM workloads. The coverage directs readers to the Primus-Repo for access to the framework and related tooling. Overall, the material positions these software and framework updates as targeted improvements for scaling LLM training workflows on AMD hardware while focusing on interoperability with established frameworks and backends.

52

Impact Score

European Union Artificial Intelligence Act raises layered compliance demands for finance

Banks, insurers and financial intermediaries face a more complex compliance environment as the European Union Artificial Intelligence Act overlays existing financial regulation and the GDPR. Proposed changes in the Digital Omnibus Package may delay some obligations, but the core challenge remains managing overlapping rules, roles and regulators.

Europe and US discuss biometric data-sharing framework

European Union and US officials are negotiating a border security arrangement that could enable continuous biometric data exchanges on EU citizens. The UK says the US has also requested access to fingerprint records as part of Visa Waiver Program discussions.

Apple plans Intel 18A-P for M7 and 14A for A21

Apple is expected to use Intel’s 18A-P process for M7 chips in MacBook models and Intel’s 14A process for A21 chips in iPhones. The shift points to a broader supplier strategy as Apple moves beyond TSMC for parts of its future silicon roadmap.

Google and other chatbots surface real phone numbers

Generative Artificial Intelligence chatbots are surfacing real phone numbers and other personal details, sometimes by pulling from obscure public sources and sometimes by inventing plausible but wrong contact information. Privacy experts say users have few reliable ways to find out whether their data is in model training sets or to force its removal.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.