NVIDIA Showcases Next-Gen Multimodal Generative AI Research at ICLR 2024

NVIDIA unveils groundbreaking Artificial Intelligence models for audio, robotics, and multimodal applications at ICLR 2024, pushing generative innovation across industries.

NVIDIA Research is at the forefront of advancing Artificial Intelligence through a comprehensive approach that spans cutting-edge computing infrastructure, optimized compilers, novel algorithms, and transformative applications. Presenting over 70 papers at the International Conference on Learning Representations (ICLR) 2024 in Singapore, the company is showcasing technological strides intended to deliver new capabilities in fields such as autonomous systems, healthcare, content creation, and robotics.

Key research highlights include Fugatto, an advanced audio generative model adept at creating or transforming combinations of music, voice, and sounds from mixed text and audio prompts, redefining the possibilities in sound synthesis. Robotics development is propelled by the HAMSTER project, which leverages hierarchical designs in vision-language-action models to facilitate knowledge transfer using data that doesn´t rely on costly real-world robot collection. Meanwhile, Hymba introduces a family of small language models utilizing a hybrid architecture that blends transformer and state space models, providing higher throughput, improved recall, and efficient memory usage without compromising accuracy.

Innovations in visual understanding are advanced through LongVILA, enabling efficient training of visual-language models on long-form video data and achieving state-of-the-art results across multiple benchmarks. On the language modeling front, LLaMaFlex introduces a novel compression technique for large language models, outperforming several existing methods and significantly reducing computational costs. In computational biology, Proteina presents new capabilities for generating eligible protein backbone structures using deep transformer architectures. Other notable progress includes the SRSA framework, which enhances robotic learning by enabling task adaptation from preexisting skill libraries, and STORM, capable of reconstructing dynamic 3D outdoor scenes swiftly from minimal input—vital for autonomous vehicle development.

NVIDIA Research´s 400-member team continues to drive global innovation across computer architecture, generative technologies, graphics, self-driving systems, and robotics, cementing the company´s role as a pivotal contributor to the next generation of Artificial Intelligence across diverse sectors.

83

Impact Score

Congress weighs Artificial Intelligence transparency rules

Bipartisan lawmakers are pushing a federal transparency standard for the largest Artificial Intelligence models as Congress works on a broader national framework. The proposal aims to increase public trust while avoiding stricter state-by-state requirements and heavier regulation.

Report finds California creative job losses are not driven by Artificial Intelligence

New research from Otis College of Art and Design finds California’s recent creative industry job losses stem from cost pressures and structural shifts, not direct worker displacement by generative Artificial Intelligence. The technology is changing workflows and expectations, but it is largely replacing tasks rather than entire jobs.

U.S. senators propose broader chip tool export ban for Chinese firms

A bipartisan proposal in the U.S. Senate would shift semiconductor equipment controls from specific fabs to targeted Chinese companies and their affiliates. The measure is aimed at cutting off access to advanced lithography and other wafer fabrication tools for firms such as Huawei, SMIC, YMTC, CXMT, and Hua Hong.

Trump executive order targets state Artificial Intelligence laws

Executive Order 14365 lays out a federal strategy to discourage, challenge, and potentially preempt state Artificial Intelligence laws viewed as burdensome. Employers are advised to keep complying with current state and local rules while preparing for regulatory uncertainty in 2026.

Who decides how America uses Artificial Intelligence in war

Stanford experts are divided over how the United States should govern Artificial Intelligence in defense, surveillance, and warfare. Their views converge on one point: decisions with such high stakes cannot be left to companies alone.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.