Google Unveils DolphinGemma: A Leap Toward Human-Dolphin Communication

Google's new DolphinGemma model could pave the way for real conversations between humans and dolphins by applying artificial intelligence to the analysis of dolphin audio.

Google has announced DolphinGemma, a large language model trained to interpret and generate dolphin sounds, marking a notable advance in interspecies communication research. The artificial intelligence system builds on Google’s Gemma family of open models, which draw on the same research and technology as the Gemini models, and uses Google’s audio technology to analyze dolphin vocalizations. Trained on the Wild Dolphin Project’s extensive acoustic database of wild Atlantic spotted dolphins, DolphinGemma processes complex audio sequences and identifies patterns that could form the basis of a shared human-dolphin vocabulary.

DolphinGemma uses Google’s SoundStream tokenizer to convert dolphin sounds into a compact sequence of tokens, which is then fed into the model. At approximately 400 million parameters, the model is small enough to run on consumer-grade devices such as Google’s Pixel smartphones. The Wild Dolphin Project, in collaboration with Georgia Tech and Google, uses these phones during underwater field research. This lets researchers analyze dolphin communication on site and in real time, linking specific sound patterns to observed behaviors such as social bonding, aggression, and courtship.
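The tokenize-then-predict idea described above can be illustrated with a toy sketch. This is not SoundStream or DolphinGemma: the one-dimensional `CODEBOOK`, the nearest-entry `tokenize` function, and the bigram counter standing in for the language model are all hypothetical simplifications, chosen only to show how audio frames might become discrete tokens whose sequence a model can learn.

```python
from collections import Counter, defaultdict

# Hypothetical 1-D codebook: each entry stands in for a learned audio code.
CODEBOOK = [0.0, 0.5, 1.0]


def tokenize(frames):
    """Map each frame value to the index of its nearest codebook entry."""
    return [
        min(range(len(CODEBOOK)), key=lambda i: abs(CODEBOOK[i] - x))
        for x in frames
    ]


def fit_bigrams(tokens):
    """Count which token follows which (a stand-in for a learned sequence model)."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts, token):
    """Return the most frequent successor of `token`, or None if it was never seen."""
    successors = counts.get(token)
    if not successors:
        return None
    return successors.most_common(1)[0][0]


# Made-up frame values, used only to exercise the functions.
tokens = tokenize([0.1, 0.45, 0.9, 0.05, 0.6, 1.2])
model = fit_bigrams(tokens)
```

A real system would replace the codebook with a learned neural audio codec and the bigram counts with a model of hundreds of millions of parameters, but the flow of sound to tokens to next-token prediction is the same in outline.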

Beyond passive observation, Google and its partners are exploring two-way communication through the Cetacean Hearing Augmentation Telemetry (CHAT) system. CHAT associates synthetic whistles with specific objects or rewards, with the aim of encouraging dolphins to mimic these sounds to request items. Combined with DolphinGemma’s ability to uncover hidden structures and potential meanings in both synthetic and natural dolphin sounds, this method lays the foundation for a practical vocabulary shared between species. Using standard smartphones eliminates the need for custom hardware, which reduces cost, size, and power requirements and makes the technology more accessible and sustainable for extended research at sea. DolphinGemma marks an important step toward bridging the communication gap between humans and dolphins, and it hints at future possibilities for interspecies dialogue powered by artificial intelligence.
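The CHAT idea of tying synthetic whistles to objects can be sketched as a simple template lookup. The article does not describe how CHAT actually recognizes a mimicked whistle, so everything here is an assumption made for illustration: the `TEMPLATES` table, the placeholder labels `object_a` and `object_b`, the frequency contours in kHz, and the distance threshold are all hypothetical.

```python
# Hypothetical whistle templates: coarse frequency contours in kHz,
# each associated with a placeholder object label.
TEMPLATES = {
    "object_a": [8.0, 10.0, 12.0, 10.0],
    "object_b": [12.0, 9.0, 6.0, 9.0],
}


def contour_distance(a, b):
    """Mean absolute difference between two equal-length contours."""
    if len(a) != len(b):
        raise ValueError("contours must have the same length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def match_whistle(contour, max_distance=1.0):
    """Return the label of the closest template, or None if nothing is close enough."""
    label, template = min(
        TEMPLATES.items(), key=lambda kv: contour_distance(contour, kv[1])
    )
    if contour_distance(contour, template) <= max_distance:
        return label
    return None
```

In this sketch, a detected contour close to one of the templates would tell the researcher which object the dolphin may be requesting, while a contour that matches nothing returns `None` instead of forcing a guess.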
