OpenAI Introduces GPT-4o Model with Audio Capabilities

OpenAI´s GPT-4o brings audio input and output features to Artificial Intelligence models, enabling faster and more cost-efficient applications.

OpenAI has unveiled the GPT-4o model, a powerful addition to its line of Artificial Intelligence offerings. GPT-4o is engineered to handle audio inputs and outputs, expanding beyond the text-only capabilities of previous models. This enhancement enables the development of applications that can listen to and generate spoken responses, marking a significant advancement in interactive and conversational Artificial Intelligence services.

Integrated into the ChatGPT product as ´chatgpt-4o-latest´, the GPT-4o model allows for real-time communication, making it suitable for dynamic tasks such as live translation, customer support, and accessible voice-enabled digital assistants. These features are designed with efficiency in mind, providing high performance at a reduced computational cost compared to earlier, larger models.

OpenAI is also offering a variety of cost-optimized and smaller, faster models within its API ecosystem. These models enable developers to strike a balance between speed, resource use, and advanced capability, broadening the adoption of Artificial Intelligence across diverse applications. The latest updates position OpenAI to meet growing demand for versatile and scalable Artificial Intelligence solutions in industries requiring natural language and voice interfaces.

76

Impact Score

IBM and AMD partner on quantum-centric supercomputing

IBM and AMD announced plans to develop quantum-centric supercomputing architectures that combine quantum computers with high-performance computing to create scalable, open-source platforms. The collaboration leverages IBM´s work on quantum computers and software and AMD´s expertise in high-performance computing and Artificial Intelligence accelerators.

Qualcomm launches Dragonwing Q-6690 with integrated RFID and Artificial Intelligence

Qualcomm announced the Dragonwing Q-6690, billed as the world’s first enterprise mobile processor with fully integrated UHF RFID and built-in 5G, Wi-Fi 7, Bluetooth 6.0, ultra-wideband and Artificial Intelligence capabilities. The platform is aimed at rugged handhelds, point-of-sale systems and smart kiosks and offers software-configurable feature packs that can be upgraded over the air.

Recent books from the MIT community

A roundup of new titles from the MIT community, including Empire of Artificial Intelligence, a critical look at Sam Altman’s OpenAI, and Data, Systems, and Society, a textbook on harnessing Artificial Intelligence for societal good.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.