OpenAI Introduces GPT-4o Model with Audio Capabilities

OpenAI´s GPT-4o brings audio input and output features to Artificial Intelligence models, enabling faster and more cost-efficient applications.

OpenAI has unveiled the GPT-4o model, a powerful addition to its line of Artificial Intelligence offerings. GPT-4o is engineered to handle audio inputs and outputs, expanding beyond the text-only capabilities of previous models. This enhancement enables the development of applications that can listen to and generate spoken responses, marking a significant advancement in interactive and conversational Artificial Intelligence services.

Integrated into the ChatGPT product as ´chatgpt-4o-latest´, the GPT-4o model allows for real-time communication, making it suitable for dynamic tasks such as live translation, customer support, and accessible voice-enabled digital assistants. These features are designed with efficiency in mind, providing high performance at a reduced computational cost compared to earlier, larger models.

OpenAI is also offering a variety of cost-optimized and smaller, faster models within its API ecosystem. These models enable developers to strike a balance between speed, resource use, and advanced capability, broadening the adoption of Artificial Intelligence across diverse applications. The latest updates position OpenAI to meet growing demand for versatile and scalable Artificial Intelligence solutions in industries requiring natural language and voice interfaces.

76

Impact Score

Pope Leo frames Artificial Intelligence as a media power struggle

Pope Leo XIV’s first encyclical casts Artificial Intelligence as a moral question of power, labor, and collective responsibility, offering publishers a framework for negotiating with technology companies. The broader media landscape is also shifting as AP supplies election data to ChatGPT, YouTube expands labeling of Artificial Intelligence video, and search traffic declines for publishers.

Why the U.S. leads Europe in Artificial Intelligence adoption

Survey evidence shows U.S. workers and firms are adopting Artificial Intelligence faster than their European counterparts. The gap appears to be driven not only by workforce composition, but also by stronger managerial support and greater workplace encouragement to use the technology.

FluxMem brings dynamic memory to large language model agents

FluxMem reframes memory for large language model agents as a dynamic graph that evolves with feedback, task variation, and long-term use. The approach is designed to reduce the brittleness of static memory systems and improve reliability in complex environments.

Microsoft and NVIDIA hint at N1X Windows 11 launch

Microsoft and NVIDIA signaled a joint Windows 11 push around the N1X, framing it as a new era of PC. The upcoming Arm chip is positioned to bring Copilot+ acceleration and challenge the fastest Windows processors in its class.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.