LLM Jailbreak: X-Teaming Attack Achieves 98% Success Against Top Models

A new method called X-Teaming significantly bypasses security measures in leading Artificial Intelligence language models with a 98% success rate.

A novel approach known as X-Teaming has emerged in the field of machine learning, capable of ´jailbreaking´ large language models (LLMs) and circumventing their built-in security measures. The reported 98% success rate highlights a significant vulnerability within top-performing models, raising serious concerns for the Artificial Intelligence security community.

X-Teaming takes advantage of collaborative prompt engineering, employing multiple coordinated prompts or users to break restrictive safety protocols in LLMs. This technique allows attackers to generate responses that typically violate the intended guidelines and content filters imposed by model developers.

The discovery draws attention to ongoing challenges faced in securing conversational Artificial Intelligence and the urgent need for robust, adaptive defenses. Researchers and developers are now tasked with reinforcing LLM safety systems, and the X-Teaming method has sparked debate on transparency, responsible disclosure, and further collaboration in securing Artificial Intelligence technologies.

78

Impact Score

IBM and AMD partner on quantum-centric supercomputing

IBM and AMD announced plans to develop quantum-centric supercomputing architectures that combine quantum computers with high-performance computing to create scalable, open-source platforms. The collaboration leverages IBM´s work on quantum computers and software and AMD´s expertise in high-performance computing and Artificial Intelligence accelerators.

Qualcomm launches Dragonwing Q-6690 with integrated RFID and Artificial Intelligence

Qualcomm announced the Dragonwing Q-6690, billed as the world’s first enterprise mobile processor with fully integrated UHF RFID and built-in 5G, Wi-Fi 7, Bluetooth 6.0, ultra-wideband and Artificial Intelligence capabilities. The platform is aimed at rugged handhelds, point-of-sale systems and smart kiosks and offers software-configurable feature packs that can be upgraded over the air.

Recent books from the MIT community

A roundup of new titles from the MIT community, including Empire of Artificial Intelligence, a critical look at Sam Altman’s OpenAI, and Data, Systems, and Society, a textbook on harnessing Artificial Intelligence for societal good.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.