How your data fuels artificial intelligence training and the risks of chatbot diagnoses

From passports to medical conversations, personal data is being swept into artificial intelligence training sets, sparking privacy and safety concerns.

Recent research has uncovered that DataComp CommonPool, a widely used open-source training set for artificial intelligence, likely contains millions of images bearing highly sensitive personal information. This includes photographs of passports, credit cards, birth certificates, and documents displaying personally identifiable information (PII), as well as identifiable faces. Auditing just 0.1% of the dataset, researchers extrapolated that the underlying collection could contain hundreds of millions of such images, all scraped from publicly accessible sites. The revelation serves as a stark reminder that virtually anything posted online may end up in the data pipelines fueling artificial intelligence models, raising profound ethical, privacy, and security questions.

On another front, a shift in artificial intelligence health chatbot deployment is raising safety alarms. Most companies behind high-profile chatbots have stopped including medical disclaimers in responses to user health queries, according to new findings. Instead, these systems are answering health questions directly, sometimes following up and even attempting diagnoses. The removal of medical disclaimers eliminates a crucial safeguard, increasing the likelihood that users will place unwarranted trust in potentially unreliable or unsafe advice. Researchers argue that these developments could jeopardize public health, particularly for vulnerable individuals seeking help for issues like eating disorders or cancer, who might not realize the limitations of chatbots.

Additional technology stories highlight broader security and policy dilemmas. Hackers have recently exploited vulnerabilities in Microsoft software to attack government agencies, with engineers in a race to contain the threat. In Europe, the French government is probing into X (formerly Twitter) over alleged recommendation algorithm manipulation, and tensions around artificial intelligence policy are mounting as Meta declines to sign the European Union’s voluntary Artificial Intelligence code of practice, signaling potential regulatory friction. Meanwhile, international trends reveal a growing divide as authoritarian states accelerate digital surveillance and censorship through technology partnerships, inching the globe toward a digitally charged cold war. These developments collectively underscore the urgency for clear guardrails around data use, digital rights, and the rapidly evolving role of artificial intelligence in shaping society and individual lives.

83

Impact Score

UK and EU Artificial Intelligence regulatory outlook for May 2026

The UK is moving ahead with targeted Artificial Intelligence measures in policing, online safety, cyber security and copyright policy, while the EU is refining how the EU Artificial Intelligence Act will apply in practice. Consultations, new offences and implementation deadlines are shaping the next phase of compliance on both sides.

Germany sets out national implementation of the Artificial Intelligence Act

Germany has published a draft law to implement the European Artificial Intelligence Act through new supervisory structures, clearer institutional responsibilities, and measures designed to support innovation. The proposal puts the Federal Network Agency at the center of enforcement while preserving sector-specific oversight in sensitive fields.

ECB warns banks about new Artificial Intelligence security risks

The European Central Bank has called major banks to an emergency meeting over cybersecurity risks tied to advanced Artificial Intelligence models. Regulators want banks to speed up security updates as newer tools make it easier to find and exploit vulnerabilities.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.