Researchers in South Korea have developed a method designed to make artificial intelligence (AI) models acknowledge unfamiliar topics instead of answering with unjustified certainty. The work targets a longstanding problem in chatbot behavior: overconfidence when handling questions or situations outside a model’s training data. Improving that ability could make AI systems more dependable in settings where incorrect answers carry higher risks, such as autonomous driving and medicine.
Researchers from the Korea Advanced Institute of Science and Technology (KAIST) say previous studies have identified AI overconfidence as a major risk, especially in tasks such as medical diagnosis. Widely used AI models such as OpenAI’s ChatGPT have been shown to hallucinate, producing fabricated information because they are pushed to guess rather than admit uncertainty. The researchers argue that a fundamental source of the problem lies in how models first learn from data through artificial neural networks: small errors introduced at this early stage can propagate through later training and grow into larger mistakes.
The team found that when random data was fed into a neural network at the initialization stage, the model produced highly confident outputs despite not having learned anything, a behavior that leads to hallucination. To address the issue, the researchers drew inspiration from human brain development, noting that in humans, brain signals are generated without external input even before birth, helping the brain manage uncertainty.
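The overconfidence of an untrained network is easy to reproduce with a few lines of standard deep-learning code. The sketch below is purely illustrative and is not taken from the study; the small classifier, the input size, and the ten output classes are all arbitrary choices used to show how peaked a freshly initialized model’s outputs can be on pure noise.

```python
import torch
import torch.nn as nn

# Illustrative only: a small untrained classifier, not the architecture from the study.
torch.manual_seed(0)
model = nn.Sequential(
    nn.Linear(256, 512), nn.ReLU(),
    nn.Linear(512, 512), nn.ReLU(),
    nn.Linear(512, 10),           # 10 hypothetical classes
)

noise = torch.randn(1000, 256)    # random inputs the network has never learned anything about
with torch.no_grad():
    probs = torch.softmax(model(noise), dim=-1)

# A well-calibrated "I don't know" answer would sit near chance (0.1 for 10 classes);
# an untrained network typically puts noticeably more probability on one arbitrary class.
print("mean top-class confidence:", probs.max(dim=-1).values.mean().item())
print("chance level:", 1 / probs.shape[-1])
```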
Building on that idea, the scientists created a process in which the neural-network backbone of an AI model undergoes brief pre-training on random noise inputs before actual learning begins. According to the researchers, this warm-up stage helps the model establish a baseline by calibrating its own uncertainty in advance: it sets the model’s initial confidence to a low level close to chance and significantly reduces its overconfidence bias. In practical terms, the method teaches the model to begin from a state closer to “I don’t know anything yet”.
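The article does not spell out the training objective used in the warm-up stage, but one plausible reading is a short optimization loop that pushes the network’s outputs toward a uniform, chance-level distribution on pure noise inputs. The sketch below illustrates that reading under stated assumptions; the model, the optimizer settings, and the uniform-target loss are hypothetical and should not be taken as the authors’ published procedure.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical warm-up: briefly pre-train on random noise so the model's
# confidence starts near chance, before any real data is seen.
torch.manual_seed(0)
model = nn.Sequential(
    nn.Linear(256, 512), nn.ReLU(),
    nn.Linear(512, 512), nn.ReLU(),
    nn.Linear(512, 10),
)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

num_classes = 10
uniform_target = torch.full((128, num_classes), 1.0 / num_classes)  # "I don't know" target

for step in range(500):                      # brief warm-up, no real data involved
    noise = torch.randn(128, 256)            # random noise stands in for unlearned input
    log_probs = F.log_softmax(model(noise), dim=-1)
    # KL divergence to the uniform distribution: minimized when the model
    # spreads its probability evenly, i.e. confidence drops to chance level.
    loss = F.kl_div(log_probs, uniform_target, reduction="batchmean")
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# After the warm-up, the model should respond to random inputs with near-chance
# confidence, the kind of low-confidence baseline described in the article.
```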
The researchers said models given the warm-up training were better able to lower their confidence and recognize that they do not know the answer when facing data not encountered during training. They described this as a step toward helping AI distinguish between what it knows and what it does not. Se-Bum Paik, an author of the study published in Nature Machine Intelligence, said the findings show that incorporating principles of brain development can help AI recognize its own knowledge state in a more human-like way, improving its ability to identify uncertainty or possible mistakes rather than only increasing answer accuracy.
