ArXiv tightens rules on Artificial Intelligence generated papers

ArXiv is escalating enforcement against careless use of large language models in scientific submissions. Authors who submit papers showing clear signs of unchecked model output can face a 1-year ban and stricter conditions for future postings.

ArXiv, a widely used open repository for preprint research, is increasing its crackdown on careless use of large language models in scientific papers. Although submissions appear before peer review, the platform is a major channel for research circulation in fields such as computer science and math, and it also serves as a source of data on scientific research trends.

The repository has already introduced measures aimed at low-quality, Artificial Intelligence-generated papers, including requiring first-time posters to obtain an endorsement from an established author. After being hosted by Cornell for more than 20 years, the organization is becoming an independent nonprofit, a change expected to help it raise more funding to address problems tied to Artificial Intelligence slop.

Thomas Dietterich, chair of arXiv’s computer science section, said that “if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can’t trust anything in the paper.” Examples of that evidence include hallucinated references and comments to or from the large language model. If such evidence is found, a paper’s authors will face “a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted by a reputable peer-reviewed venue.”

The policy does not ban large language models outright. Instead, it requires authors to take full responsibility for everything included in a paper, regardless of how it was produced. Researchers who paste in inappropriate language, plagiarized content, biased content, errors, mistakes, incorrect references, or misleading content from a model remain accountable for those problems.

Dietterich told 404 Media that the enforcement approach will operate as a “one-strike” rule, but moderators must first flag the problem and section chairs must confirm the evidence before a penalty is imposed. Authors will also have the ability to appeal. The move comes as recent peer-reviewed research has found fabricated citations increasing in biomedical research, likely linked to large language models.

68

Impact Score

Europe accelerates Artificial Intelligence in defence

European militaries are moving from limited Artificial Intelligence support tools to deeper integration in targeting, decision support and weapons systems. France, Germany and the United Kingdom are leading major programmes, while Ukraine is shaping how the technology is tested and deployed.

New LLM architectures target long-context efficiency

Recent open-weight language models are adding targeted architectural changes to cut the cost of long-context inference. Key ideas include cross-layer KV sharing, per-layer embeddings, compressed attention, and wider residual pathways.

Simple Artificial Intelligence recommendations for small business growth

Research from the University of Warwick and Nanyang Technological University, Singapore, examines how small and medium sized enterprises can use simpler Artificial Intelligence recommendation systems without large datasets or costly infrastructure. Findings from a field experiment suggest low data approaches can still increase customer engagement and spending.

Quantexa wins HMRC data modernisation contract

Quantexa has secured a £175 million, 10-year contract from HM Revenue & Customs to modernise the tax authority’s data infrastructure and support governed use of Artificial Intelligence across core operations. The deal positions the London-founded company at the centre of a major UK public sector data transformation programme.

EU Artificial Intelligence Act delay gives HR more time to prepare

The European Union has pushed back compliance deadlines for high-risk Artificial Intelligence systems, giving HR teams more time to prepare for rules that still carry broad reach beyond Europe. Experts say the delay should be treated as a chance to strengthen governance, data practices, and cross-functional accountability rather than slow down.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.