Study shows less advanced agents lose out in artificial intelligence negotiation

A new study reveals that disparities between artificial intelligence agents can lead to unequal negotiation outcomes, raising concerns about digital inequality.

The pursuit of bigger artificial intelligence models is giving way to a focus on autonomous agents—systems that can act, decide, and even negotiate for users in everyday scenarios. A recent study explored what happens when both sides of a negotiation—buyer and seller—use competing artificial intelligence agents. The findings point to a notable advantage for parties using stronger, more advanced agents, with those leveraging older or weaker systems routinely losing out in financial deals.

Researchers ran experiments with major language models acting as buyers and sellers in simulated negotiations involving electronics, vehicles, and real estate. The results showed a clear hierarchy: OpenAI’s newer ChatGPT-o3 and GPT-4.1 models consistently struck better deals compared to older or less capable models like GPT-3.5. DeepSeek’s latest models also performed well as sellers, while Qwen2.5 was more successful as a buyer. Not only did advanced models maximize profits or minimize spending, but some regularly outperformed others by a significant margin, mirroring real-world disparities that could emerge as artificial intelligence agents enter financial decision-making arenas.

However, the study highlighted more than just financial gaps. Even the leading models faltered at times, failing to close deals or getting entangled in endless negotiations. The researchers caution that differences in model architecture, training data, and especially model size contribute to these imbalances. They point to a bigger risk: if artificial intelligence agent negotiations become widespread, disparities in model access or sophistication could cement new forms of digital inequality, especially in domains like business procurement or finance where negotiation is common.

Experts urge caution and further research, emphasizing the simulated nature of these studies and the complexity of real markets. Some suggest that artificial intelligence agents should be evaluated not just by their best-case outcomes but also by how reliably—and safely—they handle stress and adversity, since even rare failures might result in significant systemic risk. While consumer-facing tools like Amazon´s ´Buy for Me´ avoid direct price negotiation for now, business platforms are beginning to experiment with artificial intelligence-powered sourcing assistants. For now, researchers advise using artificial intelligence as an informational resource, not a stand-in for human judgment in critical negotiations.

74

Impact Score

LLM-PIEval: a benchmark for indirect prompt injection attacks in large language models

Large language models have increased interest in Artificial Intelligence and their integration with external tools introduces risks such as direct and indirect prompt injection. LLM-PIEval provides a framework and test set to measure indirect prompt injection risk and the authors release API specifications and prompts to support wider assessment.

NVIDIA may stop bundling memory with gpu kits amid gddr shortage

NVIDIA is reportedly considering supplying only bare silicon to its aic partners rather than the usual gpu and memory kit as gddr shortages constrain fulfillment. The move follows wider industry pressure from soaring dram prices and an impending price increase from AMD of about 10% across its gpu lineup.

SK Hynix to showcase 48 Gb/s 24 Gb GDDR7 for Artificial Intelligence inference

SK Hynix will present a 24 Gb GDDR7 chip rated for 48 Gb/s at ISSCC 2026, claiming a symmetric dual-channel design and updated internal interfaces that push past the expected 32 to 37 Gb/s. The paper positions the device for mid-range Artificial Intelligence inference and SK Hynix will also show LPDDR6 running at 14.4 Gb/s.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.