Sunday, 20 April 2025
OpenAI's New Reasoning Models Show Increased Hallucination Rates

OpenAI's latest reasoning models, o3 and o4-mini, exhibit higher hallucination rates than their predecessors. In the company's internal tests, o3 hallucinated in 33% of responses on the PersonQA benchmark, and o4-mini in 48%, worse than earlier models such as o1 and o3-mini. OpenAI is investigating the cause, noting that the new models tend to make more claims overall, which yields both more accurate statements and more inaccurate ones.
Read full story at TechCrunch