openai o1 news - Search News

News

ChatGPT o3 hallucinates more than o1, and OpenAI has no idea why

OpenAI delivered advanced ChatGPT reasoning models this month that are more capable than o1, but they also hallucinate more.

7don MSN

OpenAI’s new reasoning AI models hallucinate more

OpenAI's reasoning AI models are getting better, but their hallucinating isn't, according to benchmark results.

OpenAI's most capable models hallucinate more than earlier ones

OpenAI says its latest models, o3 and o4-mini, are its most powerful yet. However, research shows the models also hallucinate ...

Futurism on MSN4d

OpenAI's Hot New AI Has an Embarrassing Problem

OpenAI's latest AI models tend to make things up — or "hallucinate" — substantially more than earlier versions.

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...

OpenAI’s o3: AI Benchmark Discrepancy Reveals Gaps in Performance Claims

The FrontierMath benchmark from Epoch AI tests generative models on difficult math problems. Find out how OpenAI’s o3 and ...

3don MSN

OpenAI’s leading models keep making things up — here's why

If you’ve used an AI model, you’ve most likely seen it hallucinate. This is when the model produces incorrect or misleading ...

Mashable7d

OpenAI's o3 and o4-mini hallucinate way higher than previous models

OpenAI's reasoning models are billed as more ... to "spend more time thinking before they respond," as described in the o1 announcement. Rather than largely relying on stochastic methods to ...

OpenAI debuts o3 and o4-mini advanced reasoning models

OpenAI introduced two new reasoning models this week: o3 and o4-mini. The company claims that these are its smartest AI ...

OpenAI releases new simulated reasoning models with full tool access

On Wednesday, OpenAI announced the release of two new models—o3 and o4-mini—that combine simulated reasoning capabilities ...

9don MSN

OpenAI announces o3 and o4-mini reasoning models for ChatGPT (updated)

OpenAI's o3 and o4-mini models for ChatGPT have arrived.

GV Wire8d

Comparing AI reasoning abilities reveals OpenAI's o1 model surpasses DeepSeek's R1 in generating accurate, sentence-level ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results