OpenAI Paper - Search News

16h on MSN

Claude just beat GPT-5, Gemini, and Grok in real-world job tasks, according to OpenAI’s own study

Surprisingly, the OpenAI study shows that the best performing model was Anthropic’s Claude Opus 4.1, which outpaced not only OpenAI’s GPT-5 but also Gemini and Grok.

24d

Why AI chatbots hallucinate, according to OpenAI researchers

Hallucinations are when chatbots confidently present wrong information as fact. They plague the most popular chatbots, like ...

11d on MSN

OpenAI’s research on AI models deliberately lying is wild

It’s not news that AI models will lie. By now most of us have experienced AI hallucinations, or the model confidently giving ...

14d

What do people actually use ChatGPT for? OpenAI provides some numbers.

In addition to measuring overall user and usage growth, OpenAI's paper also breaks down total usage based on when its ...

17d

Why OpenAI’s solution to AI hallucinations would kill ChatGPT tomorrow

OpenAI’s latest research paper diagnoses exactly why ChatGPT and other large language models can make things up – known in ...

11d

OpenAI admits AI hallucinations are mathematically inevitable, not just engineering flaws

In a landmark study, OpenAI researchers reveal that large language models will always produce plausible but false outputs, ...

CNET on MSN

Is AI Capable of 'Scheming'? What OpenAI Found When Testing for Tricky Behavior

Research shows advanced models like ChatGPT, Claude and Gemini can act deceptively in lab tests. OpenAI insists it's a rarity ...

CNET on MSN

Most People Use ChatGPT for Personal Life, Not Work, According to a New OpenAI Study

That doesn't mean people are using the chatbot less. ChatGPT has about 700 million weekly active users worldwide, sending ...

10d

How OpenAI explains why LLMs appear confident when they have no idea

In a paper, OpenAI identifies confident errors in large language models as intentional technical weaknesses. Fixing them requires a rethink within the industry.

22d

Are bad incentives to blame for AI hallucinations?

A new research paper from OpenAI asks why large language models like GPT-5 and chatbots like ChatGPT still hallucinate and ...

MediaNama Opinion

Over 70% ChatGPT Interactions Are Non-Work, Guidance Most Common Use Case: OpenAI Study

OpenAI study has highlighted how people are using ChatGPT, with 73% conversations being non-work related as of July 2025 ...

Observer

OpenAI Study Uncovers 3 Surprises About How People Use ChatGPT

OpenAI’s latest study finds ChatGPT is shaping daily life more than work, with women users now in the majority.
