News
Reasoning tasks sharply raise AI costs, according to a new analysis by Artificial Analysis. Google's Gemini Flash 2.5 costs 150 times more to run than Flash 2.0, due to using 17 times more tokens and ...
A new report highlights how Chinese AI startup DeepSeek is outperforming Western benchmarks with a research team that is almost entirely China-based—and what that means for the US’s position in global ...
Phi 4's reasoning process shown in screenshot by Simon Willison.
OpenAI is reportedly in discussions with the U.S. Food and Drug Administration (FDA) about using AI to help evaluate pharmaceuticals, according to a new report from Wired. At the center of the talks ...
Running inside Qwen Chat, it lets users generate front-end code from a single instruction. A prompt like "Create a Twitter-like website" returns working HTML, CSS, and JavaScript—no coding skills or ...
Bytedance has introduced Agent TARS, an experimental open-source automation tool that visually processes web pages and interacts with the command line and file system, currently limited to macOS. The ...
SoundCloud changed its terms of use in February 2024 to allow uploaded music to be used for AI training. AI copyright activist Ed Newton-Rex spotted the change and ...
IBM CEO Arvind Krishna says the company has used AI and so-called AI agents to cut several hundred jobs in its HR department. At the same time, IBM has created new positions in areas like software ...
Retrieval-augmented generation (RAG) promises to help medical AI systems deliver up-to-date and reliable answers. But a new review shows that, so far, RAG rarely works as intended in real-world ...
Anthropic has launched a web search feature for its Claude API, letting developers combine Claude models with up-to-date web data without building their own search infrastructure. Claude decides when ...
ByteDance introduces Seedream 3.0, a new text-to-image model. Benchmarks suggest improvements over GPT-4o and Midjourney in speed, accuracy, and visual quality. ByteDance has released Seedream 3.0, a ...
OpenAI has been developing reinforcement fine-tuning (RFT) since last December, and it is now available for verified organizations using the o4-mini model. RFT uses a programmable evaluation system, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results