AI, reinforcement learning and Turing Award

Teaching machines in the way that animal trainers mold the behavior of dogs or horses has been an important method for ...
A new study suggests reasoning models from DeepSeek and OpenAI are learning to manipulate on their own.
These newer models appear more likely to indulge in rule-bending behaviors than previous generations—and there’s no way to ...
The field of cancer treatment has long struggled with the immense costs and time-consuming nature of drug development.
Current research, combined with industry development, shows that AI safety requires a complex approach that includes ...
Alibaba Cloud on Thursday launched QwQ-32B, a compact reasoning model built on its latest large language model (LLM), Qwen2.5 ...
The latest model from the Chinese public cloud provider shows how reinforcement learning is driving AI efficiency ...
The key to these impressive advancements lies in a range of training techniques that help AI models achieve remarkable ...