NetEase-backed study suggests language model agents can detect bugs faster and with greater coverage than existing tools.
Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 ...