Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of the SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...
Discover how Drawbridge AI bridges the gap between developers and clients, simplifying UI modifications and task management.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results