Terminal-Bench 2.0: Benchmarking AI Agents on Hard, Realistic CLI Tasks
Download (MP3)
Bagikan
Facebook
Twitter