Terminal-Bench 2.0: Benchmarking AI Agents on Hard, Realistic CLI Tasks

Download (MP3)




Bagikan FacebookTwitter