Terminal-Bench 2.0: Benchmarking AI Agents on Hard, Realistic CLI Tasks
DOWNLOAD
Bagikan
Facebook
Twitter