The Agent Company

Benchmarking LLM Agents on Consequential Real World Tasks