Problem statement
Even frontier LLMs struggle to solve complex tasks that require tools. While the web offers an abundance of information, there are not that many datasets for training agents to solve problems with tools. Designing datasets at scale is not a trivial task
