+1 to this, anecdotally I’ve found in my own evaluations that if your system pro... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		malwrar 31 days ago \| parent \| context \| favorite \| on: Something is afoot in the land of Qwen +1 to this, anecdotally I’ve found in my own evaluations that if your system prompt doesn’t explicitly declare how to invoke a tool and e.g. describe what each tool does, most models I’ve tried fail to call tools or will try to call them but not necessarily use the right format. With the right prompt meanwhile, even weak models shoot up in eval accuracy.

mongrelion 30 days ago [–]

> [...] _but not necessarily use the right format._

This has also been my experience. But isn't the harness sending the instructions on how to invoke a tool? Maybe it is missing the formatting part. What do you think?

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact