Package Reference
Agent Skill Evals exposes one package: agent-skill-evals.
Use these three import paths:
- agent-skill-evals/skill-checks: checks SKILL.md files and their tests.
- agent-skill-evals/agent: runs an agent in a copied sample project and saves evidence.
- agent-skill-evals/assertions: provides skill.checks and skill.test.
The package does not expose a root import.
How Agent Tests Run
- Promptfoo loads agent-skill-evals/agent.js.
- Agent Skill Evals copies the sample project to a temporary folder.
- Agent Skill Evals runs preconditions in the copy.
- If they pass, Agent Skill Evals runs the agent in the copy.
- Agent Skill Evals records evidence: changed files, command results, tool calls, output, and run details.
- Promptfoo calls skill.test.
- skill.test checks should and should_not against the evidence.
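
The evidence and checking steps above can be sketched roughly as follows. This is a minimal illustration, not the package's implementation. The function names diffSnapshots and checkExpectations, the snapshot shape (a Map from file path to contents), and the reading of should and should_not as lists of file paths that must or must not change are all assumptions made for this example.

```js
// Hypothetical sketch: not the agent-skill-evals implementation.

// Compare two snapshots of a project copy (Map of path -> file contents)
// and list the files that were added, modified, or deleted between them.
function diffSnapshots(before, after) {
  const paths = new Set([...before.keys(), ...after.keys()]);
  const changed = [];
  for (const path of paths) {
    if (before.get(path) !== after.get(path)) changed.push(path);
  }
  return changed.sort();
}

// Assumed semantics: every `should` path must appear in the changed files,
// and no `should_not` path may appear there.
function checkExpectations(evidence, { should = [], should_not = [] }) {
  const changed = new Set(evidence.changedFiles);
  const failures = [
    ...should
      .filter((path) => !changed.has(path))
      .map((path) => `expected change: ${path}`),
    ...should_not
      .filter((path) => changed.has(path))
      .map((path) => `unexpected change: ${path}`),
  ];
  return { pass: failures.length === 0, failures };
}

// Example: the agent edited README.md and added src/index.js.
const before = new Map([
  ["README.md", "old"],
  ["package.json", "{}"],
]);
const after = new Map([
  ["README.md", "new"],
  ["package.json", "{}"],
  ["src/index.js", ""],
]);

const evidence = { changedFiles: diffSnapshots(before, after) };
const result = checkExpectations(evidence, {
  should: ["README.md"],
  should_not: ["package.json"],
});
console.log(result);
```

Working on a copy is what makes this kind of comparison safe: the original sample project is never modified, so each run starts from the same state.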
How Skill Checks Run
- Promptfoo loads agent-skill-evals/skill-checks.js.
- Agent Skill Evals reads the SKILL.md file from vars.skillPath.
- Agent Skill Evals reads the tests from vars.testsGlob.
- Agent Skill Evals checks the skill text, referenced files, tests, sample projects, and verifier scripts.
- skill.checks reports the result.
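
One of the checks listed above, verifying referenced files, might look something like the sketch below. It is illustrative only. The function name findMissingReferences, the Markdown-link pattern, and the rule that only relative links count as referenced files are assumptions, not the package's actual rules.

```js
// Hypothetical sketch: not the agent-skill-evals implementation.

// Collect relative Markdown link targets such as [guide](docs/guide.md)
// and return those that are not in the list of known project files.
function findMissingReferences(skillText, existingFiles) {
  const known = new Set(existingFiles);
  const linkPattern = /\[[^\]]*\]\(([^)\s]+)\)/g;
  const missing = [];
  for (const match of skillText.matchAll(linkPattern)) {
    const target = match[1].split("#")[0]; // drop any #fragment
    const isExternal = /^[a-z][a-z0-9+.-]*:/i.test(target); // e.g. https:
    if (target && !isExternal && !known.has(target) && !missing.includes(target)) {
      missing.push(target);
    }
  }
  return missing;
}

// Example skill text with one resolvable link, one missing file,
// and one external URL that the check ignores.
const skillText = [
  "# Example skill",
  "See [the guide](docs/guide.md) and [the script](scripts/verify.sh).",
  "Background: [Promptfoo](https://www.promptfoo.dev/).",
].join("\n");

console.log(findMissingReferences(skillText, ["docs/guide.md"]));
```

Here the only path reported as missing is scripts/verify.sh, because docs/guide.md is in the known file list and the external URL is skipped.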
Files To Add
Use this for Skill Checks:
```js
// agent-skill-evals/skill-checks.js
export { default } from "agent-skill-evals/skill-checks";
```

Use this for agent tests:

```js
// agent-skill-evals/agent.js
export { default } from "agent-skill-evals/agent";
```

Use this for both skill.checks and skill.test:

```js
// agent-skill-evals/assertions.js
export { default } from "agent-skill-evals/assertions";
export * from "agent-skill-evals/assertions";
```