Inputs & Outputs

Inputs

| Input | Required | Default | Description |
| --- | --- | --- | --- |
| skill-name | Yes | - | Name of the skill to evaluate |
| skill-path | Yes | - | Path to the skill directory (must contain SKILL.md and evals/) |
| anthropic-api-key | Yes | - | Anthropic API key for the claude CLI |
| pass-threshold | No | 80 | Minimum pass rate (0-100) to succeed |
| timeout | No | 120 | Timeout per eval case in seconds |
| post-comment | No | true | Post results as a PR comment |
| github-token | No | github.token | Token for PR comments |
| upload-viewer | No | true | Upload eval viewer HTML as artifact |
| node-version | No | 22 | Node.js version for claude CLI |
| max-retries | No | 3 | Max retry attempts per API call |
| retry-delay | No | 10 | Base delay between retries (seconds) |
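
As an illustration of how the optional inputs combine, the step below overrides a few defaults. This is a sketch only: the input names come from the table above, but the specific values (90, 300, 5) and the skill name and path are placeholders chosen for the example, not recommendations.

```yaml
- uses: skill-bench/skill-eval-action@v1
  with:
    # Required inputs
    skill-name: my-skill
    skill-path: skills/my-skill
    anthropic-api-key: ${{ secrets.ANTHROPIC_API_KEY }}
    # Optional overrides (illustrative values)
    pass-threshold: 90    # stricter than the default of 80
    timeout: 300          # seconds per eval case (default 120)
    post-comment: false   # skip the PR comment (default true)
    max-retries: 5        # retry attempts per API call (default 3)
```

Any input left out keeps the default listed in the table.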

Outputs

| Output | Description |
| --- | --- |
| pass-rate | Overall pass rate as percentage (0-100) |
| passed | Total criteria passed |
| total | Total criteria evaluated |
| cases-run | Number of eval cases executed |

Using outputs

- uses: skill-bench/skill-eval-action@v1
  id: eval
  with:
    skill-name: my-skill
    skill-path: skills/my-skill
    anthropic-api-key: ${{ secrets.ANTHROPIC_API_KEY }}

- run: echo "Pass rate was ${{ steps.eval.outputs.pass-rate }}%"
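
The other outputs can be read the same way. The following steps are a hedged sketch of two common patterns: writing a short job summary and gating a later step on the pass rate. It assumes the eval step has `id: eval` as above, that `pass-rate` is emitted as a bare number (so `fromJSON` can parse it), and that outputs are still set when the step fails the threshold. None of these assumptions is confirmed by this page. The step names and the 95 cutoff are hypothetical.

```yaml
# Write a short summary to the job summary page.
# `if: always()` lets this run even when the eval step failed;
# whether outputs are populated in that case is an assumption.
- name: Summarize eval results
  if: always()
  run: |
    echo "Passed ${{ steps.eval.outputs.passed }} of ${{ steps.eval.outputs.total }} criteria" >> "$GITHUB_STEP_SUMMARY"
    echo "Eval cases run: ${{ steps.eval.outputs.cases-run }}" >> "$GITHUB_STEP_SUMMARY"

# Step outputs are strings; fromJSON converts the value to a number
# so the comparison is numeric. Assumes pass-rate has no "%" suffix.
- name: Extra gate (hypothetical)
  if: ${{ fromJSON(steps.eval.outputs.pass-rate) >= 95 }}
  run: echo "Pass rate is at least 95%"
```

Because `fromJSON` fails on an empty string, the gated step only makes sense if the eval step actually ran and produced a `pass-rate`.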