Eval Viewer
The action uploads an interactive HTML viewer as a GitHub Actions artifact.
Tabs
Grading
Shows per-case results with:
- Pass/fail badges for each criterion
- Evidence quotes from the grader
- Full response output
- Duration and token count
Benchmark
Shows a summary table with:
- Overall pass rate
- Per-case pass rate with visual bars
- Time and token usage
- Total cost
Downloading
Artifacts appear in the Artifacts section at the bottom of each workflow run's summary page. The viewer's filename includes a timestamp and the skill name (e.g., 20260317T100000-tf-guide.html).
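If you collect viewers from several runs, it can help to split the filename back into its parts. The sketch below assumes the layout suggested by the example filename: a YYYYMMDDTHHMMSS timestamp, a hyphen, then the skill name. That layout is inferred from the single example, not a documented contract, and parse_viewer_filename is a hypothetical helper rather than part of the action.

```python
from datetime import datetime
from pathlib import Path


def parse_viewer_filename(name: str) -> tuple[datetime, str]:
    """Split '<timestamp>-<skill>.html' into (timestamp, skill name).

    Assumes the timestamp looks like 20260317T100000; the time zone is
    not encoded in the filename, so the result is a naive datetime.
    """
    stem = Path(name).stem  # drop the .html extension
    # Split on the first hyphen only, since skill names may contain hyphens.
    stamp, _, skill = stem.partition("-")
    return datetime.strptime(stamp, "%Y%m%dT%H%M%S"), skill


when, skill = parse_viewer_filename("20260317T100000-tf-guide.html")
print(when.isoformat(), skill)  # 2026-03-17T10:00:00 tf-guide
```

Splitting on the first hyphen keeps hyphenated skill names such as tf-guide intact.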
Disabling
Set upload-viewer: 'false' in the action inputs to skip artifact upload.