feat(eval): v0.2 graded relevance schema + harness #24
Reference in New Issue
Block a user
Delete Branch "feat/eval-v0-2-graded-relevance"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
queries.yaml v0.1 23 case → v0.2 schema swap:
ocr_derived / failure_expected)
run_eval.py 확장:
README.md 신규:
Phase 1 plan: ~/.claude/plans/phase-1-graded-eval-v0-2.md
Parent: ~/.claude/plans/peppy-hugging-nest.md § Phase 1
본 PR closure: schema + harness + README. 신규 28 case + baseline 박제 +
약점 분석 (embedding-sensitive failure pattern 4 카테고리 식별) 은 후속 PR.
Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com