oncall-guide-ai / evaluation /metric5_6_llm_judge_evaluator.py

Commit History

Enhance Direct LLM Evaluator and Judge Evaluator:
40d39ed

YanBoChen commited on

Add multi-system evaluation support for clinical actionability and evidence quality metrics
16a2990

YanBoChen commited on

Before Run the 1st Evalation: Add Precision & MRR Chart Generator and a sample test query
a2aaea2

YanBoChen commited on