All posts tagged with: evalite

Measuring an AI Code Reviewer with LLM-as-a-Judge Evaluation

Nil (นิล) tests an AI code review agent with LLM-as-a-Judge evaluation using the Evalite library, comparing judge models from two providers and measuring the results with Issue Coverage and False Positive Rate.

May 4, 2026