Evaluate a Retrieval Augmented Generation (RAG) system with Scorecard in minutes.
Figure: Simplified schema of a production RAG system
Set up accounts
Install the SDK (and, optionally, OpenAI)
Create a minimal RAG system
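As a rough illustration of what "minimal" can mean here, the sketch below wires a retriever and a generator together in plain Python. Everything in it is an assumption made for illustration: the `DOCUMENTS` list, the token-overlap retriever, the `generate` stub that stands in for an LLM call, and the `run_system` name are hypothetical and are not taken from the Scorecard SDK.

```python
import re

# Hypothetical in-memory knowledge base; a real system would use a vector store.
DOCUMENTS = [
    "Scorecard is a platform for evaluating LLM applications.",
    "Retrieval Augmented Generation combines a retriever with a generator.",
    "The retriever selects context passages relevant to the user query.",
]


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, documents: list[str], top_k: int = 1) -> list[str]:
    """Rank documents by token overlap with the query and return the top_k."""
    query_tokens = tokenize(query)
    ranked = sorted(
        documents,
        key=lambda doc: len(query_tokens & tokenize(doc)),
        reverse=True,
    )
    return ranked[:top_k]


def generate(query: str, context: list[str]) -> str:
    """Stand-in for an LLM call: return the retrieved context as the answer."""
    return " ".join(context)


def run_system(inputs: dict) -> dict:
    """Run retrieval, then generation, for one testcase-style input."""
    context = retrieve(inputs["query"], DOCUMENTS)
    answer = generate(inputs["query"], context)
    return {"retrievedContext": context, "answer": answer}
```

Returning both the retrieved context and the answer keeps the two stages separately inspectable, which is useful when retrieval and generation are judged by different metrics. Swapping the `generate` stub for a real model call would not change the shape of the output.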
Set up the Scorecard client and Project
Create RAG testcases
Each testcase has a query, the retrievedContext you expect to be used, and the idealAnswer for judging correctness.
Create AI judge metrics
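To make the connection between testcases and a judge metric concrete, here is a small sketch of a testcase with the three fields named above and a prompt an AI judge could receive. The field names come from the text; the example values, the `JUDGE_TEMPLATE` wording, the `answer` field, and the `build_judge_prompt` helper are hypothetical and do not reflect how Scorecard defines metrics internally.

```python
# Hypothetical testcases using the three fields named in the text.
TESTCASES = [
    {
        "query": "What does the retriever do in a RAG system?",
        "retrievedContext": "The retriever selects context passages relevant to the user query.",
        "idealAnswer": "It selects passages relevant to the user's query.",
    },
]

# Illustrative judge instructions; the wording is an assumption, not Scorecard's.
JUDGE_TEMPLATE = (
    "You are grading a RAG system's answer for correctness.\n"
    "Question: {query}\n"
    "Retrieved context: {retrievedContext}\n"
    "Ideal answer: {idealAnswer}\n"
    "System answer: {answer}\n"
    "Reply with PASS if the system answer agrees with the ideal answer, "
    "otherwise FAIL."
)


def build_judge_prompt(testcase: dict, answer: str) -> str:
    """Fill the judge template with one testcase and the system's answer."""
    return JUDGE_TEMPLATE.format(answer=answer, **testcase)
```

For example, `build_judge_prompt(TESTCASES[0], "It finds relevant passages.")` yields a single prompt string that a judge model could score. A binary PASS/FAIL verdict is only one possible design; a graded scale would work with the same inputs.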
Run and evaluate
Analyze results
Figure: Different types of testing in a RAG system, including retrieval-only testing
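Retrieval-only testing checks the retriever's output without involving the generator. One simple way to sketch it is a hit rate: the share of testcases whose expected context shows up among the passages actually retrieved. The `retrieved` key and both function names below are hypothetical, and exact-match comparison is an assumption chosen for simplicity; it is not a description of Scorecard's built-in metrics.

```python
def retrieval_hit(expected_context: str, retrieved: list[str]) -> bool:
    """Return True if the expected passage appears among the retrieved ones."""
    normalized = expected_context.strip().lower()
    return any(normalized == passage.strip().lower() for passage in retrieved)


def hit_rate(cases: list[dict]) -> float:
    """Fraction of cases whose expected context was retrieved."""
    if not cases:
        return 0.0
    hits = sum(retrieval_hit(c["retrievedContext"], c["retrieved"]) for c in cases)
    return hits / len(cases)
```

Exact matching is strict and will miss paraphrased or re-chunked passages. A similarity threshold or an AI judge would be more forgiving, at the cost of being less deterministic.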