RAG Evaluation

Compare RAG pipeline answers against vanilla LLMs — scored by GPT-4o judge on 5 criteria.

Powered by your RAG pipeline