Evaluating RAG Systems for Improvement

Question

A Generative AI Engineer has created a RAG application which can help employees retrieve answers from an internal knowledge base, such as Confluence pages or Google Drive. The prototype application is now working with some positive feedback from internal company testers. Now the Generative AI Engineer wants to formally evaluate the system’s performance and understand where to focus their efforts to further improve the system. How should the Generative AI Engineer evaluate the system?

Options

A. Use cosine similarity score to comprehensively evaluate the quality of the final generated answers.
B. Curate a dataset that can test the retrieval and generation components of the system separately.
C. Benchmark multiple LLMs with the same data and pick the best LLM for the job.
D. Use an LLM-as-a-judge to evaluate the quality of the final answers generated.

Topics

#RAG System Evaluation, #Component-wise Evaluation, #System Diagnostics, #Generative AI Engineering
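
As a rough illustration of the "Component-wise Evaluation" topic, the sketch below shows what scoring the retrieval and generation steps of a RAG system separately might look like. Everything in it is an assumption made for illustration: the `EvalExample` fields, the `retrieve` and `generate` callables, and the choice of metrics (recall@k and reciprocal rank for retrieval, token-overlap F1 for generation) are hypothetical and do not come from the question or from any particular library.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalExample:
    """One curated test case (hypothetical schema)."""
    question: str
    relevant_doc_ids: set[str]  # gold labels for the retrieval step
    reference_answer: str       # gold answer for the generation step


def recall_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the relevant documents found in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


def reciprocal_rank(retrieved: list[str], relevant: set[str]) -> float:
    """1 / rank of the first relevant document, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a generated answer and the reference answer."""
    pred = prediction.lower().split()
    ref = reference.lower().split()
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def evaluate(
    examples: list[EvalExample],
    retrieve: Callable[[str], list[str]],        # question -> ranked doc ids
    generate: Callable[[str, list[str]], str],   # (question, doc ids) -> answer
    k: int = 3,
) -> dict[str, float]:
    """Average retrieval and generation scores over the dataset, separately."""
    n = len(examples)
    ranked = [retrieve(ex.question) for ex in examples]
    # Generation is fed the gold documents, so retrieval mistakes
    # cannot leak into the generation score.
    answers = [generate(ex.question, sorted(ex.relevant_doc_ids)) for ex in examples]
    return {
        "recall@k": sum(recall_at_k(r, ex.relevant_doc_ids, k)
                        for r, ex in zip(ranked, examples)) / n,
        "mrr": sum(reciprocal_rank(r, ex.relevant_doc_ids)
                   for r, ex in zip(ranked, examples)) / n,
        "answer_f1": sum(token_f1(a, ex.reference_answer)
                         for a, ex in zip(answers, examples)) / n,
    }
```

With scores reported per component, a low `recall@k` alongside a high `answer_f1` would point at the retriever, and the reverse would point at the generation step. Token-overlap F1 is only a simple stand-in here; a real evaluation could swap in a different answer-quality metric.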
