After changing the response generating LLM in a RAG pipeline from GPT-4 to a model with a shorter context length that the company self-hosts, the Generative AI Engineer is getting the following error:

Sign in or unlock GENERATIVE-AI-ENGINEER-ASSOCIATE to reveal the answer and full explanation for question #89. The question stem and answer options stay visible for context.

RAG Pipeline Design and Optimization

Question

What TWO solutions should the Generative AI Engineer implement without changing the response generating model? (Choose two.)

Options

AUse a smaller embedding model to generate embeddings
BReduce the maximum output tokens of the new model
CDecrease the chunk size of embedded documents
DReduce the number of records retrieved from the vector database
ERetrain the response generating model using ALiBi

Unlock GENERATIVE-AI-ENGINEER-ASSOCIATE to see the answer

You've previewed enough free GENERATIVE-AI-ENGINEER-ASSOCIATE questions. Unlock GENERATIVE-AI-ENGINEER-ASSOCIATE for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock GENERATIVE-AI-ENGINEER-ASSOCIATE - $49.99 / 30 days Sign in

Topics

#RAG Pipelines#LLM Context Window#Vector Search#Prompt Engineering

Full GENERATIVE-AI-ENGINEER-ASSOCIATE Practice