Databricks
GENERATIVE-AI-ENGINEER-ASSOCIATE Question #89: Real Exam Question with Answer & Explanation
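The correct answers are C (Decrease the chunk size of embedded documents) and D (Reduce the number of records retrieved from the vector database). Both shrink the amount of retrieved text inserted into the prompt, which helps it fit within the new model's shorter context length.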
RAG Pipeline Design and Optimization
Question
After changing the response-generating LLM in a RAG pipeline from GPT-4 to a self-hosted model with a shorter context length, the Generative AI Engineer is getting the following error:
What TWO solutions should the Generative AI Engineer implement without changing the response-generating model? (Choose two.)
Options
- A. Use a smaller embedding model to generate embeddings
- B. Reduce the maximum output tokens of the new model
- C. Decrease the chunk size of embedded documents
- D. Reduce the number of records retrieved from the vector database
- E. Retrain the response generating model using ALiBi
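A rough token-budget sketch may help show why chunk size (option C) and the number of retrieved records (option D) both affect whether a RAG prompt fits in a model's context window. It assumes the error, which is not reproduced in the question text, reports that the prompt exceeds the model's context length. The context window, overhead, chunk sizes, and function names below are all hypothetical values chosen for illustration, not figures from the question.

```python
# All numbers below are hypothetical, chosen only to illustrate the arithmetic.
CONTEXT_WINDOW = 4096   # assumed context length of the self-hosted model
PROMPT_OVERHEAD = 300   # assumed tokens for instructions plus the user question


def prompt_tokens(chunk_tokens: int, num_chunks: int) -> int:
    """Estimate prompt size: retrieved chunks plus fixed prompt overhead."""
    return chunk_tokens * num_chunks + PROMPT_OVERHEAD


def fits(chunk_tokens: int, num_chunks: int) -> bool:
    """Return True if the estimated prompt fits in the assumed context window."""
    return prompt_tokens(chunk_tokens, num_chunks) <= CONTEXT_WINDOW


baseline = fits(chunk_tokens=1000, num_chunks=5)        # 5300 tokens
smaller_chunks = fits(chunk_tokens=500, num_chunks=5)   # 2800 tokens (option C)
fewer_records = fits(chunk_tokens=1000, num_chunks=3)   # 3300 tokens (option D)

print(baseline, smaller_chunks, fewer_records)  # False True True
```

Under these assumed numbers, the baseline prompt overflows the window, while either halving the chunk size or retrieving three records instead of five brings it back under the limit. Swapping the embedding model (option A) does not appear in this arithmetic at all, since the chunk text placed in the prompt is the same regardless of which model embedded it.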
Topics
#RAG Pipelines #LLM Context Window #Vector Search #Prompt Engineering