nerdexam
DatabricksDatabricks

GENERATIVE-AI-ENGINEER-ASSOCIATE · Question #89

GENERATIVE-AI-ENGINEER-ASSOCIATE Question #89: Real Exam Question with Answer & Explanation

The correct answer is C: Decrease the chunk size of embedded documents. See the full explanation below for the reasoning.

RAG Pipeline Design and Optimization

Question

After changing the response generating LLM in a RAG pipeline from GPT-4 to a model with a shorter context length that the company self-hosts, the Generative AI Engineer is getting the following error: What TWO solutions should the Generative AI Engineer implement without changing the response generating model? (Choose two.)

Options

  • AUse a smaller embedding model to generate embeddings
  • BReduce the maximum output tokens of the new model
  • CDecrease the chunk size of embedded documents
  • DReduce the number of records retrieved from the vector database
  • ERetrain the response generating model using ALiBi

Topics

#RAG Pipelines#LLM Context Window#Vector Search#Prompt Engineering

Community Discussion

No community discussion yet for this question.

Full GENERATIVE-AI-ENGINEER-ASSOCIATE PracticeBrowse All GENERATIVE-AI-ENGINEER-ASSOCIATE Questions