Databricks

GENERATIVE-AI-ENGINEER-ASSOCIATE Question #95: Real Exam Question with Answer & Explanation

The correct answer is D: context length 512: smallest model is 0.13GB and embedding dimension 384. A context length of 512 is enough to embed each 512-token chunk in full, and this option has the smallest model and the lowest embedding dimension, so it is the cheapest to run and store and the fastest to respond. Option A also fits the chunks but is larger; options B and C add context length, model size, and dimensionality that this application does not need.

RAG System Design

Question

A Generative AI Engineer is creating an LLM-based application. The documents for its retriever have been chunked to a maximum of 512 tokens each. The Generative AI Engineer knows that cost and latency are more important than quality for this application. They have several context length levels to choose from. Which will fulfill their need?

Options

  • A. context length 514: smallest model is 0.44GB and embedding dimension 768
  • B. context length 2048: smallest model is 11GB and embedding dimension 2560
  • C. context length 32768: smallest model is 14GB and embedding dimension 4096
  • D. context length 512: smallest model is 0.13GB and embedding dimension 384
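
The selection logic behind this question can be sketched in a few lines of Python. This is a minimal illustration, not an official method: it assumes the four candidates are A (514, 0.44GB, 768), B (2048, 11GB, 2560), C (32768, 14GB, 4096), and D (512, 0.13GB, 384), and the names `Candidate`, `cheapest_fit`, and `index_size_mb` are hypothetical helpers introduced here. The storage estimate assumes float32 vectors, no index overhead, and an illustrative corpus of one million chunks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    label: str
    context_length: int   # maximum input tokens the model accepts
    size_gb: float        # size of the smallest available model
    embedding_dim: int    # length of each output vector


def cheapest_fit(candidates, max_chunk_tokens):
    """Return the smallest model whose context length covers one full chunk."""
    fits = [c for c in candidates if c.context_length >= max_chunk_tokens]
    if not fits:
        raise ValueError("no candidate can embed a full chunk")
    # Prefer the smallest model; break ties on the lower embedding dimension.
    return min(fits, key=lambda c: (c.size_gb, c.embedding_dim))


def index_size_mb(n_chunks, embedding_dim, bytes_per_value=4):
    """Approximate raw vector storage (assumes float32, no index overhead)."""
    return n_chunks * embedding_dim * bytes_per_value / 1e6


# Candidate values are assumptions taken from the answer options.
options = [
    Candidate("A", 514, 0.44, 768),
    Candidate("B", 2048, 11.0, 2560),
    Candidate("C", 32768, 14.0, 4096),
    Candidate("D", 512, 0.13, 384),
]

best = cheapest_fit(options, max_chunk_tokens=512)
print(best.label)                                    # D
print(index_size_mb(1_000_000, best.embedding_dim))  # 1536.0
```

Every candidate can hold a 512-token chunk, so the context-length filter removes nothing here; the choice comes down to model size and embedding dimension, which drive serving cost, latency, and vector storage.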

Topics

#RAG System Design  #LLM Context Management  #Cost-Performance Tradeoffs  #Embedding Models
