AIP-C01 Question #15: Real Exam Question with Answer & Explanation
The correct answer is A: configure hybrid search by combining vector similarity with keyword matching.
Question
A medical company is building a generative AI (GenAI) application that uses Retrieval Augmented Generation (RAG) to provide evidence-based medical information. The application uses Amazon OpenSearch Service to retrieve vector embeddings. Users report that searches frequently miss results that contain exact medical terms and acronyms and return too many semantically similar but irrelevant documents. The company needs to improve retrieval quality and maintain low end-user latency, even as the document collection grows to millions of documents. Which solution will meet these requirements with the LEAST operational overhead?
Options
- A. Configure hybrid search by combining vector similarity with keyword matching to improve
- B. Increase the dimensions of the vector embeddings from 384 to 1536. Use a post-processing AWS
- C. Replace OpenSearch Service with Amazon Kendra. Use query expansion to handle medical
- D. Implement a two-stage retrieval architecture in which initial vector search results are re-ranked by
Explanation
Option A is correct because hybrid search directly addresses both retrieval failure modes described in the question while keeping latency low and operational overhead minimal. In medical and scientific domains, exact terminology, abbreviations, and acronyms (for example, drug names, procedures, or conditions) are critical. Pure vector similarity search often underweights these exact matches, so it misses relevant results and returns too many documents that are semantically related but irrelevant.

Amazon OpenSearch Service natively supports hybrid search, which combines keyword-based retrieval (such as BM25) with vector similarity search. Keyword search provides precise matching for exact terms and acronyms, while vector search captures semantic meaning and contextual similarity. Blending the two improves both precision and recall without introducing additional infrastructure.

Hybrid search operates within the same OpenSearch index and query path, which preserves low end-user latency even as the document collection grows to millions of documents. Because OpenSearch handles scoring and ranking internally, no additional orchestration layers or post-processing steps are required.
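As a rough illustration of the explanation above, the sketch below builds the two request bodies a hybrid search setup in OpenSearch typically needs: a search pipeline that uses the normalization processor, and a `hybrid` query that pairs a BM25 `match` clause with a `knn` clause. The field names (`body`, `embedding`), the weights, and the helper function names are assumptions made for this example and do not come from the question. The `blend` helper is a simplified stand-in for min-max normalization followed by a weighted arithmetic mean; it is not OpenSearch's actual implementation and only shows why an exact acronym match can outrank a document that is merely semantically close.

```python
def build_search_pipeline(keyword_weight=0.3, vector_weight=0.7):
    """Body for PUT /_search/pipeline/<name> (weights are illustrative)."""
    return {
        "description": "Blend BM25 and k-NN scores for hybrid search",
        "phase_results_processors": [
            {
                "normalization-processor": {
                    "normalization": {"technique": "min_max"},
                    "combination": {
                        "technique": "arithmetic_mean",
                        # Weights follow the order of the sub-queries below.
                        "parameters": {"weights": [keyword_weight, vector_weight]},
                    },
                }
            }
        ],
    }


def build_hybrid_query(query_text, query_vector, k=10,
                       text_field="body", vector_field="embedding"):
    """Body for GET /<index>/_search?search_pipeline=<name>.

    text_field and vector_field are hypothetical mapping names.
    """
    return {
        "size": k,
        "query": {
            "hybrid": {
                "queries": [
                    # Keyword (BM25) sub-query: rewards exact terms and acronyms.
                    {"match": {text_field: {"query": query_text}}},
                    # Vector sub-query: rewards semantic similarity.
                    {"knn": {vector_field: {"vector": query_vector, "k": k}}},
                ]
            }
        },
    }


def min_max(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def blend(keyword_scores, vector_scores, keyword_weight=0.3, vector_weight=0.7):
    """Simplified fusion: weighted mean of min-max normalized scores.

    A document missing from one result list contributes 0.0 for that list.
    Returns document ids ordered from best to worst combined score.
    """
    kw, vec = min_max(keyword_scores), min_max(vector_scores)
    total = keyword_weight + vector_weight
    combined = {
        doc: (keyword_weight * kw.get(doc, 0.0)
              + vector_weight * vec.get(doc, 0.0)) / total
        for doc in set(kw) | set(vec)
    }
    return sorted(combined, key=combined.get, reverse=True)
```

With made-up scores in which `doc_acronym` contains the exact acronym but ranks last on vector similarity, calling `blend({"doc_acronym": 12.0, "doc_related": 2.0}, {"doc_related": 0.90, "doc_vague": 0.88, "doc_acronym": 0.80}, keyword_weight=0.6, vector_weight=0.4)` places `doc_acronym` first, whereas vector scores alone would place it last. In practice the weights would be tuned against real relevance judgments.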