NCA-AIIO Question #9: Real Exam Question with Answer & Explanation
The correct answer is C: Transformer model.
Question
Which architecture is the core concept behind large language models?
Options
- A. BERT Large model
- B. State space model
- C. Transformer model
- D. Attention model
Explanation
The Transformer is the foundational architecture of modern large language models (LLMs). Introduced in the 2017 paper "Attention Is All You Need," it stacks layers of self-attention and feed-forward networks, often in encoder-decoder or decoder-only configurations, to capture long-range dependencies in text efficiently. BERT (option A) is one specific Transformer-based model, and attention (option D) is a component of the Transformer rather than a complete architecture, so neither is the core concept. State space models (option B) are an alternative approach to sequence modeling, not the primary basis for LLMs.
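To make the self-attention idea concrete, here is a minimal sketch in plain Python of scaled dot-product self-attention, the operation at the heart of each Transformer layer. It is illustrative only: the function names `softmax` and `self_attention` are hypothetical, and it assumes the learned query, key, and value projections are the identity, so each token vector serves as its own query, key, and value. A real Transformer layer adds learned projections, multiple heads, residual connections, normalization, and the feed-forward network.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, causal=False):
    """Scaled dot-product self-attention over a sequence of vectors.

    Assumption for illustration: queries, keys, and values are all `x`
    itself (the learned projection matrices are omitted).
    With causal=True, position i may only attend to positions <= i,
    as in decoder-only models.
    """
    d = len(x[0])
    outputs, weights = [], []
    for i, query in enumerate(x):
        scores = []
        for j, key in enumerate(x):
            if causal and j > i:
                # Mask out future positions so they get zero weight.
                scores.append(float("-inf"))
            else:
                dot = sum(q * k for q, k in zip(query, key))
                scores.append(dot / math.sqrt(d))
        row = softmax(scores)
        weights.append(row)
        # Each output is a weighted average of all value vectors.
        outputs.append([sum(w * v[c] for w, v in zip(row, x)) for c in range(d)])
    return outputs, weights


# Three toy "token" vectors of dimension 2 (made-up values).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens, causal=True)
for row in weights:
    print([round(w, 3) for w in row])
```

Every position computes a weight for every other position in a single step, which is why the architecture can relate distant tokens directly. With `causal=True`, the first token can attend only to itself, so its weight row is `[1.0, 0.0, 0.0]`.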