Which of the following best describes positional encoding in Transformers? 单项选择题
A
It encodes syntactic rules of grammar.
B
It stores semantic information in words.
C
It allows the model to ignore token order.
D
It helps the model understand the position of words in a sequence.
登录即可查看完整答案
我们收录了全球超50000道真实原题与详细解析,现在登录,立即获得答案。
类似问题
What is combined with the inputs (embeddings) to the transformer architecture that encodes contextual information that can be used by attention mechanisms to create embeddings with more context?
In the positional encoding of Transformers, sinusoidal functions are used with different formulas for odd and even indices, incorporating the term 10000^(2i/d_model). Analyze the following statements and choose the correct explanations for the effects of increasing or decreasing the constant 10000. See the formula below: Hint: Lec 19, Slides 29-32.
In a self-attention transformer network, which of the following is true for sinusoid-based positional encoding vectors
A project has a required return of 12.6 percent, an initial cash outflow of $42,100, and cash inflows of $16,500 in Year 1, $11,700 in Year 2, and $10,400 in Year 4. What is the net present value?
更多留学生实用工具
希望你的学习变得更简单
加入我们,立即解锁 海量真题 与 独家解析,让复习快人一步!