The paper introduces the 'transformer' model, which consists of an encoder and a decoder. It also introduces several concepts that have since become foundational, such as positional encoding of the input.
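As a rough illustration of positional input encoding, the sketch below builds a table of sinusoidal position vectors that can be added element-wise to token embeddings. It assumes the fixed sine/cosine scheme usually associated with the original transformer design; the summary above does not say which variant is meant, and the function name, the `base` value of 10000, and the even `d_model` restriction are choices made for this example.

```python
import math


def sinusoidal_positional_encoding(seq_len, d_model, base=10000.0):
    """Return a seq_len x d_model table (list of lists) of position encodings.

    Assumption: the fixed sinusoidal scheme, where each pair of dimensions
    (i, i + 1) shares one wavelength and holds sin and cos of the same angle.
    """
    if d_model % 2 != 0:
        raise ValueError("this sketch assumes an even d_model")

    table = []
    for pos in range(seq_len):
        row = []
        # i walks over the even dimension indices 0, 2, 4, ...
        for i in range(0, d_model, 2):
            angle = pos / (base ** (i / d_model))
            row.append(math.sin(angle))  # even dimension
            row.append(math.cos(angle))  # odd dimension
        table.append(row)
    return table


def add_positions(embeddings):
    """Add position encodings element-wise to a list of embedding vectors."""
    seq_len = len(embeddings)
    d_model = len(embeddings[0])
    pe = sinusoidal_positional_encoding(seq_len, d_model)
    return [
        [e + p for e, p in zip(emb_row, pe_row)]
        for emb_row, pe_row in zip(embeddings, pe)
    ]
```

Because position 0 always encodes to alternating 0.0 and 1.0 values, and later positions produce distinct vectors, the model can tell tokens apart by position even though the encoder and decoder have no built-in notion of order.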