"self-attention layer" Papers

4 papers found