Papers
Collection
Large Language Model (LLM) and NLP related papers. • 354 items • Updated • 16
Extending the decoder Transformer with unsupervised learned random latent variables enhances performance on downstream tasks.
We propose an extension of the decoder Transformer that conditions its generative process on random latent variables which are learned without supervision thanks to a variational procedure. Experimental evaluations show that allowing such a conditioning translates into substantial improvements on downstream tasks.
Get this paper in your agent:
hf papers read 2510.17558 curl -LsSf https://hf.co/cli/install.sh | bash No model linking this paper
No dataset linking this paper
No Space linking this paper