open-machine/FlashNorm
FlashNorm is an exact but faster implementation of RMSNorm followed by linear layers, a pattern common in transformer-based models.
RMSNorm is used by many LLMs such as Llama, Mistral, and OpenELM. This paper details FlashNorm, which is an exact but faster implementation of RMSNorm followed by linear layers. See https://huggingface.co/open-machine/FlashNorm for code and more transformer tricks.
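As a rough illustration of how an implementation of RMSNorm followed by a linear layer can be both exact and cheaper, the sketch below folds the normalization weights into the linear weights ahead of time and applies the 1/RMS scaling after the matrix product. This is an assumption about the general idea, not code from the linked repository. The function names (`rmsnorm_then_linear`, `fold_weights`, `folded_rmsnorm_linear`), the `eps` parameter, and the toy values are hypothetical, and the linear layer is assumed to have no bias.

```python
import math

def rmsnorm_then_linear(x, g, W, eps=0.0):
    """Reference path: normalize x by its RMS, scale by g, then multiply by W.

    W has shape (n_in, n_out) as a list of rows; no bias is assumed.
    """
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    z = [v / rms * gi for v, gi in zip(x, g)]
    return [sum(z[i] * W[i][j] for i in range(len(x))) for j in range(len(W[0]))]

def fold_weights(g, W):
    """Precompute W_star[i][j] = g[i] * W[i][j] once, before inference."""
    return [[gi * w for w in row] for gi, row in zip(g, W)]

def folded_rmsnorm_linear(x, W_star, eps=0.0):
    """Folded path: multiply the raw input by W_star, then divide by the RMS."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    y = [sum(x[i] * W_star[i][j] for i in range(len(x))) for j in range(len(W_star[0]))]
    return [v / rms for v in y]

# Toy values chosen only for illustration.
x = [1.0, -2.0, 3.0, 0.5]
g = [0.9, 1.1, 1.0, 0.8]
W = [[0.2, -0.1], [0.4, 0.3], [-0.5, 0.6], [0.1, 0.7]]

ref = rmsnorm_then_linear(x, g, W)
fast = folded_rmsnorm_linear(x, fold_weights(g, W))
```

The two paths are algebraically identical because the 1/RMS factor is a scalar per input vector and commutes with the matrix product. In floating point they agree up to rounding. In this sketch the folded path skips the per-element multiply by `g` at inference time.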
Get this paper in your agent:
hf papers read 2407.09577
curl -LsSf https://hf.co/cli/install.sh | bash