Implement LayerNormalization fusion #280
Merged
This fuses subgraphs like the one below (excluding the "Add" in the top-left corner) into a LayerNormalization operator:

[diagram of the decomposed LayerNormalization subgraph]

So far this has been tested with BERT, GPT-2 and Whisper models.
This is a rather complex fusion and, as a result, is liable to be brittle: other equivalent ways of expressing the operator will not match. It also doesn't help with operators that are variations of this, such as RMSNorm (used in Llama and others). In future it would probably make sense to break this up so that the individual sub-steps (centering, variance normalization, shift + scale) are fused separately.
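For reference, here is a minimal NumPy sketch of the decomposed pattern this fusion is intended to match. The function name and the exact op sequence are illustrative; exporters may emit equivalent but differently structured graphs, which is why the match is brittle:

```python
import numpy as np

def decomposed_layer_norm(x, scale, bias, eps=1e-5):
    # Un-fused pattern as typically exported to ONNX:
    # ReduceMean -> Sub -> Pow -> ReduceMean -> Add(eps) -> Sqrt -> Div -> Mul -> Add
    mean = x.mean(axis=-1, keepdims=True)       # centering
    centered = x - mean
    var = (centered ** 2).mean(axis=-1, keepdims=True)
    normalized = centered / np.sqrt(var + eps)  # variance normalization
    return normalized * scale + bias            # shift + scale

# After fusion, this whole subgraph is replaced by a single
# LayerNormalization(x, scale, bias, epsilon=eps) operator
# producing identical output.
#
# RMSNorm (not handled by this fusion) differs in that it skips the
# mean-subtraction step and normalizes by sqrt(mean(x**2) + eps).

x = np.random.randn(2, 8).astype(np.float32)
scale = np.ones(8, dtype=np.float32)
bias = np.zeros(8, dtype=np.float32)
print(decomposed_layer_norm(x, scale, bias).shape)  # (2, 8)
```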
TODO: