Override cast_to_fp8 in te.module.linear #120

tocean · 2023-11-03T02:09:38Z

Description
Fix a bug in TE integration.
Currently we only override te.cpp_extensions.cast_to_fp8 with our own cat_to_fp8 in msamp.te.extension.

We also need to override te.module.linear.cast_to_fp8. Otherwise, it will use the original function which does not support ScalingTensor and will raise an exception in Megatron-LM

Major Revision

Override cast_to_fp8 in te.module.linear

wkcn

LGTM. Thanks!

fix bug with te

f737e5c

wkcn approved these changes Nov 3, 2023

View reviewed changes

tocean requested a review from guoshzhao November 3, 2023 03:20

guoshzhao approved these changes Nov 3, 2023

View reviewed changes

wkcn merged commit 3b0567a into main Nov 3, 2023
9 checks passed

wkcn deleted the yuxiang/te_bugbix branch November 3, 2023 05:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Override cast_to_fp8 in te.module.linear #120

Override cast_to_fp8 in te.module.linear #120

tocean commented Nov 3, 2023

wkcn left a comment

Override cast_to_fp8 in te.module.linear #120

Override cast_to_fp8 in te.module.linear #120

Conversation

tocean commented Nov 3, 2023

wkcn left a comment

Choose a reason for hiding this comment