Skip to content

Commit

Permalink
enable apply_vtensor_concat_pass in predictor.py
Browse files Browse the repository at this point in the history
  • Loading branch information
lszxb committed Sep 16, 2024
1 parent c174b8f commit b13e79d
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions llm/predict/predictor.py
Original file line number Diff line number Diff line change
Expand Up @@ -331,6 +331,7 @@ def __init__(self, config: PredictorArgument, tokenizer: PretrainedTokenizer = N
import paddlenlp_ops
inference_config.enable_custom_passes([
"remove_assign_out_pass", # remove the assign_out_ op at the end of while loop
"apply_vtensor_concat_pass", # replace concat op with vtensor implementation
])

with static_mode_guard():
Expand Down

0 comments on commit b13e79d

Please sign in to comment.