
[Bug fixes] update attribute map handler #4421

Merged: 4 commits into PaddlePaddle:develop on Jan 11, 2023

Conversation

@wj-Mcat (Contributor) commented on Jan 11, 2023

PR types

Bug fixes

PR changes

APIs

Description

Update the attribute mapping in the configuration_utils module.

Attempts to fix #4384.
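For context, the linked issue reports that Mengzi T5 checkpoints fail to load. A hypothetical repro along these lines (the model id is an assumption based on the issue title, not taken from this PR):

```python
from paddlenlp.transformers import T5ForConditionalGeneration

# Hypothetical repro for the linked issue; the community model id
# "Langboat/mengzi-t5-base" is assumed from the issue title.
model = T5ForConditionalGeneration.from_pretrained("Langboat/mengzi-t5-base")
```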

@paddle-bot bot commented on Jan 11, 2023

Thanks for your contribution!

@codecov bot commented on Jan 11, 2023

Codecov Report

Merging #4421 (4504364) into develop (94b66c3) will decrease coverage by 0.02%.
The diff coverage is 83.33%.

```diff
@@             Coverage Diff             @@
##           develop    #4421      +/-   ##
===========================================
- Coverage    39.65%   39.62%   -0.03%
===========================================
  Files          433      433
  Lines        60936    60983      +47
===========================================
+ Hits         24163    24167       +4
- Misses       36773    36816      +43
```

| Impacted Files | Coverage Δ |
|---|---|
| paddlenlp/transformers/configuration_utils.py | 68.01% <83.33%> (+0.08%) ⬆️ |
| paddlenlp/trainer/trainer.py | 59.73% <0.00%> (-0.62%) ⬇️ |
| paddlenlp/transformers/ofa_utils.py | 7.97% <0.00%> (-0.26%) ⬇️ |
| paddlenlp/trainer/trainer_compress.py | 8.94% <0.00%> (+<0.01%) ⬆️ |
| paddlenlp/trainer/compression_args.py | 52.72% <0.00%> (+1.78%) ⬆️ |


Comment on lines -732 to -741

```python
# do standard config map: there are some old-school pretrained-config not refactored.
config_dict = convert_to_legacy_config(cls.attribute_map, config_dict)

config_dict = flatten_model_config(config_dict)
if "model_type" in config_dict and hasattr(cls, "model_type") and config_dict["model_type"] != cls.model_type:
    logger.warning(
        f"You are using a model of type {config_dict['model_type']} to instantiate a model of type "
        f"{cls.model_type}. This is not supported for all configurations of models and can yield errors."
    )
```

@wj-Mcat (Contributor, Author) commented:
convert_to_legacy_config and flatten_model_config have been moved into the from_dict function, because from_dict is the function that from_pretrained calls.
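A minimal sketch of the resulting shape, assuming a simplified from_dict (the real PaddleNLP method also handles kwargs and other bookkeeping; this is illustrative only):

```python
@classmethod
def from_dict(cls, config_dict: dict, **kwargs):
    # Sketch only: normalize legacy layouts before building the config object.
    # convert_to_legacy_config renames old-style keys using cls.attribute_map;
    # flatten_model_config lifts parameters nested under init_args to the top level.
    config_dict = convert_to_legacy_config(cls.attribute_map, config_dict)
    config_dict = flatten_model_config(config_dict)
    return cls(**config_dict, **kwargs)
```

With both steps inside from_dict, every code path that goes through from_pretrained gets the legacy-key handling for free.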

@wj-Mcat marked this pull request as ready for review on January 11, 2023 at 06:50.
Comment on lines +201 to +203

```python
value = config.pop(standard_field, None) or config.pop(paddle_field, None)
if value is not None:
    config[paddle_field] = value
```
@wj-Mcat (Contributor, Author) commented:

Problem

Every value in attribute_map was being mapped onto the top-level config, always as target_paddle_field: None. As a result, parameters nested under init_args could not be mapped up correctly, e.g. d_model.

Why this change

  • Old-style model_config.json:
```json
{
    "init_args": [
        {
            "tie_word_embeddings": false,
            "pad_token_id": 0,
            "bos_token_id": 0,
            "eos_token_id": 1,
            "vocab_size": 32128,
            "d_model": 768,
            "d_kv": 64,
            "d_ff": 2048,
            "num_layers": 12,
            "num_decoder_layers": 12,
            "num_heads": 12,
            "relative_attention_num_buckets": 32,
            "dropout_rate": 0.1,
            "layer_norm_epsilon": 1e-06,
            "initializer_factor": 1.0,
            "feed_forward_proj": "gated-gelu",
            "init_class": "T5Model"
        }
    ],
    "init_class": "T5ForConditionalGeneration"
}
```

In the old-style config file, the model parameters live inside init_args. This module first calls convert_to_legacy_config recursively (depth-first) to map the model parameters, which may map hidden_size -> d_model inside init_args.

However, once that pass finished, the old code would set d_model: None at the top level, so flatten_model_config had no way to map the correct d_model from init_args back up.

The new approach fixes this problem; see the standalone sketch after this list.

  • New-style config.json:
```json
{
  "architectures": [
    "T5ForConditionalGeneration"
  ],
  "bos_token_id": 0,
  "d_ff": 2048,
  "d_kv": 64,
  "d_model": 768,
  "dropout_rate": 0.1,
  "enable_recompute": false,
  "eos_token_id": 1,
  "feed_forward_proj": "gated-gelu",
  "initializer_factor": 1.0,
  "is_encoder_decoder": true,
  "layer_norm_epsilon": 1e-06,
  "model_type": "t5",
  "num_decoder_layers": 12,
  "num_heads": 12,
  "num_layers": 12,
  "pad_token_id": 0,
  "paddlenlp_version": null,
  "relative_attention_max_distance": 128,
  "relative_attention_num_buckets": 32,
  "tie_word_embeddings": false,
  "use_cache": true,
  "vocab_size": 32128
}
```

Since there is no init_args key here, the mapping can be applied to the model parameters directly.
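A self-contained sketch of the behavior described above (the helper name and the toy attribute_map are illustrative, not the actual PaddleNLP internals):

```python
# Hypothetical standalone demo of the fixed mapping rule applied to one dict level.
def apply_attribute_map(config: dict, attribute_map: dict) -> dict:
    for paddle_field, standard_field in attribute_map.items():
        # Prefer the standard (HF-style) key, fall back to the paddle key,
        # and only write the paddle key when a real value was found. The old
        # code wrote `paddle_field: None` unconditionally at the top level.
        value = config.pop(standard_field, None) or config.pop(paddle_field, None)
        if value is not None:
            config[paddle_field] = value
    return config

attribute_map = {"d_model": "hidden_size"}

# Old-style layout: the real value sits inside init_args.
config = {"init_args": [{"hidden_size": 768}], "init_class": "T5Model"}

apply_attribute_map(config, attribute_map)  # top level untouched: no d_model: None
config["init_args"][0] = apply_attribute_map(config["init_args"][0], attribute_map)

# The nested value survives, so a later flatten step can lift it to the top level.
assert config["init_args"][0]["d_model"] == 768
```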

@sijunhe (Collaborator) left a comment:

lgtm

@sijunhe merged commit 22de327 into PaddlePaddle:develop on Jan 11, 2023.
Successfully merging this pull request may close these issues:

[Bug]: Can't load Mengzi t5 models (#4384)