Update Embedding doc #6806

AndPuQing · 2024-08-03T09:03:43Z

相关链接

…evelop

paddle-bot · 2024-08-03T09:03:48Z

感谢你贡献飞桨文档，文档预览构建中，Docs-New 跑完后即可预览，预览链接：http://preview-pr-6806.paddle-docs-preview.paddlepaddle.org.cn/documentation/docs/zh/api/index_cn.html
预览工具的更多说明，请参考：飞桨文档预览工具

zhwesky2010 · 2024-08-05T03:43:02Z

docs/api/paddle/nn/Embedding_cn.rst

@@ -38,6 +38,8 @@ Embedding
    - **num_embeddings** (int) - 嵌入字典的大小，input 中的 id 必须满足 ``0 <= id < num_embeddings`` 。
    - **embedding_dim** (int) - 每个嵌入向量的维度。
    - **padding_idx** (int|long|None，可选) - padding_idx 的配置区间为 ``[-weight.shape[0], weight.shape[0]]``，如果配置了 padding_idx，那么在训练过程中遇到此 id 时，其参数及对应的梯度将会以 0 进行填充。
+    - **max_norm** (float，可选) - 如果给定，则每个范数大于 max_norm 的嵌入向量都会被重新规范化为具有范数 max_norm。注意，在动态图模式下，将会原位修改权重。默认值：None。


有一个其他Bug，不影响本文档合入。

在静态图下，是否会导致梯度无法计算，因为返回了一个新的weight。这个应该写为：

if max_norm: embedding_renorm_( x, weight, max_norm=max_norm, norm_type=norm_type )

保证是原来的weight参与组网计算与反向传播，返回一个新的weight，当前weight就从网络中分离了。这个静态图的问题需要修一下 @AndPuQing

这样的话，静态图的renorm 是不是就失效了

zhwesky2010 · 2024-08-05T03:43:59Z

...odel_convert/convert_from_pytorch/api_difference/functional/torch.nn.functional.embedding.md

@@ -9,7 +9,7 @@ torch.nn.functional.embedding(input, weight, padding_idx=None, max_norm=None, no
 ### [paddle.nn.functional.embedding](https://www.paddlepaddle.org.cn/documentation/docs/zh/api/paddle/nn/functional/embedding_cn.html#embedding)

 ```python
-paddle.nn.functional.embedding(x, weight, padding_idx=None, sparse=False, name=None)
+paddle.nn.functional.embedding(x, weight, padding_idx=None, max_norm=None, norm_type=2.0, sparse=False, name=None)


分类类别：torch参数更多->仅参数名不一致

这个的话，scale_grad_by_freq参数暂时还不支持，应该不用修改吧

zhwesky2010 · 2024-08-05T03:44:06Z

docs/guides/model_convert/convert_from_pytorch/api_difference/nn/torch.nn.Embedding.md

@@ -29,8 +31,8 @@ PyTorch 相比 Paddle 支持更多其他参数，具体如下：
 | num_embeddings     | num_embeddings            | 表示嵌入字典的大小。  |
 | embedding_dim     | embedding_dim            | 表示每个嵌入向量的维度。  |
 | padding_idx     | padding_idx            | 在此区间内的参数及对应的梯度将会以 0 进行填充  |
-| max_norm      | -            | 如果给定，Embeddding 向量的范数（范数的计算方式由 norm_type 决定）超过了 max_norm 这个界限，就要再进行归一化，Paddle 无此参数，暂无转写方式。  |
-| norm_type     | -            | 为 maxnorm 选项计算 p-范数的 p。默认值 2，Paddle 无此参数，暂无转写方式。  |
+| max_norm      | max_norm            | 如果给定，Embeddding 向量的范数（范数的计算方式由 norm_type 决定）超过了 max_norm 这个界限，就要再进行归一化。  |


分类类别：torch参数更多->仅参数名不一致

AndPuQing added 3 commits July 12, 2024 00:28

add weight and density options for histogram

e61b346

Merge branch 'develop' of https://github.com/PaddlePaddle/docs into d…

977b997

…evelop

fix embedding doc

57f7751

paddle-bot bot added the contributor label Aug 3, 2024

luotao1 assigned luotao1 and zhwesky2010 Aug 5, 2024

luotao1 added the PaddlePaddle Hackathon 飞桨黑客松活动issue与PR label Aug 5, 2024

zhwesky2010 reviewed Aug 5, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Embedding doc #6806

Update Embedding doc #6806

AndPuQing commented Aug 3, 2024 •

edited

Loading

paddle-bot bot commented Aug 3, 2024

zhwesky2010 Aug 5, 2024

AndPuQing Aug 10, 2024

zhwesky2010 Aug 5, 2024

AndPuQing Aug 10, 2024

zhwesky2010 Aug 5, 2024

Update Embedding doc #6806

Are you sure you want to change the base?

Update Embedding doc #6806

Conversation

AndPuQing commented Aug 3, 2024 • edited Loading

paddle-bot bot commented Aug 3, 2024

zhwesky2010 Aug 5, 2024

Choose a reason for hiding this comment

AndPuQing Aug 10, 2024

Choose a reason for hiding this comment

zhwesky2010 Aug 5, 2024

Choose a reason for hiding this comment

AndPuQing Aug 10, 2024

Choose a reason for hiding this comment

zhwesky2010 Aug 5, 2024

Choose a reason for hiding this comment

AndPuQing commented Aug 3, 2024 •

edited

Loading