I am also interested in how the location tokens are used for the 3D grounding task.
For now, the code only shows how they are initialised; it is not clear how a token is mapped to 3D space.
For example, the output may look like: the sofa is at <location 12>, <location 1>, <location 37>. But what does this mean? Does each location token correspond to an XYZ position, or to a scalar value?
I hope the author can explain this in more detail!
Hello, how do you get the grounding outputs? Could you tell me the format of your prompts? I couldn't find it in the paper or in the project. Thanks a lot!
Dear author:
Thanks for your interesting work.
I have two questions about the 3D Localization Mechanism:
By the way, I wonder when you will release the detailed data and code for grounding? I am really looking forward to it.
Best!
Xiaolong