Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to align scene_ids in data_part2_scene.json from those in ScanNet? #23

Closed
ch3cook-fdu opened this issue Sep 2, 2023 · 9 comments
Closed

Comments

@ch3cook-fdu
Copy link

I notice that the scene_ids in data_part2_scene.json have a dtype of int, while the scene_ids in scannet are something like scene0000_00. So, how do I align these two datasets? Or how can I split the scannet annotations from that file by myself?

@evelinehong
Copy link
Collaborator

@cy94
Copy link

cy94 commented Nov 30, 2023

Hello,
I used the final_scene_map_dict_scan_v2.json file but it doesnt contain scene ID mappings for all the annotations in data_part2_scene_v2.json. Is there are a different scene ID mapping that I can use?

@ch3cook-fdu
Copy link
Author

Hello, I used the final_scene_map_dict_scan_v2.json file but it doesnt contain scene ID mappings for all the annotations in data_part2_scene_v2.json. Is there are a different scene ID mapping that I can use?

I think the scannet part is only part of data_part2_scene_v2.json. However, the annotations is quite noisy, and contain many duplicates...

@cy94
Copy link

cy94 commented Dec 1, 2023

Thanks! Out of 202k train annotations about 60k could be matched with ScanNet IDs, is that correct? and are the rest from Habitat-3D?

@ch3cook-fdu
Copy link
Author

Thanks! Out of 202k train annotations about 60k could be matched with ScanNet IDs, is that correct? and are the rest from Habitat-3D?

I think the precise count would be 57,434?

@cy94
Copy link

cy94 commented Dec 1, 2023

I found 60582 in data_part2_scene_v2.json, is that the right file to use?

@ch3cook-fdu
Copy link
Author

I am using the following code for filtering:

import json
mapping_file = json.load(open('final_scene_map_dict_scan_v2.json', 'r'))
data = json.load(open('data_part2_scene.json', 'r'))
scannet_annotations = []
corpus = set()
for anno in data:
    scannet_scene_id = mapping_file.get(str(anno['scene_id']), None)
    if scannet_scene_id is None:
        continue
    scannet_annotations.append(anno)
print(len(scannet_annotations))

@cy94
Copy link

cy94 commented Dec 1, 2023

I see, I'm using data_part2_scene_v2.json, I didn't see v1 (data_part2_scene.json) anywhere on your Google Drive. Maybe thats why there are more?

@ch3cook-fdu
Copy link
Author

That might be the case!

This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants