Skip to content

Tracker image input size vs. Detector image size #602

Answered by mikel-brostrom
adrian-soch asked this question in Q&A
Discussion options

You must be logged in to vote

Here are the results: https://github.com/mikel-brostrom/Yolov5_StrongSORT_OSNet/wiki/Effect-of-passing-full-image-input-vs-1280-re-scaled-to-StrongSORT-on-MOT17

1280 Full images
IDF1 70.199 72.399
MOTA 63.701 63.781
HOTA 55.69 56.783

IDF1 takes the largest hit by far, which makes sense. This behavior would be further accentuated by lowering the resolution even more in val.py. The execution time is barely affected by using the full image. This is because the detections are resized to 128x256 by the ReID preprocessing before they are fed to the ReID model.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by adrian-soch
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants