Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About reproducing #11

Open
liujikun opened this issue Jul 13, 2021 · 4 comments
Open

About reproducing #11

liujikun opened this issue Jul 13, 2021 · 4 comments

Comments

@liujikun
Copy link

Hi,

Thank you for your great work! Have you ever cleaned the lhq dataset? I used the lhq_1024_jpg dataset to reproduce the effect, and the FID can only reach 9. I tried to fine-tune on the open source model, but the initial FID was as high as 24.

Best,
JiKun

@universome
Copy link
Owner

Hi! I am sorry for answering that late. Yes, we indeed preprocessed the dataset (as specified in the Experiments section) with the procedure described in Section 3.3/Appendix C/Algorithm 1.

Here is the script that we used to preprocess the datasets (we use a threshold of 0.95 for LHQ).

Also note, that in our Table 1, we provide results for the 256x256 resolution, not for 1024x1024 (for 1024x1024, our model achieved FID/∞-FID of 10.11/10.53).

That's strange that the initial FID was so high. But maybe those "unconnectable" images (that are removed by our preprocessing procedure) really change the distribution so much.

@universome
Copy link
Owner

Hmm, the currently released dataset has images sorted by their InceptionV3 likelihood (from least to most probable). I think in the above script, one should also shuffle the images. I will update it

@Muccul
Copy link

Muccul commented Mar 19, 2022

@universome Hi, some confusion about your jupyter of preprocessing.
LHQ256 dataset contain 90k data, and rest contain 70k after split(15k train, 5k val), but in your jupyer file, why filtering number is 167k with 0.99 threshold?
image

@Muccul
Copy link

Muccul commented Mar 19, 2022

@universome Hi, some confusion about your jupyter of preprocessing. LHQ256 dataset contain 90k data, and rest contain 70k after split(15k train, 5k val), but in your jupyer file, why filtering number is 167k with 0.99 threshold? image

btw, if we get a subset of data, How to divide it into training set and test set ? and calculate fid just on test set in subset for your paper result?
Looking forward to your reply, thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants