What dataset is used to train the pretrained model? #17

xieenze · 2019-09-02T09:04:04Z

Hello author:
I tried to train ASTER on Synth800K and Synth90K, with the same settings as the TPAMI paper. However, I cannot reproduce the accuracy reported in the paper or that of your pretrained model.
Can you tell me which datasets you used to train the pretrained model you provided? Thanks very much!

ayumiymk · 2019-09-03T05:31:19Z

Both the pytorch port and the tensorflow port are trained with Synth90k and SynthText, just as the paper described. Have you tried the tensorflow version?

xieenze · 2019-09-03T06:02:19Z

No, I didn't run the TensorFlow version. With this PyTorch version I used SynthText and Synth90k, but I failed to reach the accuracy you report in readme.md.
My accuracy is on average 1.5-2 lower than yours on all datasets.
Can you tell me about any tricks or details I may have overlooked that are needed to train ASTER to such high accuracy?
Thanks very much!

ayumiymk · 2019-09-03T06:19:54Z

Actually, I do not use any tricks to achieve this performance. All results reported in the README.md were obtained by training and testing with the given scripts.

ayumiymk · 2019-09-03T09:10:51Z

Hi @xieenze,

With the released model, can you obtain the results reported in the README.md?

xieenze · 2019-09-03T09:28:43Z

Yes, using your released model, I can achieve the same results as in the readme.
Maybe I need to check something, because I use 4 GPUs and modified the batch size and lr.
Thanks very much!

ayumiymk · 2019-09-06T05:33:24Z

Hi enze,

Have you reproduced the reported results? If yes, please let me know. I am curious about the effects of different hyper-parameters.

WenjiaWang0312 · 2019-09-06T05:47:35Z

Hi, I used 100 workers and a batch_size of 1024. I also adjusted the learning rate by hand whenever the accuracy plateaued. I have not reached the accuracies you report on the 7 test datasets; mine are approximately 1.0% to 2.0% lower.

ayumiymk · 2019-09-07T09:29:41Z

emmmm... why not try the given settings? I can not guarantee the same performance with different settings. @JasonBoy1

WenjiaWang0312 · 2019-09-07T14:22:37Z

OK, I will try that soon.

WenjiaWang0312 · 2019-09-07T14:22:50Z

Thank you very much.

WenjiaWang0312 · 2019-09-07T16:04:18Z

Hi, dear author. I found that the default batch_size in script/stn_att_rec.sh is different from the one in config.py. Which one did you use in your experiments?
And how many GPUs did you use?

ayumiymk · 2019-09-08T06:10:35Z

The arguments in config.py are just default values for definition. Actually, you should just run bash script/stn_att_rec.sh. All the settings are given in this script, including the number of GPUs (here 2 GPUs are used).
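To illustrate the point above, here is a minimal sketch of how command-line flags passed by a launch script take precedence over defaults defined in code. It assumes the options are declared with `argparse`; the flag names and all numeric values below are placeholders chosen for illustration, not the actual settings in config.py or script/stn_att_rec.sh.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare options with in-code defaults (placeholder values, not the repo's)."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--batch_size", type=int, default=128)
    parser.add_argument("--lr", type=float, default=1.0)
    return parser


# No flags passed: the defaults declared in code apply.
defaults = build_parser().parse_args([])

# Flags passed, as a launch script would do: they override the defaults.
launched = build_parser().parse_args(["--batch_size", "1024", "--lr", "0.5"])

print("default batch_size:", defaults.batch_size)
print("launched batch_size:", launched.batch_size)
```

So when comparing runs, the values that matter are the ones the script actually passes, not the defaults in the config file.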

minglii1998 · 2019-09-18T06:49:44Z

Hello there, I'd like to know whether any of you managed to reach the same performance ~
I ran bash script/stn_att_rec.sh with two GPUs, but my accuracy is about 1-2 points lower than ASTER (L2R) and much lower than ASTER.Pytorch. Which should I expect?

ayumiymk · 2019-09-18T12:42:36Z

Maybe you can try our TensorFlow implementation. If you cannot obtain the same results with either the TF or the PyTorch version, I suspect that something is wrong with your training data.

minglii1998 · 2019-09-23T02:09:03Z

Thanks for the reply~
I still can't reach that performance. Is there any chance you could provide the datasets in lmdb format? I would be really grateful~
Sorry to bother you!

ayumiymk · 2019-09-23T03:07:43Z

How does the TensorFlow implementation perform for you?

Also, I am sorry, but the training data is about 50 GB and I don't have a cloud drive that large. Please check your own data carefully.

minglii1998 · 2019-09-23T03:46:14Z

Thanks anyway, I will try the TF implementation~

ayumiymk · 2019-09-24T04:58:21Z

@minglii1998 If you make any progress, please let me know. Thanks!

minglii1998 · 2019-10-11T14:45:19Z

Hello there, I finally reached the performance that @ayumiymk reported, so there is certainly nothing wrong with this code.
The performance is almost the same, within about 0.1-0.3 higher or lower.
The main reason I couldn't match the accuracy was the datasets I used: my pre-processing of the ST dataset was wrong. Handling exceptions properly is very important when converting the data to lmdb format.
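The thread does not say which exceptions were involved, so the following is only a sketch of one plausible kind of check, under the assumption that some word boxes fall partly or fully outside the image or carry unusable labels. The helper names (`clip_box`, `collect_valid_samples`) and the sample values are hypothetical and are not taken from the repository; the actual lmdb writing step is left out.

```python
from typing import List, Optional, Sequence, Tuple

IntBox = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max)


def clip_box(box: Sequence[float], width: int, height: int) -> Optional[IntBox]:
    """Clip a word box to the image; return None if no usable area remains."""
    x0, y0, x1, y1 = box
    x0, y0 = max(0, int(x0)), max(0, int(y0))
    x1, y1 = min(width, int(x1)), min(height, int(y1))
    if x1 <= x0 or y1 <= y0:
        return None  # the crop would be empty
    return x0, y0, x1, y1


def collect_valid_samples(samples, width: int, height: int):
    """Keep samples with a usable box and a non-blank label; count the rest."""
    kept: List[Tuple[IntBox, str]] = []
    skipped = 0
    for box, label in samples:
        try:
            clipped = clip_box(box, width, height)
        except (TypeError, ValueError):
            clipped = None  # malformed coordinates, e.g. NaN or wrong length
        if clipped is None or not isinstance(label, str) or not label.strip():
            skipped += 1
            continue
        kept.append((clipped, label))
    return kept, skipped


samples = [
    ((10, 5, 50, 20), "hello"),        # fine
    ((-8, 5, 30, 20), "edge"),         # partly outside: clipped and kept
    ((120, 5, 140, 20), "outside"),    # fully outside: skipped
    ((10, 5, 50, 20), "  "),           # blank label: skipped
    ((float("nan"), 0, 5, 5), "bad"),  # malformed box: skipped
]
kept, skipped = collect_valid_samples(samples, width=100, height=32)
print(len(kept), "kept,", skipped, "skipped")
```

Counting the skipped samples, rather than silently dropping them or letting one bad sample abort the conversion, makes it easier to compare the final dataset size against what others report.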

WenjiaWang0312 · 2019-10-12T03:50:27Z

Dear MingLii:
Could you tell me the right way to handle the exceptions in the ST dataset? The accuracy of my run is about 2%-3% lower than what the author reports.

Pay20Y · 2019-10-17T04:44:38Z

How many images did you crop from ST800K? Thanks! @minglii1998

ofexe · 2019-10-17T09:00:34Z

@ayumiymk Could you share the scripts that you used to create the lmdb files from the Synth90k and SynthText datasets? That would be helpful, thank you.

WenjiaWang0312 · 2019-10-18T01:45:13Z

Dear author:
Could you share the right way to handle the exceptions in the ST dataset, as @minglii1998 mentioned?

minglii1998 · 2019-10-18T13:53:03Z

@JasonBoy1 @Pay20Y @ofexe
Apologies for not replying. I was so busy that even though I received the emails from git, I couldn't respond. The author shared the datasets he used with me, and I am sharing them here with his permission.
You can download them from here, code: wi05
I hope all of you get good results!
@ayumiymk Hi bro, maybe it's about time to close this issue. Thanks for sharing!

Pay20Y · 2019-10-19T01:12:20Z

@minglii1998 thanks a lot!

ofexe · 2019-10-22T06:40:21Z

@minglii1998 I've already made my own datasets, but still thanks.

YooWang · 2020-05-26T14:07:00Z

Hello, can you show me how to train from scratch?

xieenze closed this as completed Oct 22, 2019

ayumiymk mentioned this issue Jan 11, 2020

Hello, is there a link to download the training dataset? #42

Closed

Pay20Y mentioned this issue Jan 15, 2020

How can I reproduce ICDAR13 acc? Pay20Y/SAR_TF#21

Open

This issue was closed.
