experiment result #7

MrLinNing · 2018-07-30T09:20:01Z

Hello peterliht,
I ran your code according to the instructions without modifying any parameters, but my results differ greatly from the reported ones.
Did you change any parameters before releasing the code?
The following are my experimental results on resnet18:
python train.py --model_dir experiments/resnet18_distill/resnext_teacher

My experimental environment is:

python 3.5.2
pytorch 0.4.0
GPU  TITAN Xp


xht033 · 2018-08-28T03:42:41Z

Me too. There is a huge gap between my results and the author's reported ones.

ChuangbinC · 2018-09-28T06:22:58Z

@MrLinNing Did you get results close to the author's?

michaelklachko · 2018-09-29T23:08:49Z

I just ran an experiment on CIFAR-10, with the student being a simple LeNet-5-like network (64C - MP - 128C - MP - 400FC-10) and the teacher being a deeper version (128C-128C-MP-128C-128C-MP-128C-128C-512FC-10).

The teacher gets to ~93% accuracy, the student without KL is ~86.5%. With KL, the student gets to 87.5% consistently.

I didn't use this repo's code; I only copied the KL loss function into my own code.
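
For readers unfamiliar with the loss being discussed, here is a framework-free sketch of a common knowledge distillation loss: a temperature-softened KL term plus a hard-label cross-entropy term. It is not claimed to be this repo's exact function; the names `log_softmax` and `kd_loss`, and the default values of `alpha` and `temperature`, are assumptions chosen for illustration.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - log_total for z in scaled]


def kd_loss(student_logits, teacher_logits, label, alpha=0.9, temperature=4.0):
    """Distillation loss for a single sample (hypothetical helper).

    alpha weights the soft (teacher) term against the hard-label term;
    both defaults are illustrative assumptions, not the repo's settings.
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    # KL(teacher || student) on the temperature-softened distributions.
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))
    # Ordinary cross-entropy against the hard label (temperature 1).
    ce = -log_softmax(student_logits)[label]
    # The T^2 factor keeps the soft-term gradient scale comparable across T.
    return alpha * temperature ** 2 * kl + (1 - alpha) * ce


if __name__ == "__main__":
    print(kd_loss([2.0, 0.5, -1.0], [3.0, 0.0, -2.0], label=0))
```

Note that the soft term only helps if `teacher_logits` were computed on the same sample the student sees; otherwise the student is pulled toward an unrelated distribution.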

xiaowenmasfather · 2019-03-31T08:09:39Z

I found a roughly 10% gap too: 84%, nowhere near the expected 94.788%. Student net: Resnet-18, Teacher net: Resnext29. Parameters are the same as @peterliht's original settings.

wnma3mz · 2019-08-14T08:36:37Z

I also got similar results. train_set: 84.914%, test_set: 83.89%
The teacher model comes from the author's pretrained_teacher_models.zip\pretrained_teacher_models\base_resnext29\.
Testing this teacher model gives: train_set: 100%, test_set: 96.23%.
All other parameters match the author's.

haitongli · 2019-08-23T22:39:00Z

Based on another issue thread about the data loader, the accuracy inconsistency might be due to how the student and teacher models received their data when shuffling was used.
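
To illustrate how shuffling could cause this kind of mismatch, here is a toy sketch that assumes teacher outputs are cached by position in one shuffled pass and then reused in a second, differently shuffled pass. Whether this is exactly what happens in the repo is an assumption; `positional_agreement` and the toy data are hypothetical.

```python
import random


def positional_agreement(teacher_order, student_order, labels):
    """Fraction of positions where a perfect teacher's cached prediction
    matches the label of the sample the student actually sees there."""
    # A "perfect" toy teacher predicts labels[i] for sample i, but its
    # predictions are stored by position in teacher_order.
    cached = [labels[i] for i in teacher_order]
    hits = sum(cached[pos] == labels[i] for pos, i in enumerate(student_order))
    return hits / len(student_order)


if __name__ == "__main__":
    n = 1000
    labels = [i % 10 for i in range(n)]  # toy 10-class labels

    teacher_order = list(range(n))
    random.Random(0).shuffle(teacher_order)
    student_order = list(range(n))
    random.Random(1).shuffle(student_order)

    # Same order in both passes: cached outputs line up with the samples.
    print("aligned:   ", positional_agreement(teacher_order, teacher_order, labels))
    # Different shuffles: cached outputs belong to other samples.
    print("misaligned:", positional_agreement(teacher_order, student_order, labels))
```

Keying cached teacher outputs by sample index rather than by batch position, or running the teacher on each batch as it is drawn, avoids the dependence on shuffle order.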

haitongli · 2019-08-23T22:40:11Z

Has anyone run and tested this with PyTorch 0.3?

michaelklachko · 2019-08-23T23:01:37Z

@peterliht why would you want to use Pytorch 0.3? The current stable version is 1.2.

@wnma3mz @xiaowenmasfather Resnet-18 should get to 94.0% without any teachers. If that's not the case, then you're doing something wrong.

haitongli · 2019-08-23T23:07:59Z

> @peterliht why would you want to use Pytorch 0.3? The current stable version is 1.2.
>
> @wnma3mz @xiaowenmasfather Resnet-18 should get to 94.0% without any teachers. If that's not the case, then you're doing something wrong.

I understand there are newer (and more stable) versions of PyTorch available. I just wanted to know whether people have seen different results across PyTorch versions. When I first created this repo 2 years ago, v0.3 was used, as specified in requirements.txt. I want to better understand the issues that have prevented people from reproducing the results and see whether they can be fixed with the most stable PyTorch version.

wnma3mz · 2019-08-24T00:20:30Z

Hi
@michaelklachko
You're right. Resnet-18 with the author's hyperparameters can indeed reach 94%. So my question is: where is the problem? Has anyone run into the same problem and can help me?

@peterliht
Thanks for your suggestion, I will try it on version 0.3 later.

haitongli · 2019-08-24T00:22:27Z

@wnma3mz another thread might also be worth looking into @ #9 and also @ #4

wnma3mz · 2019-08-24T00:30:54Z

@peterliht
Thank you for your prompt reply. I have already seen this issue and changed the code according to this comment to ensure the distillation is correct.

forjiuzhou · 2019-09-12T14:43:24Z

> @wnma3mz another thread might also be worth looking into @ #9 and also @ #4

I compared the max index of the teacher's output with the label, and the two disagree with each other. I have submitted a pull request to fix this issue.
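
The check described above can be sketched as follows. This is a minimal, framework-free version, not the code from the pull request; `argmax` and `teacher_label_agreement` are hypothetical helpers, and outputs are assumed to be plain lists of per-class scores.

```python
def argmax(values):
    """Index of the largest value in a non-empty sequence."""
    return max(range(len(values)), key=values.__getitem__)


def teacher_label_agreement(teacher_outputs, labels):
    """Fraction of samples whose teacher argmax equals the given label."""
    if len(teacher_outputs) != len(labels):
        raise ValueError("teacher_outputs and labels must have equal length")
    hits = sum(argmax(out) == y for out, y in zip(teacher_outputs, labels))
    return hits / len(labels)


if __name__ == "__main__":
    # Toy batch: per-class scores for three samples and their labels.
    outputs = [[0.1, 0.9, 0.0], [0.7, 0.2, 0.1], [0.2, 0.3, 0.5]]
    labels = [1, 0, 2]
    print(teacher_label_agreement(outputs, labels))
```

Since the pretrained teacher was reported earlier in this thread at 100% train accuracy, agreement on the training batches fed to the student should be close to 1.0; a much lower value suggests the teacher outputs and labels belong to different samples.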

conditionWang · 2020-07-21T01:58:49Z

I ran into the same accuracy gap. After lowering the learning rate I observed an improvement, which brought my results close to those of Peterliht. You can try changing the learning rate and running the code again.

tianli · 2021-01-19T07:57:25Z

> > @wnma3mz another thread might also be worth looking into @ #9 and also @ #4
>
> I compare the max index of teacher's output with label, these two disagree with each other. I have commit a request to fix this issue.

Your request (#17) fixes the problem, and I am getting much improved results. I wonder why it has not been merged into master yet!

tianli · 2021-01-19T16:07:53Z

FYI, with pull request #17, I was able to get 95.19% accuracy on resnet18 with the resnext29 teacher.

haitongli · 2021-01-22T23:33:48Z

Thanks for all the discussions and the reminder from @tianli about the pull request. I haven't been able to keep track of this repo for a while. #17 has been merged.
