Description
📚 Documentation
The README https://github.com/pytorch/examples/blob/main/imagenet/README.md is very helpful when getting started with training AlexNet.
We are able to successfully train AlexNet to approximately 56% top-1 and 79% top-5 accuracy on the validation set, but this is still a fair bit below Krizhevsky's published results of roughly 83% to 85% top-5 accuracy on the same dataset.
We are training AlexNet with the single-GPU defaults recommended in the README:
python main.py -a alexnet --lr 0.01 --gpu 0 /data/datasets/imagenet/
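For reference, here is a minimal sketch of the optimizer and schedule that command implies, assuming the script's documented defaults (momentum 0.9, weight decay 1e-4, 90 epochs, learning rate stepped down by 10x every 30 epochs); only the learning rate is overridden to 0.01 on the command line:

import torch
import torchvision.models as models

# Rough sketch of what main.py sets up for this run (an assumption based on
# the script's defaults, not copied from the repo).
model = models.alexnet()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01,
                            momentum=0.9, weight_decay=1e-4)
# Decay the learning rate by 10x every 30 epochs, for 90 epochs total.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)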
What out-of-the-box accuracy should we expect when training AlexNet on ImageNet with the default PyTorch implementation?
What sort of hyperparameter changes do you recommend to reproduce Alex Krizhevsky's accuracies?
Activity
mostafaelhoushi commented on May 9, 2022
Just quoting from this blog article:
mostafaelhoushi commented on May 9, 2022
Maybe try those hyperparameters, and if they lead to the expected accuracy, perhaps create a pull request to update the README file accordingly?
msaroufim commented on Jul 10, 2022
So far our tests aren't in a place where we can guarantee model performance. A case could be made that maybe we should, but so far we don't have any plans to do so.
mostafaelhoushi commented on Jul 25, 2022
I came across TorchDrift https://torchdrift.org/ (it is listed under the PyTorch ecosystem).
It sounds like a tool that could help ensure our models meet their accuracy specs.
wangtiance commented on Jan 11, 2023
Hello, not sure if I should open a new issue for this, but are the pretrained models trained with default hyperparameters? And do all the pretrained models match the accuracies from the original papers? It seems unlikely that the default setting can achieve the best result for every model.
mostafaelhoushi commented on Jan 11, 2023
In the past, when I trained the models from scratch, I recall being able to reproduce the accuracy for almost all of them.
MobileNet might have its own hyperparameters, but the remaining models should use the same settings.
wangtiance commented on Jan 12, 2023
Thanks for the response! It's a good thing that one setting can work well for different models.
mostafaelhoushi commented on Jan 12, 2023
If you check most vision CNN papers, you will find they train with the same hyperparameters: SGD optimizer, 90 epochs, and an initial learning rate of 0.1 that decays by a factor of 10 every 30 epochs, as sketched below.
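For concreteness, a minimal sketch of that common recipe in PyTorch; the momentum (0.9) and weight decay (1e-4) values are assumed here, since they are typical choices rather than something stated above, and the model choice is just an example:

import torch
import torchvision.models as models

# Common ImageNet recipe: SGD, 90 epochs, lr 0.1 decayed by 10x every 30 epochs.
model = models.resnet18()  # any torchvision CNN; picked only for illustration
optimizer = torch.optim.SGD(model.parameters(), lr=0.1,
                            momentum=0.9, weight_decay=1e-4)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)

for epoch in range(90):
    # ... one full training pass over ImageNet would go here ...
    scheduler.step()
    if epoch in (29, 59, 89):
        # Learning rate drops to roughly 0.01, 0.001, and 0.0001 at these points.
        print(epoch + 1, optimizer.param_groups[0]["lr"])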