Research Paper Summaries

Explore summaries of key scientific papers in Data Science and AI.

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

by Mingxing Tan, Quoc V. Le

Abstract

EfficientNet introduces a new scaling method that balances network depth, width, and resolution using a simple compound coefficient. The approach achieves state-of-the-art accuracy on ImageNet while being up to 8.4x smaller and 6.1x faster than previous ConvNets.

Key Highlights

  • Proposed a compound scaling method for ConvNet dimensions.
  • EfficientNet-B7 achieves 84.3% top-1 accuracy on ImageNet.
  • Demonstrated transferability to datasets like CIFAR-100 and Flowers.

Methodology

The compound scaling method starts from a baseline network and scales its depth, width, and input resolution uniformly with a single compound coefficient φ: depth grows as α^φ, width as β^φ, and resolution as γ^φ, where the constants α, β, γ are found by a small grid search under the constraint α·β²·γ² ≈ 2 (so each unit of φ roughly doubles FLOPS). Neural architecture search is first used to design the baseline network, EfficientNet-B0, which the compound method then scales up to produce the EfficientNet family (B1 through B7).
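The scaling rule above can be sketched in a few lines. This is an illustrative calculation, not the authors' code: the coefficients α = 1.2, β = 1.1, γ = 1.15 are the values reported in the paper's grid search, while the function name and the 224-pixel baseline resolution (that of EfficientNet-B0) are assumptions for the example; the actual EfficientNet resolutions are hand-rounded and do not follow γ^φ exactly.

```python
# Compound scaling sketch. ALPHA * BETA**2 * GAMMA**2 ~= 2, so each
# increment of phi roughly doubles the FLOPS of the scaled network.
ALPHA, BETA, GAMMA = 1.2, 1.1, 1.15  # grid-search values from the paper

def compound_scale(phi: int, base_resolution: int = 224) -> dict:
    """Scale depth, width, and resolution uniformly by coefficient phi."""
    return {
        "depth_multiplier": round(ALPHA ** phi, 3),   # layers: alpha^phi
        "width_multiplier": round(BETA ** phi, 3),    # channels: beta^phi
        "resolution": round(base_resolution * GAMMA ** phi),  # input size
    }

# phi = 0 recovers the EfficientNet-B0 baseline unchanged.
print(compound_scale(0))
print(compound_scale(1))
```

Increasing φ scales all three dimensions together, which is the paper's key departure from prior work that scaled only one dimension (e.g., depth alone in ResNet).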

Results and Key Findings

  • EfficientNet-B0 achieves 77.1% top-1 accuracy with 5.3M parameters.
  • EfficientNet-B7 is 8.4x smaller and 6.1x faster than GPipe.
  • Transfers well: state-of-the-art accuracy on 5 of 8 transfer learning datasets, including CIFAR-100 (91.7%) and Flowers (98.8%).

Applications and Impacts

EfficientNet's scalability and efficiency make it ideal for deployment in mobile applications, large-scale image recognition, and transfer learning scenarios.

Conclusion

EfficientNet revolutionizes ConvNet scaling by balancing depth, width, and resolution, setting new standards for accuracy and efficiency in deep learning.