Tong He, Zhi Zhang, Hang Zhang, Zhongyue Zhang, Junyuan Xie, Mu Li ArXiv link — https://arxiv.org/abs/1812.01187 Contributions In this paper — the authors examine a collection of training procedure and model architecture refinements and empirically evaluate their impact on the final model accuracy via ablation study. these tricks introduce minor modifications…