paper review

1.Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

post-thumbnail

2.ON LARGE-BATCH TRAINING FOR DEEP LEARNING: GENERALIZATION GAP AND SHARP MINIMA

post-thumbnail

3.GRAPH ATTENTION NETWORKS

post-thumbnail