[x] Born Again Neural Networks - distilling a teacher model to a student model with an identical architecture, the student outperforms the teacher (using a dense net) \cite{furlanelloborn}
[x] Non-local Neural Networks \cite{1711.07971} - facebookAI - Achieves best results in video classification, object segmentation and pose estimation - Reddit
[x] Distilling a Neural Network Into a Soft Decision Tree \cite{1711.09784} - describes a way of using a trained neural net to create a type of soft decision tree that generalizes better than one learned directly from the training data - Reddit