While deep learning models have replaced hand-designed features across m...
Optimization of non-convex loss surfaces containing many local minima re...
Data augmentation has emerged as a powerful technique for improving the
...
While there has been much recent work studying how linguistic informatio...