Building document recognition application, extracting text and annotating important phrases using machine learning.