Image Recognition
Advances in data science and data engineering have led to the development of Intelligent document processing (IDP) solutions. Our aim is to test the APIs with a variety of noises in order to determine the noises the OCR APIs can handle. Hence it is important to see all the metrics together to get a clear idea of the OCR API’s performance. Fig 5.2 (c): SmartDocQA - Vertical text: Vision and Textract text output comparison. Hence we tried various methods to clean the images before feeding to the API and checked whether API performance improved or not.