Optical Character Recognition (OCR) is the task of detecting and recognizing text in images, faxes, and scanned paper documents, and converting it to a digital format for further use. OCR has a wide range of use cases across industries such as banking, insurance, legal, and retail. Whatever the use case, it is important to identify the right pre-trained OCR model to build the solution on. Finding and evaluating a solution can take weeks or even months, but by using Tiyaro the effort is reduced to a matter of minutes.
Search for 'mmocr' models in Tiyaro, or click on "Try on Tiyaro" from the MMOCR GitHub.
Tiyaro gives us a quicker way to compare different pre-trained machine learning models, along with SaaS vendor APIs, which helps accelerate the evaluation process. To compare models, all of them must be of the same Model Type, in this case Optical Character Recognition.
MMOCR IIIT 5k dataset public experiment
Once we have finished searching for models and trying their demos, we can move on to creating an experiment. There are several ways to create one; here we simply go to the Experiments tab and start a new experiment.
After that, we select the experiment's model type, which in our case is Optical Character Recognition.
Next, we select the models to run the experiment on. The model selection screen provides various filters to narrow down the list.
After selecting the MMOCR models, you can upload your custom dataset. Tiyaro accepts several formats for OCR experiment datasets; the easiest is a zipped CSV file with a single column containing the input. The sample IIIT 5k-word dataset used in this experiment can be downloaded from the Data section of the experiment config.
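The zipped single-column CSV described above can be produced with a few lines of standard-library Python. This is only a minimal sketch: the column header `input`, the file names, and the use of image URLs as cell values are assumptions made for illustration, so check the sample IIIT 5k-word dataset for the layout Tiyaro actually expects.

```python
import csv
import zipfile
from pathlib import Path


def build_dataset_zip(image_refs, out_dir, column="input"):
    """Write a one-column CSV of image references and zip it.

    The header name and file names are assumptions, not a documented format.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    csv_path = out_dir / "dataset.csv"
    zip_path = out_dir / "dataset.zip"

    # One header row, then one image reference per row.
    with csv_path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow([column])
        for ref in image_refs:
            writer.writerow([ref])

    # Store the CSV at the top level of the archive.
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        zf.write(csv_path, arcname=csv_path.name)
    return zip_path
```

Calling `build_dataset_zip(["https://example.com/word_1.png"], "out")` would write `out/dataset.zip` containing a single `dataset.csv`; the URL here is a placeholder.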
After running the experiment, the Results tab appears. It shows the latency of each model along with the experiment results; for our use case, that is a table of our inputs and the corresponding model predictions. The results can also be downloaded as a zip file.
The results are shown in the table below: the first column contains the actual input image, and the subsequent columns contain each model's OCR output.
As the results above show, output quality depends on each model's training and test datasets, so we have to evaluate the results and choose the model that best suits the use case.
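One way to make that evaluation less subjective is to score each model's predictions against known ground-truth text. The sketch below computes exact-match accuracy and character error rate (edit distance divided by reference length) per model. The model names and strings are made-up placeholders, not results from the experiment, and the case-insensitive comparison is a design choice you may want to change.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions, substitutions cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                 # delete from a
                            curr[j - 1] + 1,             # insert into a
                            prev[j - 1] + (ca != cb)))   # substitute
        prev = curr
    return prev[-1]


def score_model(predictions, references):
    """Return (exact-match accuracy, character error rate), ignoring case."""
    pairs = [(p.strip().lower(), r.strip().lower())
             for p, r in zip(predictions, references)]
    if not pairs or not any(r for _, r in pairs):
        raise ValueError("need at least one non-empty reference")
    exact = sum(p == r for p, r in pairs)
    errors = sum(edit_distance(p, r) for p, r in pairs)
    ref_chars = sum(len(r) for _, r in pairs)
    return exact / len(pairs), errors / ref_chars


# Placeholder data for illustration only.
references = ["private", "parking", "salutes"]
model_outputs = {
    "model_a": ["private", "parking", "salutes"],
    "model_b": ["privatc", "parking", "salute"],
}
scores = {name: score_model(preds, references)
          for name, preds in model_outputs.items()}
best = min(scores, key=lambda name: scores[name][1])  # lowest error rate
```

Character error rate is usually more informative than exact match for OCR, since a single wrong character in a long word counts as a small error rather than a complete miss. Latency, which the Results tab also reports, can be weighed alongside these scores.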
Text Detection + Text Recognition and KIE models are of model-type optical-character-recognition in Tiyaro. You can find these end2end OCR models on Tiyaro as well.
Tiyaro Docs - ocr response signature
response.text simply returns the text recognized by the model
response.raw_response returns the complete response from the model. Its contents vary considerably depending on the model provider.

While we try our best to keep Tiyaro models up to date with MMOCR releases, there may be some delay. If you don't find a particular MMOCR model on Tiyaro, or any other model for that matter, simply raise a request in Tiyaro EasyServe.

You can share the experiment with your coworkers and on social media. You can also make a copy of the given experiment to enhance or modify it. Wish to create one? Head on over to Tiyaro!