Skip to content

MAINSUP/Computer-vision-for-OCR

Repository files navigation

Application to convert both scanned and digital documents to docx type file.

* Digital documents are simply read and saved in a docx document. * Scanned pdf documents recognition is featured with print/handwriting recognition model allowing to appear handwritten text as cursive font in an output file. * Application is optimized for poor quality documents processing; however, user should pass that y/n flag at starting prompt to avoid preprocessing of documents of good quality. * Pytesseract engine is used to OCR images.

The application is hosted on Streamlit Community Cloud. It can beaccessed with the below link.

  • PDF to Docx App
  • About

    Meduzzen

    Resources

    License

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published

    Contributors 2

    •  
    •