MacOS Screenshot OCR & LaTeX Model

This Python program extracts text and LaTeX code from screenshots, which can be useful when dealing with poorly formatted PDFs or complex scientific documents. The program leverages Optical Character Recognition (OCR) technology, temporary file management, and user-friendly GUI for streamlined and efficient text and LaTeX extraction.

Features

Screenshot Capture: Capture screenshots of selected screen areas and save them as temporary files.
OCR Engine (Pytesseract): Extract text from the captured screenshots using the Pytesseract library.
LaTeX Engine (Modified Pix2tex from Forked LatexOCR): Extract LaTeX code from the captured temporary screenshots based on the LaTeX-OCR (modified pix2tex, a forked LatexOCR model).
Clipboard Management: Append the recognized text or LaTeX code to the clipboard in an organized manner. Includes an easy-clear button for clearing the clipboard content.
User Interface: A lightweight, easy-to-use interface built with tkinter.

Installation

Clone the repository and navigate to the project folder:

git clone https://github.com/yourusername/mac_screenshot_ocr_latex.git
cd mac_screenshot_ocr_latex

Install the required libraries:

pip install -r requirements.txt

Clone and integrate the modified LatexOCR model (https://github.com/rawcsav/LaTeX-OCR) for use with the latex_engine.py file.
Modify the capture_tools.py, ocr_engine.py, and latex_engine.py files to include your custom configurations, if necessary.
Run the main_app.py script to start the application:

python main_app.py

Usage

Open the application and choose your desired recognition mode (Text OCR or LaTeX OCR).
Press "Enter" or click the corresponding button to capture a screenshot of the desired content.
The recognized text or translated LaTeX code will be appended to your clipboard.
Optional: Click "Clear Clipboard" to erase the clipboard content.
Paste the extracted content into your preferred application.

Demos

Online Demo

Limitations and Acknowledgement

Thanks to Lukas Belcher and his foundational Latex model (https://github.com/lukas-blecher/LaTeX-OCR)!

While this solution performs well in recognizing standard text, it may struggle with complex scientific symbols, mathematical notations, or technical expressions. In the future, specialized OCR models can be developed to handle such content more accurately. Additionally, improvements can be made to preprocessing and post-processing techniques to further enhance the quality and readability of extracted text.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MacOS Screenshot OCR & LaTeX Model

Features

Installation

Usage

Demos

Limitations and Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

rawcsav/MacOS-Screenshot-Text-OCR

Folders and files

Latest commit

History

Repository files navigation

MacOS Screenshot OCR & LaTeX Model

Features

Installation

Usage

Demos

Limitations and Acknowledgement

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages