Handwriting to Text - OCR Converter

A modern web application that converts handwritten and printed text from images into digital text using Tesseract.js OCR engine.

Features

📝 Convert handwritten and printed text from images to digital text
🖼️ Support for multiple image formats (PNG, JPEG, GIF)
⚡ Real-time processing with live preview
🔧 Advanced image preprocessing options
🌍 Multiple language support
📱 Responsive design for all devices
🎯 Drag and drop file upload

Tech Stack

React 18
TypeScript
Tesseract.js
Tailwind CSS
Vite
Lucide React Icons

Getting Started

Prerequisites

Node.js 16.x or higher
npm or yarn

Installation

Clone the repository:

git clone https://github.com/codegallery-me/Handwritten-Text-Recognition.git
cd handwriting-ocr-app

Install dependencies:

npm install

Start the development server:

npm run dev

Build for production:

npm run build

Usage Guide

Basic Usage

Open the application in your browser
Upload an image by either:
- Dragging and dropping an image file
- Clicking "Browse Files" to select an image
Wait for the OCR processing to complete
View the extracted text in the results panel

Recognition Settings

Language Options

eng: Standard English recognition
eng_best: High-accuracy English recognition (slower)
osd: Auto-detect orientation and script

Preprocessing Options

none: No preprocessing
bw: Black & White mode (best for handwriting)
sharpen: Sharpened mode (best for printed text)

API Documentation

Component Structure

interface Settings {
  language: 'eng' | 'eng_best' | 'osd';
  preprocessing: 'none' | 'bw' | 'sharpen';
}

interface AppProps {}

interface AppState {
  image: string | null;
  text: string;
  loading: boolean;
  error: string | null;
  settings: Settings;
}

Core Functions

`handleFile(file: File) => void`

Processes the uploaded file and initiates OCR.

Parameters:

file: The image file to process

`preprocessImage(imageData: string) => Promise<string>`

Applies preprocessing filters to the image before OCR.

Parameters:

imageData: Base64 encoded image string Returns:
Promise resolving to processed image data URL

`processImage(imageData: string) => Promise<void>`

Performs OCR on the image using Tesseract.js.

Parameters:

imageData: Base64 encoded image string

Tesseract Configuration

{
  tessedit_char_whitelist: 'ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789.,!?-_\'"\n ',
  tessedit_pageseg_mode: '6',
  tessjs_create_pdf: '0',
  tessjs_create_hocr: '0',
  tessjs_create_tsv: '0'
}

Best Practices for Optimal Results

Image Quality
- Use clear, well-lit images
- Ensure good contrast between text and background
- Avoid blurry or distorted images
Preprocessing Selection
- For handwritten text: Use "Black & White" mode
- For printed text: Use "Sharpen" mode
- For unclear results: Try different preprocessing options
Language Selection
- Use "English (Best)" for highest accuracy
- Use "Auto Detect" for unknown text orientation
- Standard "English" for faster processing

Performance Considerations

Image size: Larger images take longer to process
Language mode: "English (Best)" is more accurate but slower
Browser resources: Processing occurs client-side
Maximum file size: 10MB recommended

Error Handling

The application handles various error cases:

Invalid file types
Processing failures
Network issues
Browser compatibility

Browser Support

Chrome (latest)
Firefox (latest)
Safari (latest)
Edge (latest)

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Tesseract.js for OCR functionality
Tailwind CSS for styling
Lucide React for icons

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
README.md		README.md
eslint.config.js		eslint.config.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Handwriting to Text - OCR Converter

Features

Tech Stack

Getting Started

Prerequisites

Installation

Usage Guide

Basic Usage

Recognition Settings

Language Options

Preprocessing Options

API Documentation

Component Structure

Core Functions

`handleFile(file: File) => void`

`preprocessImage(imageData: string) => Promise<string>`

`processImage(imageData: string) => Promise<void>`

Tesseract Configuration

Best Practices for Optimal Results

Performance Considerations

Error Handling

Browser Support

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Uh oh!

Contributors 2

Uh oh!

Languages

codegallery-me/Handwritten-Text-Recognition

Folders and files

Latest commit

History

Repository files navigation

Handwriting to Text - OCR Converter

Features

Tech Stack

Getting Started

Prerequisites

Installation

Usage Guide

Basic Usage

Recognition Settings

Language Options

Preprocessing Options

API Documentation

Component Structure

Core Functions

handleFile(file: File) => void

preprocessImage(imageData: string) => Promise<string>

processImage(imageData: string) => Promise<void>

Tesseract Configuration

Best Practices for Optimal Results

Performance Considerations

Error Handling

Browser Support

Contributing

License

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Uh oh!

Contributors 2

Uh oh!

Languages

`handleFile(file: File) => void`

`preprocessImage(imageData: string) => Promise<string>`

`processImage(imageData: string) => Promise<void>`