Question 1

What image formats does the OCR tool support?

Accepted Answer

The tool accepts PNG, JPG/JPEG, WebP, TIFF, and BMP. Support for HEIC (iPhone photos) and AVIF depends on your browser: both work in recent Safari, and AVIF also works in recent Chrome, but Chrome does not decode HEIC natively. If your image is in an unsupported format, convert it to PNG or JPG using the Image Converter tool first.
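
If you want to check a folder of files before uploading, the format rules above can be sketched as a small lookup. This is an illustration only: the function name and the three result labels are hypothetical, and treating `.tif` as equivalent to `.tiff` is an assumption not stated in the answer.

```python
from pathlib import Path

# Extensions taken from the answer above. Grouping them into two sets,
# and accepting ".tif" alongside ".tiff", are assumptions for this sketch.
ALWAYS_SUPPORTED = {".png", ".jpg", ".jpeg", ".webp", ".tif", ".tiff", ".bmp"}
BROWSER_DEPENDENT = {".heic", ".avif"}


def classify_format(filename: str) -> str:
    """Return 'supported', 'browser-dependent', or 'convert-first'."""
    ext = Path(filename).suffix.lower()
    if ext in ALWAYS_SUPPORTED:
        return "supported"
    if ext in BROWSER_DEPENDENT:
        return "browser-dependent"
    # Anything else should be converted to PNG or JPG before uploading.
    return "convert-first"
```

The check looks only at the file extension, so a misnamed file will be misclassified.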

Question 2

How do I get better OCR accuracy?

Accepted Answer

Accuracy depends on four factors: resolution (use 300 DPI or higher), contrast (dark text on a light background), orientation (text lines should be horizontal), and language selection (set the language dropdown to the language of the text in your image). Of these, selecting the wrong language is the most common cause of garbled output.
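
To check whether an image meets the 300 DPI guideline, divide its width in pixels by the physical width of the original page in inches. A minimal sketch of that arithmetic follows; the function names are hypothetical and the physical width is something you have to supply yourself.

```python
def effective_dpi(pixel_width: int, physical_width_inches: float) -> float:
    """Pixels per inch across the page, given the page's real-world width."""
    if physical_width_inches <= 0:
        raise ValueError("physical width must be positive")
    return pixel_width / physical_width_inches


def meets_ocr_target(pixel_width: int, physical_width_inches: float,
                     target_dpi: float = 300.0) -> bool:
    """True if the image reaches the target resolution (300 DPI by default)."""
    return effective_dpi(pixel_width, physical_width_inches) >= target_dpi
```

For example, a US Letter page (8.5 inches wide) scanned to 2550 pixels across works out to exactly 300 DPI, while the same page at 1275 pixels across is only 150 DPI.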

Question 3

Can I extract text from a scanned PDF?

Accepted Answer

Not directly, because this tool processes image files only. To OCR a scanned PDF, use a PDF tool to export the pages as images (PNG or JPG), then upload each image here. For digitally created PDFs with an embedded text layer, the PDF to Word converter extracts text without needing OCR.
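
One way to do the export step locally is Poppler's `pdftoppm` command. The answer does not name a specific PDF tool, so using Poppler here is an assumption, and the helper names and the `page` output prefix are hypothetical. The sketch renders each page as a PNG at 300 DPI.

```python
import subprocess
from pathlib import Path


def build_pdftoppm_command(pdf_path, output_prefix, dpi=300):
    """Build the pdftoppm argument list: -r sets the DPI, -png the format."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(output_prefix)]


def export_pages(pdf_path, output_dir, dpi=300):
    """Render every page of the PDF to PNG files inside output_dir.

    Requires pdftoppm (part of Poppler) to be installed and on the PATH.
    """
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    prefix = out / "page"  # pdftoppm appends "-<page number>.png"
    subprocess.run(build_pdftoppm_command(pdf_path, prefix, dpi), check=True)
    return sorted(out.glob("page-*.png"))
```

Calling `export_pages("scan.pdf", "pages")` would leave one PNG per page in the `pages` folder, ready to upload one at a time.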

Question 4

Does the image get uploaded to a server?

Accepted Answer

No. The OCR engine (Tesseract.js) runs as WebAssembly inside your browser. Your image files are never sent to any server — processing happens entirely on your device. This makes the tool safe to use with confidential documents, medical records, and private photos.

Question 5

Why does OCR work on typed text but not handwriting?

Accepted Answer

Tesseract's standard models are trained on printed and typed characters. Cursive handwriting and calligraphy use connected strokes that differ fundamentally from the separated character shapes in the training data. For handwritten notes, dedicated handwriting-recognition services (Google Document AI, Microsoft Azure AI Vision) use separate models that handle connected strokes.

Question 6

What languages does the OCR support?

Accepted Answer

The tool supports 20+ languages including English, French, German, Spanish, Italian, Portuguese, Dutch, Russian, Chinese (Simplified and Traditional), Japanese, Korean, Arabic, and Hindi. Select the correct language before processing — using the wrong language model is the single most common cause of poor results on non-English text.
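
For readers who later move to Tesseract outside the browser, the languages listed above correspond to the following Tesseract language codes. The codes are Tesseract's standard trained-data names, but whether this tool's dropdown uses exactly these identifiers internally is an assumption, and the helper function is hypothetical.

```python
# Display names from the answer above mapped to Tesseract language codes.
TESSERACT_LANG_CODES = {
    "English": "eng",
    "French": "fra",
    "German": "deu",
    "Spanish": "spa",
    "Italian": "ita",
    "Portuguese": "por",
    "Dutch": "nld",
    "Russian": "rus",
    "Chinese (Simplified)": "chi_sim",
    "Chinese (Traditional)": "chi_tra",
    "Japanese": "jpn",
    "Korean": "kor",
    "Arabic": "ara",
    "Hindi": "hin",
}


def language_argument(names):
    """Join one or more language names into a Tesseract-style 'eng+fra' string."""
    codes = []
    for name in names:
        if name not in TESSERACT_LANG_CODES:
            raise KeyError(f"no known Tesseract code for {name!r}")
        codes.append(TESSERACT_LANG_CODES[name])
    return "+".join(codes)
```

Tesseract accepts several codes joined with `+` for documents that mix languages, for example `eng+fra`.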

Question 7

Why are some characters misread (0 vs O, 1 vs l)?

Accepted Answer

OCR is probabilistic — it estimates each character from its visual shape. Visually similar characters (0 and O, 1 and l and I, rn and m) are the most common source of confusion. At lower resolutions, these characters' shapes become nearly identical at the pixel level. Higher resolution (300 DPI+) and good contrast reduce but do not eliminate these errors. Always proofread extracted text for high-stakes use.
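
Proofreading can be narrowed down by flagging the tokens most likely to contain one of these mix-ups. The sketch below is a simple heuristic, not something the tool does itself: it reports any token that mixes digits with the letters O, o, l, or I, such as an invoice number read as `2O24`. The function name and the choice of letters are assumptions.

```python
import re

# Letters that are easily mistaken for the digits 0 and 1.
CONFUSABLE_LETTERS = set("OolI")


def flag_ambiguous_tokens(text: str) -> list:
    """Return tokens that mix digits with letters often confused with digits."""
    flagged = []
    for token in re.findall(r"[A-Za-z0-9]+", text):
        has_digit = any(ch.isdigit() for ch in token)
        has_confusable = any(ch in CONFUSABLE_LETTERS for ch in token)
        if has_digit and has_confusable:
            flagged.append(token)
    return flagged
```

The heuristic only points at suspicious tokens; it cannot tell which reading is correct, and it will miss confusions such as rn versus m.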

Question 8

Can I process multiple images at once?

Accepted Answer

The tool processes one image at a time. To work on a few images in parallel, open the tool in separate browser tabs. For batch processing of hundreds of pages, a scripted workflow using the Tesseract command-line tool or a cloud OCR API is more efficient.
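
A batch run with the Tesseract command-line tool could look roughly like the sketch below. It assumes `tesseract` is installed and on the PATH; the helper names and the list of file extensions are hypothetical. Each image produces a `.txt` file next to it.

```python
import subprocess
from pathlib import Path

# Image types to pick up from the folder (an assumption for this sketch).
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def build_tesseract_command(image_path: Path, lang: str = "eng") -> list:
    """Build 'tesseract <image> <output base> -l <lang>'.

    Tesseract appends '.txt' to the output base itself.
    """
    output_base = image_path.with_suffix("")
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def ocr_directory(directory, lang: str = "eng") -> list:
    """Run Tesseract on every image in the directory, one after another."""
    images = sorted(
        p for p in Path(directory).iterdir()
        if p.suffix.lower() in IMAGE_SUFFIXES
    )
    for image in images:
        subprocess.run(build_tesseract_command(image, lang), check=True)
    return [image.with_suffix(".txt") for image in images]
```

Running the images sequentially keeps the sketch simple; for very large batches, parallel workers or a cloud API may be faster.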

Input Type	Expected Accuracy	How to Improve
Printed document scan at 300 DPI	Excellent (95–99%)	Ensure flat, well-lit scan
High-res screenshot of text	Excellent (95–99%)	Use full-size screenshot, no compression
Phone photo of document	Good (85–95%)	Use good lighting, hold camera level
Low-res image (under 150 DPI)	Moderate (60–80%)	Upscale image before uploading
Image with shadows or glare	Moderate (60–80%)	Convert to greyscale, increase contrast
Rotated or skewed text	Poor (30–60%)	Deskew image before uploading
Handwritten text	Poor (10–40%)	Use a dedicated handwriting OCR service
Decorative or stylised fonts	Variable	Results depend heavily on font clarity
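
The "convert to greyscale, increase contrast" advice in the table can be illustrated on raw pixel values. The sketch below uses the common BT.601 luma weights for greyscale and a simple min-max stretch for contrast. Both are assumptions about what a reasonable pre-processing step looks like, not a description of what the tool does, and a real workflow would normally use an image editor or imaging library instead of plain lists.

```python
def to_greyscale(pixels):
    """Convert (r, g, b) tuples in 0-255 to grey levels using BT.601 weights."""
    return [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in pixels]


def stretch_contrast(grey):
    """Rescale grey levels so the darkest becomes 0 and the lightest 255."""
    if not grey:
        return []
    lo, hi = min(grey), max(grey)
    if hi == lo:
        # A flat image has no contrast to stretch; return it unchanged.
        return list(grey)
    return [round((v - lo) * 255 / (hi - lo)) for v in grey]
```

For instance, grey levels of 100, 120, and 200 (a washed-out scan) become 0, 51, and 255 after stretching, which widens the gap between text and background.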
