pennypdf

Format conversion

Scanned PDF to Word

A scanned PDF is just a PDF of a photograph — there's no text layer, so 'select all' gives you nothing. Converting that to editable Word is actually two jobs: OCR to create the text layer, then PDF-to-Word to preserve the layout. Trying to skip the OCR step is why most online tools give you a one-page image embedded in a .docx.

PennyPDF chains the two together. OCR (Tesseract, 100+ languages) produces a searchable PDF in about 3 seconds per page. That searchable PDF feeds into the Word conversion, which reads the newly-created text layer and maps it onto Word paragraphs, tables, and images.

Quality on old or low-contrast scans is meaningfully better if you enable our AI-enhanced OCR option (coming soon, premium). For everyday modern scans at 200dpi+, standard OCR usually gets 98%+ character accuracy.

How it works

  1. 1Upload the scanned PDF at /ocr.
  2. 2Pick the language (auto-detect is fine for most languages).
  3. 3Download the searchable PDF OCR output.
  4. 4Feed that into /pdf-to-word for the editable .docx.

Frequently asked

How accurate is the OCR?+

Tesseract is 98-99% character accuracy on clean modern scans. Drops to 93-95% on low-contrast or skewed scans. Handwriting is not supported well by any free OCR engine.

What languages?+

100+ via Tesseract. The picker includes the top 20 by search volume; for rare languages, pass the ISO code in the advanced options.

Combined cost?+

3 coins for OCR (covers up to 50 pages; +1 per additional 50), plus 2 coins for the Word conversion. About 20 cents total at Starter-pack pricing.

Can I skip straight from scan to Word without the intermediate PDF?+

Not currently — the intermediate searchable PDF is what makes the layout preservation work. It's also more useful: you can archive the searchable PDF as your master copy and regenerate Word later for free.

Will handwritten notes on the scan come through?+

As images, yes (they stay in the Word doc as pictures). As editable text, no — Tesseract doesn't do handwriting well, and no free engine does.

Why PennyPDF

  • No subscription. Ever.
  • Coins never expire — use them in 5 years.
  • Client-side processing for 14 of 22 tools.
  • No watermarks at any tier.
  • Per-operation pricing, shown before you click.
  • Same coins for web + public API.