pennypdf

Format conversion

PDF to Excel — extract tables from any PDF

Getting a table out of a PDF and into Excel is one of those tasks that feels like it should be trivial but usually isn't. Select-and-copy gives you one long column of text. Right-click Save-As-Excel gives you a picture embedded in a spreadsheet. Neither is usable.

Our tool uses a combination of line detection (finds the table borders), text clustering (finds rows without borders), and column-alignment heuristics. Works on bank statements, scientific papers, invoices, and quarterly reports. Multi-page tables are concatenated into one sheet.

If the table is in a scanned PDF, run /ocr first — AI-enhanced OCR (coming in a later release) handles hand-drawn tables better than traditional OCR, which sometimes reads ruled lines as characters.

How it works

  1. 1Upload your PDF.
  2. 2Select which pages contain tables (default: all pages).
  3. 3Preview the extracted table; adjust column boundaries if needed.
  4. 4Download the .xlsx — each table is a separate sheet, pages are concatenated.

Frequently asked

Does it work for bank statements?+

Yes — bank statements are actually one of the easier cases because they use consistent column positions. We've tested against Chase, Bank of America, HSBC UK, and Revolut exports.

Multi-page tables?+

The tool auto-concatenates tables that span pages, as long as the column structure is consistent. Mixed-structure multi-page tables export as separate sheets.

Currency and number formatting?+

Numbers are detected and Excel-formatted as numbers (not strings). Currency symbols get stripped but the symbol is preserved in a separate column so you don't lose the information.

Can I extract a table from a specific page only?+

Yes. The page selector lets you pick any range. Useful when a 200-page annual report has three tables on three pages and you don't want to pay for full-document processing.

What about PDFs with no visible tables?+

If there's no tabular structure to find, the tool falls back to emitting each paragraph as a row. Usually not what you want — consider /pdf-to-text or /pdf-to-word instead.

Why PennyPDF

  • No subscription. Ever.
  • Coins never expire — use them in 5 years.
  • Client-side processing for 14 of 22 tools.
  • No watermarks at any tier.
  • Per-operation pricing, shown before you click.
  • Same coins for web + public API.