PDF to Excel — extract tables from any PDF
Getting a table out of a PDF and into Excel is one of those tasks that feels like it should be trivial but usually isn't. Select-and-copy gives you one long column of text. Right-click Save-As-Excel gives you a picture embedded in a spreadsheet. Neither is usable.
Our tool combines line detection (to find table borders), text clustering (to find rows in tables without borders), and column-alignment heuristics. It works on bank statements, scientific papers, invoices, and quarterly reports. Multi-page tables are concatenated into one sheet.
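To make the borderless case concrete, here is a minimal sketch of text clustering plus a simple column-alignment heuristic. It is not PennyPDF's actual implementation: the `Word` type, the function names, and the `y_tol` / `min_gap` thresholds are all hypothetical, and it assumes word bounding boxes have already been extracted from the page. Line detection for ruled tables is not covered.

```python
from bisect import bisect_right
from typing import NamedTuple


class Word(NamedTuple):
    """Hypothetical word box: horizontal extent, baseline, and text."""
    x0: float
    x1: float
    y: float
    text: str


def cluster_rows(words, y_tol=2.0):
    """Group words into rows when consecutive baselines differ by <= y_tol."""
    rows = []
    for w in sorted(words, key=lambda w: (w.y, w.x0)):
        if rows and abs(w.y - rows[-1][-1].y) <= y_tol:
            rows[-1].append(w)
        else:
            rows.append([w])
    return [sorted(r, key=lambda w: w.x0) for r in rows]


def column_boundaries(words, min_gap=8.0):
    """Return x positions of vertical gutters that no word crosses."""
    spans = sorted((w.x0, w.x1) for w in words)
    if not spans:
        return []
    boundaries = []
    right = spans[0][1]  # rightmost edge seen so far
    for x0, x1 in spans[1:]:
        if x0 - right >= min_gap:
            boundaries.append((right + x0) / 2)
        right = max(right, x1)
    return boundaries


def to_grid(words, y_tol=2.0, min_gap=8.0):
    """Assign each word to a (row, column) cell by its horizontal midpoint."""
    bounds = column_boundaries(words, min_gap)
    grid = []
    for row in cluster_rows(words, y_tol):
        cells = [""] * (len(bounds) + 1)
        for w in row:
            col = bisect_right(bounds, (w.x0 + w.x1) / 2)
            cells[col] = (cells[col] + " " + w.text).strip()
        grid.append(cells)
    return grid
```

A gutter only counts as a column boundary if it is empty on every row, which is why tables with consistent column positions are the easy case for this kind of heuristic.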
If the table is in a scanned PDF, run /ocr first. AI-enhanced OCR (coming in a later release) will handle hand-drawn tables better than traditional OCR, which sometimes reads ruled lines as characters.
How it works
- 1. Upload your PDF.
- 2. Select which pages contain tables (default: all pages).
- 3. Preview the extracted table; adjust column boundaries if needed.
- 4. Download the .xlsx. Each table gets its own sheet, and a table that spans several pages is concatenated into one sheet.
Frequently asked
Does it work for bank statements?
Yes — bank statements are actually one of the easier cases because they use consistent column positions. We've tested against Chase, Bank of America, HSBC UK, and Revolut exports.
Multi-page tables?
The tool auto-concatenates tables that span pages, as long as the column structure is consistent. Mixed-structure multi-page tables export as separate sheets.
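As a rough illustration of that rule, the sketch below merges consecutive per-page tables into one sheet when they have the same number of columns and starts a new sheet otherwise. The function name `group_tables` is hypothetical, "consistent column structure" is approximated by column count alone, and dropping a header row that repeats on the next page is an assumption, not documented behavior.

```python
def group_tables(page_tables):
    """Merge consecutive tables with matching column counts into sheets.

    page_tables: tables in page order, each a list of rows (lists of cells).
    Returns a list of sheets, each a list of rows.
    """
    sheets = []
    for table in page_tables:
        if not table:
            continue  # skip pages where nothing was extracted
        if sheets and len(sheets[-1][0]) == len(table[0]):
            # Assumption: a first row identical to the sheet's header is a
            # repeated page header and is not copied again.
            rows = table[1:] if table[0] == sheets[-1][0] else table
            sheets[-1].extend(list(r) for r in rows)
        else:
            sheets.append([list(r) for r in table])
    return sheets
```

For example, two statement pages with `Date | Amount` columns followed by a three-column table would come out as two sheets.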
Currency and number formatting?
Numbers are detected and stored as Excel numbers (not strings). Currency symbols are stripped from the value and preserved in a separate column, so you don't lose the information.
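The idea can be sketched as follows. This is an illustration only: `split_currency` is a hypothetical helper, the symbol set is limited to four common symbols, and the pattern assumes English-style formatting (comma as thousands separator, dot as decimal point).

```python
import re
from decimal import Decimal

# Optional minus sign, optional currency symbol, then a number with or
# without comma-grouped thousands. English-style separators are assumed.
_PATTERN = re.compile(
    r"^(?P<sign>-?)\s*(?P<sym>[$€£¥]?)\s*"
    r"(?P<num>\d{1,3}(?:,\d{3})*(?:\.\d+)?|\d+(?:\.\d+)?)$"
)


def split_currency(cell):
    """Return (value, symbol) for a cell.

    value is a Decimal when the cell looks numeric; otherwise the original
    string is returned unchanged with an empty symbol.
    """
    m = _PATTERN.match(cell.strip())
    if m is None:
        return cell, ""
    value = Decimal(m["num"].replace(",", ""))
    if m["sign"]:
        value = -value
    return value, m["sym"]
```

The value would go in the numeric column and the symbol in the adjacent one; cells that do not match, such as "N/A", stay as text.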
Can I extract a table from a specific page only?
Yes. The page selector lets you pick any range. Useful when a 200-page annual report has three tables on three pages and you don't want to pay for full-document processing.
What about PDFs with no visible tables?
If there's no tabular structure to find, the tool falls back to emitting each paragraph as a row. Usually not what you want — consider /pdf-to-text or /pdf-to-word instead.
Why PennyPDF
- No subscription. Ever.
- Coins never expire — use them in 5 years.
- Client-side processing for 14 of 22 tools.
- No watermarks at any tier.
- Per-operation pricing, shown before you click.
- Same coins for web + public API.