PDF to Excel Converter Free
Extract tables and data from any PDF and download as Excel (.xlsx) or CSV. Runs entirely in your browser — your files never leave your device. No signup required.
Drop your PDF here
or click to browse — .pdf files supported
How to Convert PDF to Excel
- 1Choose an extraction mode: Table detection for columnar data (invoices, reports, statements) or Line-by-line for simple lists and text.
- 2Drag your PDF onto the upload zone or click to browse. The file loads locally — nothing is sent to a server.
- 3Click Extract to Spreadsheet. The tool processes each page and shows a preview of the first rows.
- 4Click Download .xlsx to open in Excel or Google Sheets, or Download .csv for a plain-text format compatible with any data tool.
What Types of PDFs Convert Well
The converter works best on digitally-created PDFs — documents exported from Excel, accounting software, ERP systems, or reporting tools. These PDFs have a precise text layer that maps directly to column positions.
Scanned PDFs (photos of paper documents) have no text layer and cannot be converted. PDFs with complex layouts — merged cells, diagonal text, multi-level headers, or footnotes mixed with data — may need manual cleanup after conversion.
Conversion Quality by Document Type
| Document type | Mode | Expected quality |
|---|---|---|
| Bank statement (digital PDF) | Table detection | Excellent — consistent column alignment |
| Invoice from accounting software | Table detection | Excellent — structured line items |
| Excel report exported to PDF | Table detection | Excellent — original column structure preserved |
| Price list or product catalogue | Table detection | Good — minor cleanup may be needed |
| Simple text report or list | Line-by-line | Good — each line maps to a row |
| Multi-column newsletter layout | Table detection | Moderate — columns may interleave |
| Scanned document (image PDF) | Either | Not supported — use OCR tool first |
Tips for Best Results
Try Both Modes
If Table detection produces mixed-up columns, switch to Line-by-line. The preview updates after each extraction so you can compare without re-uploading.
Check for Text Layer First
Open the PDF and try to select text with your cursor. If you can highlight individual words, the PDF has a text layer and will convert well. If selection highlights the whole page as an image, use OCR first.
Unlock Password PDFs First
Password-protected PDFs cannot be processed. Use the PDF Password Remover to unlock the file first, then convert to Excel.
Use CSV for Data Pipelines
If you are importing into a database, feeding a Python/R script, or uploading to a data warehouse, the .csv output is simpler and more portable than .xlsx.
Clean Up in Excel
After opening the .xlsx, use Excel's Text to Columns and Find & Replace tools to fix any cells that merged or split incorrectly. Most documents need only minor adjustments.
Large PDFs
For PDFs with hundreds of pages, the extraction runs page-by-page with a progress bar. The browser tab stays responsive — you can switch away and return when extraction is complete.
Frequently Asked Questions
How does the PDF to Excel conversion work?
The tool uses PDF.js to extract the text layer from your PDF file, then analyses the x/y coordinates of each text item to detect column boundaries. Items that share consistent x-positions are clustered into columns, and each detected line becomes a row in the spreadsheet. The output is a real .xlsx file built using the SpreadsheetML open standard — no third-party library required.
Which extraction mode should I use?
Use "Table detection" for PDFs that contain structured tabular data — invoices, financial reports, statements, price lists, or any document where text is arranged in columns. This mode clusters x-positions to assign cells to columns automatically. Use "Line-by-line" for simpler documents like lists, bullet-point reports, or single-column text where you just want each line as a row.
Why does the output look different from the original PDF?
PDF stores content as positioned elements rather than structured data. The converter reconstructs structure from positions, which is approximate. Complex layouts — merged cells, rotated text, cells that span multiple columns, or inconsistent column alignment — may not map perfectly. A review and light cleanup in Excel is normal for complex documents.
Can I convert a scanned PDF to Excel?
No. Scanned PDFs are images of pages with no text layer, so there is no data to extract. To convert a scanned document, first use the OCR tool to extract the text, then copy the output into Excel manually or paste it into a new PDF before re-converting.
Is my PDF uploaded to a server?
No. All processing uses PDF.js running in your browser. Your file is never sent anywhere — the conversion happens entirely on your device and works without an internet connection after the page has loaded.
What is the difference between .xlsx and .csv output?
.xlsx is the native Excel format — it supports multiple sheets, cell types, number formatting, and is directly openable in Microsoft Excel, Google Sheets, and LibreOffice Calc. .csv (comma-separated values) is plain text and universally compatible with any spreadsheet application, database import tool, or script. Use .xlsx for Excel workflows and .csv for data pipelines or scripting.
What is the maximum PDF size I can convert?
There is no server-side size limit. Practical limits depend on your device's memory. PDFs up to 20 MB with under 200 pages process quickly on most modern devices. Very large PDFs may take longer as each page is processed sequentially.