PublicSoftTools

PDF to Excel Converter Free

Extract tables and data from any PDF and download as Excel (.xlsx) or CSV. Runs entirely in your browser — your files never leave your device. No signup required.

Drop your PDF here

or click to browse — .pdf files supported

Everything runs locally — your PDF never leaves your device.

How to Convert PDF to Excel

  1. 1Choose an extraction mode: Table detection for columnar data (invoices, reports, statements) or Line-by-line for simple lists and text.
  2. 2Drag your PDF onto the upload zone or click to browse. The file loads locally — nothing is sent to a server.
  3. 3Click Extract to Spreadsheet. The tool processes each page and shows a preview of the first rows.
  4. 4Click Download .xlsx to open in Excel or Google Sheets, or Download .csv for a plain-text format compatible with any data tool.

What Types of PDFs Convert Well

The converter works best on digitally-created PDFs — documents exported from Excel, accounting software, ERP systems, or reporting tools. These PDFs have a precise text layer that maps directly to column positions.

Scanned PDFs (photos of paper documents) have no text layer and cannot be converted. PDFs with complex layouts — merged cells, diagonal text, multi-level headers, or footnotes mixed with data — may need manual cleanup after conversion.

Conversion Quality by Document Type

Document typeModeExpected quality
Bank statement (digital PDF)Table detectionExcellent — consistent column alignment
Invoice from accounting softwareTable detectionExcellent — structured line items
Excel report exported to PDFTable detectionExcellent — original column structure preserved
Price list or product catalogueTable detectionGood — minor cleanup may be needed
Simple text report or listLine-by-lineGood — each line maps to a row
Multi-column newsletter layoutTable detectionModerate — columns may interleave
Scanned document (image PDF)EitherNot supported — use OCR tool first

Tips for Best Results

Try Both Modes

If Table detection produces mixed-up columns, switch to Line-by-line. The preview updates after each extraction so you can compare without re-uploading.

Check for Text Layer First

Open the PDF and try to select text with your cursor. If you can highlight individual words, the PDF has a text layer and will convert well. If selection highlights the whole page as an image, use OCR first.

Unlock Password PDFs First

Password-protected PDFs cannot be processed. Use the PDF Password Remover to unlock the file first, then convert to Excel.

Use CSV for Data Pipelines

If you are importing into a database, feeding a Python/R script, or uploading to a data warehouse, the .csv output is simpler and more portable than .xlsx.

Clean Up in Excel

After opening the .xlsx, use Excel's Text to Columns and Find & Replace tools to fix any cells that merged or split incorrectly. Most documents need only minor adjustments.

Large PDFs

For PDFs with hundreds of pages, the extraction runs page-by-page with a progress bar. The browser tab stays responsive — you can switch away and return when extraction is complete.

Frequently Asked Questions

How does the PDF to Excel conversion work?

The tool uses PDF.js to extract the text layer from your PDF file, then analyses the x/y coordinates of each text item to detect column boundaries. Items that share consistent x-positions are clustered into columns, and each detected line becomes a row in the spreadsheet. The output is a real .xlsx file built using the SpreadsheetML open standard — no third-party library required.

Which extraction mode should I use?

Use "Table detection" for PDFs that contain structured tabular data — invoices, financial reports, statements, price lists, or any document where text is arranged in columns. This mode clusters x-positions to assign cells to columns automatically. Use "Line-by-line" for simpler documents like lists, bullet-point reports, or single-column text where you just want each line as a row.

Why does the output look different from the original PDF?

PDF stores content as positioned elements rather than structured data. The converter reconstructs structure from positions, which is approximate. Complex layouts — merged cells, rotated text, cells that span multiple columns, or inconsistent column alignment — may not map perfectly. A review and light cleanup in Excel is normal for complex documents.

Can I convert a scanned PDF to Excel?

No. Scanned PDFs are images of pages with no text layer, so there is no data to extract. To convert a scanned document, first use the OCR tool to extract the text, then copy the output into Excel manually or paste it into a new PDF before re-converting.

Is my PDF uploaded to a server?

No. All processing uses PDF.js running in your browser. Your file is never sent anywhere — the conversion happens entirely on your device and works without an internet connection after the page has loaded.

What is the difference between .xlsx and .csv output?

.xlsx is the native Excel format — it supports multiple sheets, cell types, number formatting, and is directly openable in Microsoft Excel, Google Sheets, and LibreOffice Calc. .csv (comma-separated values) is plain text and universally compatible with any spreadsheet application, database import tool, or script. Use .xlsx for Excel workflows and .csv for data pipelines or scripting.

What is the maximum PDF size I can convert?

There is no server-side size limit. Practical limits depend on your device's memory. PDFs up to 20 MB with under 200 pages process quickly on most modern devices. Very large PDFs may take longer as each page is processed sequentially.