Question 1

How does the PDF to Excel conversion work?

Accepted Answer

The tool uses PDF.js to extract the text layer from your PDF file, then analyses the x/y coordinates of each text item to detect column boundaries. Items that share consistent x-positions are clustered into columns, and each detected line becomes a row in the spreadsheet. The output is a real .xlsx file built using the SpreadsheetML open standard — no third-party library required.

Question 2

Which extraction mode should I use?

Accepted Answer

Use "Table detection" for PDFs that contain structured tabular data — invoices, financial reports, statements, price lists, or any document where text is arranged in columns. This mode clusters x-positions to assign cells to columns automatically. Use "Line-by-line" for simpler documents like lists, bullet-point reports, or single-column text where you just want each line as a row.

Question 3

Why does the output look different from the original PDF?

Accepted Answer

PDF stores content as positioned elements rather than structured data. The converter reconstructs structure from positions, which is approximate. Complex layouts — merged cells, rotated text, cells that span multiple columns, or inconsistent column alignment — may not map perfectly. A review and light cleanup in Excel is normal for complex documents.

Question 4

Can I convert a scanned PDF to Excel?

Accepted Answer

No. Scanned PDFs are images of pages with no text layer, so there is no data to extract. To convert a scanned document, first use the OCR tool to extract the text, then copy the output into Excel manually or paste it into a new PDF before re-converting.

Question 5

Is my PDF uploaded to a server?

Accepted Answer

No. All processing uses PDF.js running in your browser. Your file is never sent anywhere — the conversion happens entirely on your device and works without an internet connection after the page has loaded.

Question 6

What is the difference between .xlsx and .csv output?

Accepted Answer

.xlsx is the native Excel format — it supports multiple sheets, cell types, number formatting, and is directly openable in Microsoft Excel, Google Sheets, and LibreOffice Calc. .csv (comma-separated values) is plain text and universally compatible with any spreadsheet application, database import tool, or script. Use .xlsx for Excel workflows and .csv for data pipelines or scripting.

Question 7

What is the maximum PDF size I can convert?

Accepted Answer

There is no server-side size limit. Practical limits depend on your device's memory. PDFs up to 20 MB with under 200 pages process quickly on most modern devices. Very large PDFs may take longer as each page is processed sequentially.

Document type	Mode	Expected quality
Bank statement (digital PDF)	Table detection	Excellent — consistent column alignment
Invoice from accounting software	Table detection	Excellent — structured line items
Excel report exported to PDF	Table detection	Excellent — original column structure preserved
Price list or product catalogue	Table detection	Good — minor cleanup may be needed
Simple text report or list	Line-by-line	Good — each line maps to a row
Multi-column newsletter layout	Table detection	Moderate — columns may interleave
Scanned document (image PDF)	Either	Not supported — use OCR tool first

PDF to Excel Converter Free

How to Convert PDF to Excel

What Types of PDFs Convert Well

Conversion Quality by Document Type

Tips for Best Results

Try Both Modes

Check for Text Layer First

Unlock Password PDFs First

Use CSV for Data Pipelines

Clean Up in Excel

Large PDFs

Frequently Asked Questions

Related Tools