How to OCR a PDF: Free Tools and Where They Fall Short
You scan a contract, save it as a PDF, and a week later you go to search for a clause — and nothing comes up. The page looks like text, but to the computer it is just a picture of text. You can't select it, copy it, or search it. Turning that image back into real, machine-readable characters is exactly what OCR does, and "how do I OCR a PDF" is one of the most common things people type into a search bar for a reason: almost every team has a folder of scanned files they can see but can't actually use.
This guide covers the practical ways to OCR a PDF — the fastest free online tools, the option already sitting in your Google account, the desktop route, and a way to handle hundreds of files at once. Then it covers the part most "make your PDF searchable" pages skip: where free OCR quietly stops being enough, and what to reach for when it does.
What OCR actually does to a PDF
It helps to know what you're changing. A PDF can hold text in two very different ways. A text-based PDF (exported from Word, a browser, or most software) already contains real characters, so search and copy work out of the box. An image-only PDF — anything that came from a scanner, a phone photo, or a fax — is a flat picture. There are no characters underneath, which is why your search returns nothing.
Optical character recognition reads that picture, recognizes the letters and numbers by their shapes, and writes a hidden text layer behind the image. The result is usually called a searchable PDF: it looks identical, but now you can search, select, and copy the text, and screen readers can read it. Most tools can also export the recognized text out to Word or plain text if you need to edit it — that's the difference between making a PDF searchable and making it editable.
A quick way to tell which kind you have: open the PDF and try to select a sentence with your cursor. If nothing highlights, it's image-only and needs OCR.
Four ways to OCR a PDF
There is no single "best" method — the right one depends on how many files you have, how sensitive they are, and whether you need clean editable text or just search.
1. Free online OCR tools (fastest for one file)
For a single document, a browser tool is the quickest route. Adobe's free Acrobat online OCR, Smallpdf, iLovePDF, PDF24, and DeftPDF all follow the same three steps: upload the PDF, pick the document's language (this noticeably improves accuracy), and recognize the text. Most then let you choose the output — a searchable PDF, an editable Word document, or a plain text file — so the same tool covers "OCR PDF to Word" and "OCR PDF to text" alike, with no install and on any device.
The trade-offs are worth knowing. Free tiers cap file size and the number of files — often just one at a time — layout can shift on busy pages, and — most importantly — your file is uploaded to someone else's server to be processed. Reputable tools delete uploads after a set window (PDF24 after an hour, DeftPDF after three), but for anything confidential that round trip is still a real concern, which we'll come back to.
2. Google Docs (free, and you probably already have it)
If the file is already in Google Drive, right-click it and choose Open with → Google Docs. Drive runs OCR automatically and drops the recognized text below an image of each page. It's free, handles clean scans surprisingly well, and saves straight to Drive. The catch is formatting: tables and multi-column layouts usually come out as plain running text, so it's best for simple pages where you mainly need the words.
3. Adobe Acrobat Pro (desktop, layout-faithful)
If you have Acrobat Pro installed, open the scanned PDF, choose Scan & OCR → Recognize Text → In this file, and run it. The desktop version keeps the original layout more reliably than online tools and lets you edit text in place afterward. This is the steadier choice when the document's formatting matters or you'll be working inside the PDF.
4. Command line for bulk (OCRmyPDF)
When "a PDF" is really a folder of two hundred PDFs, clicking upload buttons stops making sense. Open-source tools like OCRmyPDF add a searchable text layer to scanned files from a single command and can be scripted to run across an entire directory on Windows, macOS, or Linux. It's the most technical option, but for recurring or high-volume work it's by far the least tedious.
Getting better OCR results
OCR quality depends more on the input than the tool. A few habits noticeably reduce errors:
Scan at 300 DPI or higher — small or faint text needs the extra resolution to be read cleanly. Keep the page flat and evenly lit, since shadows, glare, and curled book pages confuse character recognition. Fix the orientation and deskew crooked pages before you run OCR; a sideways or slanted page produces garbled output, and most tools have a deskew or "clean" option for exactly this. Set the document language — this is the highest-leverage setting most people skip. The major tools let you choose the language (and even combine several), and picking the right one sharply reduces errors, especially for accented or non-Latin characters. Finally, save as a searchable PDF — or as PDF/A if the file is headed for long-term archiving — because if you skip that step the page still looks like text on screen but search won't work, the single most common "my OCR didn't work" complaint.
Where free PDF OCR stops working
For a clean one-page scan, the free routes above are all you need. The trouble starts when the documents look like real business paperwork.
Four limits show up again and again. Complex layouts and tables — multi-column invoices, statements, forms with merged cells — often come out as scrambled running text, because basic OCR reads characters without understanding structure. Handwriting trips up most free tools entirely. Volume turns a two-minute task into a week when there are thousands of files. And the quiet one: a searchable PDF is still just text, not data. If what you actually need is the invoice number, the date, and the total pulled into your system, search alone doesn't get you there — someone still has to read the document and type the fields.
There's also a security line most teams cross without noticing. Free online OCR works by uploading your file to an external server. For a contract, a medical record, or a bank statement, that's a data-exposure question your compliance team would want to weigh in on.
This is the point where OCR turns into intelligent document processing: instead of producing a searchable picture, the system reads the document, understands its structure, and returns the specific fields as structured data. Modern document AI uses vision-language models to handle the messy layouts and handwriting that break basic OCR, processes large batches automatically, and — for sensitive files — can run on-premise so documents never leave your network. Korea Deep Learning's DEEP Agent is built for that step: it reads a scanned or photographed document in its original format, extracts the values you care about, and ties each one back to where it appeared on the page, so the output is data you can trust rather than text you still have to retype.
Conclusion
OCR a PDF and you turn a picture of text back into something you can search, copy, and use — and for a single clean scan, a free online tool, Google Docs, Acrobat, or a quick command line will all do the job. Match the method to the situation: one file, something already in Drive, layout-faithful editing, or bulk. The honest limit is that free OCR gives you searchable text, and most real work needs more than that — accurate reading of complex layouts and handwriting, hundreds of files at a time, structured data instead of a searchable image, and the ability to keep confidential documents private. When your PDFs cross that line, the question stops being "how do I OCR this" and becomes "how do I get reliable data out of it."
Free tools choke on the messy ones — complex tables, handwriting, hundreds of files. Send us the PDF that never works, and watch the data come back clean. See it on your own files → koreadeep.com.
Frequently asked questions
How do I OCR a PDF for free? Upload it to a free online OCR tool — Adobe Acrobat online, Smallpdf, iLovePDF, Foxit, or PDF24 — and download the searchable PDF, or open the file in Google Docs from Google Drive, which runs OCR automatically. Free tiers limit file size and daily usage, and they upload your file to an external server, so avoid them for confidential documents.
How do I make a scanned PDF searchable? Run OCR on it and save the result as a searchable PDF. OCR adds a hidden text layer behind the scanned image so search, copy, and screen readers work, while the page still looks the same. If search still fails afterward, the file was likely saved without that text layer — run OCR again and confirm you export as a searchable PDF.
Why doesn't search work after I OCR a PDF? Almost always because the file is still image-only — either OCR wasn't fully applied or it wasn't saved as a searchable PDF. A page can look like text on screen while being just a picture underneath. Re-run OCR and check that the text is selectable with your cursor before relying on search.
Can OCR read tables and handwriting in a PDF? Basic free OCR handles neat tables and printed text reasonably well but often scrambles complex or merged-cell tables and struggles with handwriting. Reading those reliably — and turning them into structured data rather than loose text — is what vision-language-based document AI is built for.
How do I OCR many PDFs at once? For bulk work, use a tool that supports batch processing or a scriptable command-line tool like OCRmyPDF that can run across a whole folder. Keep the scan resolution high and avoid heavy compression. For ongoing high volumes where you need extracted fields rather than just searchable files, a document AI pipeline is more practical than repeated manual uploads.
Is it safe to OCR confidential PDFs with free online tools? Free online OCR uploads your file to a third-party server to process it, which is a data-exposure risk for contracts, medical records, or financial documents. Reputable tools delete uploads after a short window, but for sensitive files use a local desktop tool or an on-premise document AI system that processes everything inside your own network.
How do I OCR a PDF to Word or plain text? Run OCR with a tool that offers export options and choose Word (.docx) or text (.txt) as the output instead of a searchable PDF. Adobe Acrobat, Smallpdf, and DeftPDF all export to editable formats. Expect to clean up formatting afterward — exporting to Word recovers the words reliably but rarely reproduces complex layouts perfectly.
Can I edit a PDF after I OCR it? Yes. Once OCR has added a real text layer, you can edit the text directly in a PDF editor, or export to Word, make your changes, and save back to PDF. Note that editing the recognized text is only as reliable as the recognition — if OCR misread a character, you'll be correcting it by hand, which is one more reason accuracy on the first pass matters.