JPG to Word: 5 Proven Methods to Convert Images (2026)
You have a JPG — maybe a scanned contract, a photographed whiteboard, or a textbook page from your phone camera — and you need the text inside a Word document you can actually edit. The bridge between image and editable text is OCR (Optical Character Recognition), and the method you pick determines whether you get clean paragraphs or a wall of garbled characters.
Here are five methods ranked by practicality, from the zero-install option you can use right now to the developer-grade pipeline for batch processing hundreds of scans. Each has real tradeoffs in accuracy, formatting preservation, and cost.
Method Comparison Table
| Method | Accuracy | Preserves Formatting | Handles Tables | Cost | Best For |
|---|---|---|---|---|---|
| Google Docs OCR | Good | Partial (loses columns/tables) | No | Free | Quick one-off conversions, no install |
| Microsoft OneNote | Good | Minimal (plain text only) | No | Free (with Microsoft account) | Extracting text from photos already in OneNote |
| Adobe Acrobat Pro | Excellent | Yes (layout, tables, fonts) | Yes | $22.99/mo | Scanned documents where formatting matters |
| Tesseract 5.3.4 CLI | Excellent (tuned) | No (raw text output) | With extensions | Free, open source (Apache 2.0) | Batch processing, developer pipelines |
| Online OCR tools | Varies | Partial | Some | Free tiers available | No-install, occasional use |
If you just need raw text and do not care about layout, Google Docs is the fastest path. If the original document has tables, columns, or specific fonts you need to preserve, Adobe Acrobat Pro is the only option that reliably handles complex layouts. For developers processing scanned images at scale, Tesseract in a script is the right answer.
Try it yourself
Convert between any image format instantly — free, instant, no signup. Your images never leave your browser.
Method 1: Google Docs (Free, Zero Install)
Google Docs has built-in OCR that most people do not know about. Upload a JPG to Google Drive, open it with Google Docs, and Google extracts the text automatically. No extension, no third-party tool.
Steps
- Open Google Drive and upload your JPG file.
- Right-click the uploaded image and select Open with > Google Docs.
- Google processes the image. The original image appears at the top; the extracted text appears below it.
- Select all the extracted text, copy it, and paste into a new Google Doc or Microsoft Word file.
- Alternatively, download the Google Doc directly as
.docxvia File > Download > Microsoft Word (.docx).
What to Expect
Google Docs OCR handles clean, high-contrast printed text well. It struggles with multi-column layouts — columns get merged into a single text stream. Tables come through as plain text with no cell structure. Handwritten text is mostly ignored unless it is exceptionally neat block lettering.
For better results, make sure your JPG is at least 300 DPI. You can check your image's DPI before uploading, and change the DPI if it is too low. Higher resolution gives the OCR engine more pixel data per character, which directly improves recognition accuracy.
Method 2: Microsoft OneNote (Free with Microsoft Account)
OneNote has a hidden OCR feature buried in the right-click menu. It is not marketed as an OCR tool, but it works surprisingly well for extracting text from pasted images.
Steps
- Open OneNote (desktop app, version 16.x or later — the Windows 10 Store app also works).
- Paste or insert your JPG image into a note page.
- Wait 30-60 seconds for OneNote to index the image (it processes OCR in the background).
- Right-click the image and select Copy Text from Picture.
- Paste the text into Microsoft Word or any text editor.
Limitations
OneNote gives you plain text only — no formatting, no tables, no font matching. It is best when you need the words and do not care about layout. Also, the "Copy Text from Picture" option sometimes takes a minute to appear while OneNote processes the image silently. If you do not see it, wait and try again.
Method 3: Adobe Acrobat Pro (Best for Formatted Output)
Adobe Acrobat Pro is the gold standard for JPG-to-Word conversion when the original document has tables, columns, headers, footers, or mixed formatting. It is the only option in this list that reliably reconstructs document layout in the Word output.
Steps
- Open Adobe Acrobat Pro (version 2024 or later, or current Acrobat Pro subscription).
- Go to File > Create > PDF from File and select your JPG.
- Once the PDF is created, go to Tools > Export PDF.
- Choose Microsoft Word (.docx) as the export format.
- Click Export. Acrobat runs OCR and exports a Word document with preserved layout.
Why It Costs Money
Acrobat Pro uses proprietary OCR that handles complex layouts — multi-column pages, tables with merged cells, mixed font sizes, headers and footers. The Word output often looks remarkably close to the original image. That level of layout reconstruction is genuinely difficult, and no free tool matches it consistently.
If you are converting scanned contracts, invoices with line-item tables, or academic papers with citations and footnotes, this is the tool that saves you from manually reformatting everything. For simpler text extraction, it is overkill.
Method 4: Tesseract CLI (Free, Open Source, Batch-Ready)
Tesseract is the open-source OCR engine maintained by Google. It runs on every major operating system and processes images from the command line — which makes it the best option for batch converting hundreds of JPGs to text files that you then compile into Word documents.
Installation
macOS (Homebrew):
brew install tesseract
# Version: tesseract 5.3.4
Ubuntu/Debian:
sudo apt install tesseract-ocr
# Version: tesseract 5.3.4 (check with tesseract --version)
Windows (Chocolatey):
choco install tesseract --version=5.3.4
Basic Usage
# Convert a single JPG to text
tesseract scan.jpg output -l eng
# This creates output.txt with the extracted text
Batch Processing Multiple JPGs
# Convert all JPGs in a folder to text files
for img in *.jpg; do
tesseract "$img" "${img%.jpg}" -l eng
echo "Processed: $img"
done
# Combine all text files into one
cat *.txt > combined-output.txt
After generating the text files, open combined-output.txt in Word or LibreOffice Writer, apply formatting, and save as .docx. This workflow handles hundreds of scanned pages in minutes.
Improving Tesseract Accuracy
Tesseract accuracy depends heavily on image quality. Before running OCR, optimize your images — ensure they are clean, high-contrast, and properly oriented:
- Resolution: 300 DPI minimum. Below 200 DPI, character recognition degrades noticeably. Check DPI here.
- Binarization: Convert to grayscale or black-and-white before OCR. Tesseract has built-in binarization, but pre-processing noisy images improves results.
- Deskew: Rotated text tanks accuracy. Straighten scanned images before processing.
- Language packs: For non-English text, install the appropriate language pack (
tesseract-ocr-deufor German,tesseract-ocr-jpnfor Japanese, etc.).
# Higher accuracy with page segmentation mode
tesseract scan.jpg output -l eng --psm 6 --oem 1
# --psm 6: assume a single uniform block of text
# --oem 1: use LSTM neural network engine (best accuracy)
If you need to extract text from images regularly, Tesseract in a shell script is the most scalable approach. Wrap it in a cron job or CI pipeline and you have a fully automated document conversion system.
Method 5: Online OCR Tools (No Install, Instant)
Several web-based tools convert JPG to Word directly in the browser. These are useful when you cannot install software and need a quick conversion.
Recommended Tools
| Tool | Free Tier | Output Formats | Max File Size | Registration |
|---|---|---|---|---|
| OnlineOCR.net | 15 pages/hour | Word, Excel, Text | 15 MB | Not required |
| i2OCR | Unlimited (with ads) | Word, Text, PDF | 10 MB | Not required |
| Adobe Acrobat Online | 1 free conversion | Word | 100 MB | Adobe account |
Privacy Consideration
Online OCR tools upload your image to their servers for processing. For sensitive documents — contracts, medical records, financial statements — use an offline method like Tesseract or desktop Acrobat instead. If your content is not sensitive, online tools are the fastest zero-install option.
For non-sensitive images that just need format conversion before OCR, Pixotter's image converter processes everything client-side — your files never leave your browser.
Image Quality Tips for Better OCR Results
OCR accuracy is 80% determined before the software even runs. The quality of your input JPG is the single biggest variable.
The Basics
- 300 DPI minimum. The industry standard for OCR-ready scans. At 150 DPI, small characters (footnotes, subscripts) become unreadable to OCR engines. Learn how to check image DPI and how to change it if needed.
- High contrast. Black text on white background gives the best results. Low-contrast combinations (gray on cream, light blue on white) cause misreads.
- Straight alignment. Even a 2-3 degree rotation reduces accuracy. Deskew your scans before OCR.
- Clean background. Stains, shadows, and fold creases create noise that OCR engines misinterpret as characters.
- No compression artifacts. Heavily compressed JPGs blur character edges. If your source image is low quality, resize it to a higher resolution or re-scan from the original. You can also compress images at a quality level that preserves text sharpness — 85% quality is usually the sweet spot.
Handling Tables in JPG-to-Word Conversion
Tables are the hardest element for OCR. Most free tools extract table text but lose the cell structure — you end up with numbers and labels in a random order.
What works for tables:
- Adobe Acrobat Pro reconstructs table structure in about 80% of cases.
- Tesseract with the
--psm 6flag (uniform text block) handles simple grids. - For complex tables with merged cells or irregular layouts, manual cleanup in Word is usually faster than fighting OCR software.
What does not work:
- Google Docs OCR flattens tables into comma-separated text.
- OneNote ignores table structure entirely.
Handwriting Recognition
Handwriting OCR has improved dramatically, but it is still unreliable for anything beyond neat, consistent printing. Google Docs and Google Lens handle block handwriting reasonably well. Cursive remains a challenge for every tool — Adobe Acrobat Pro and dedicated handwriting OCR services (like Google Cloud Vision API) produce the best results, but expect 70-85% accuracy on cursive at best.
If you regularly convert handwritten notes, consider a digital pen or tablet that captures strokes directly — the recognition accuracy is fundamentally higher when the software sees the writing happen rather than trying to reconstruct it from a photo.
Converting JPGs to Word in Bulk
If you have dozens or hundreds of scanned pages to convert, the manual approach does not scale. Here is the practical batch workflow:
- Organize your JPGs. Name them sequentially (
page-001.jpg,page-002.jpg) so the final document is in order. - Pre-process for quality. Resize undersized images and ensure 300 DPI. Convert from other formats to clean JPG if needed.
- Run Tesseract in batch:
# Process all pages, maintaining order
for img in $(ls page-*.jpg | sort); do
tesseract "$img" "${img%.jpg}" -l eng --oem 1
done
cat page-*.txt > manuscript.txt
- Import into Word. Open
manuscript.txtin Word, apply styles and formatting, insert page breaks between logical sections. - Spot-check. OCR errors cluster around the same characters:
lvs1,Ovs0,rnvsm. Use Find & Replace for common misreads.
For an alternative approach to document conversion, you can also convert images to PDF first, then use Word's built-in PDF import to get an editable document — Word's PDF-to-docx conversion preserves more layout than raw OCR text.
Frequently Asked Questions
Can I convert a JPG to Word without OCR?
No. A JPG is a grid of pixels — it contains no text data. OCR is the process of recognizing characters in those pixels. Every JPG-to-Word conversion uses OCR, whether the tool calls it that or not. Google Docs, Acrobat, and online converters all run OCR behind the scenes.
Which free method gives the best accuracy?
Google Docs OCR for single images, Tesseract 5.3.4 for batch processing. Google Docs requires no setup and handles most printed text well. Tesseract gives you more control over processing parameters and scales to thousands of images.
Why does my OCR output have so many errors?
Almost always an image quality issue. Check three things: DPI (needs to be 300+), contrast (black text on white), and alignment (no rotation). A blurry or low-resolution JPG will produce garbage output regardless of which OCR tool you use. See the image quality tips section above.
Can OCR handle multiple languages in one image?
Yes, but you need to specify the languages. In Tesseract: tesseract image.jpg output -l eng+fra for English and French. Google Docs auto-detects languages in most cases. Accuracy drops when two scripts mix on the same line (e.g., Latin and Cyrillic characters interleaved).
How do I convert a JPG to Word on my phone?
The fastest mobile option is Google Lens (built into the Google app on Android, available via the Google app on iOS). Point it at text, tap "Copy all," and paste into a Word document. Microsoft Lens (formerly Office Lens) is another strong option — it captures, straightens, and OCRs documents directly into Word format.
Does converting JPG to Word preserve the original layout?
Only Adobe Acrobat Pro reliably preserves complex layouts (tables, columns, headers). Free tools extract text but lose formatting. If layout matters, budget for Acrobat Pro or plan to manually reformat in Word after extraction.
Can I convert a screenshot to a Word document?
Yes — screenshots are just images. The same methods apply. Screenshots tend to have clean text and high contrast, which means OCR accuracy is usually excellent. If you are working with screenshots specifically, you might want a PDF intermediate step before converting to Word.
What is the maximum image size for OCR?
Google Docs handles images up to 2 MB. Tesseract has no built-in limit (it processes whatever your system memory allows). Online tools vary — most cap at 10-15 MB. If your JPG is too large, compress it while keeping the resolution at 300 DPI or above. Reducing file size without dropping resolution preserves OCR accuracy.
Try it yourself
Ready to convert formats? Drop your image and get results in seconds — free, instant, no signup. Your images never leave your browser.