mirror of
https://github.com/ivuorinen/cheatsheet-tldr.git
synced 2026-01-26 11:33:59 +00:00
34 lines
1.1 KiB
Plaintext
34 lines
1.1 KiB
Plaintext
---
|
|
syntax: markdown
|
|
tags: [tldr, common]
|
|
source: https://github.com/tldr-pages/tldr.git
|
|
---
|
|
# ocrmypdf
|
|
|
|
> Generate a searchable PDF or PDF/A from a scanned PDF or an image of text.
|
|
> More information: <https://ocrmypdf.readthedocs.io/en/latest/cookbook.html>.
|
|
|
|
- Create a new searchable PDF/A file from a scanned PDF or image file:
|
|
|
|
`ocrmypdf {{path/to/input}} {{path/to/output.pdf}}`
|
|
|
|
- Skip pages of a mixed-format input PDF file that already contain text:
|
|
|
|
`ocrmypdf --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`
|
|
|
|
- Clean, de-skew, and rotate pages of a poor scan:
|
|
|
|
`ocrmypdf --clean --deskew --rotate-pages {{path/to/input.pdf}} {{path/to/output.pdf}}`
|
|
|
|
- Perform lossy optimization on a PDF without performing any OCR:
|
|
|
|
`ocrmypdf --tesseract-timeout 0 --optimize 2 --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`
|
|
|
|
- Set the metadata of a searchable PDF file:
|
|
|
|
`ocrmypdf --title "{{title}}" --author "{{author}}" --subject "{{subject}}" --keywords "{{keyword; key phrase; ...}}" {{path/to/input.pdf}} {{path/to/output.pdf}}`
|
|
|
|
- Display help:
|
|
|
|
`ocrmypdf --help`
|