@radzor/document-ocr
Extract text and structured data from PDFs, images, and scanned documents. Supports Tesseract (local), Google Vision, and Azure Document Intelligence. Handles multi-page PDFs, tables, and form fields.
Install
View source on GitHub →Inputs
| Name | Type | Default | Description |
|---|---|---|---|
| provider | 'tesseract' | 'google-vision' | 'azure' | tesseract | OCR provider. tesseract runs locally; google-vision and azure require API credentials. |
| apiKey | string | — | API key for Google Vision. Not required for tesseract.GOOGLE_API_KEY |
| azureEndpoint | string | — | Azure Document Intelligence endpoint URL. |
| azureKey | string | — | Azure Document Intelligence API key. |
| language | string | eng | Language hint for OCR (ISO 639-3 for Tesseract, BCP-47 for cloud providers). |