Six modules.
One coherent platform.
Each Vision OCR product is built to stand alone — and to compose seamlessly with the rest. Pick what you need today, expand when you're ready.
Document OCR Engine
Printed text, every angle, every quality.
Our flagship engine extracts text from scans, photos, faxes and PDFs at human-grade accuracy. Auto-deskew, denoise and orientation detection are built-in — no preprocessing required.
- 99.4% character-level accuracy
- 120+ languages, 14 scripts
- Tables, forms, key-value pairs
- Auto deskew, denoise, rotation
Handwriting Recognition
Cursive. Print. Scrawled. We read it.
A purpose-built HTR (Handwritten Text Recognition) model trained on 50M+ samples across English, Hindi, Devanagari, Arabic and CJK scripts. Cursive, mixed-script, and noisy inputs handled.
- English, Hindi, Marathi, Tamil, Bengali
- Cursive & printed mixed-script
- Confidence scores per word
- Custom domain fine-tuning
Multilingual & Translation
Borders shouldn't break your pipeline.
Detect script, language, and translate in a single API call. Built for global enterprises that move documents across regions every day.
- Auto language detection
- 120 source languages
- Inline translation pipeline
- Right-to-left script support
Developer APIs
Drop-in REST & gRPC. SDKs for every stack.
Battle-tested endpoints with predictable latency, idempotent semantics, and webhooks. SDKs for Python, Node.js, Go, Java and Ruby — with first-class TypeScript types.
- REST + gRPC endpoints
- Webhooks for async jobs
- SDKs: Python, Node, Go, Java
- 99.99% uptime SLA
Document Management Suite
From inbox to indexed in seconds.
Classify incoming documents, route them to the right workflow, and archive them with full-text search. Includes audit trails, redaction, and retention policies out of the box.
- Auto classification (50+ types)
- Workflow routing rules
- Full-text search & retention
- PII redaction & audit logs
Enterprise & Compliance
Run it where your data already lives.
Deploy on-prem, in your VPC, or in our private cloud. SOC 2 Type II certified. GDPR, HIPAA and DPDP-ready. Customer-managed encryption keys supported.
- On-prem & VPC deployment
- SOC 2 · GDPR · HIPAA · DPDP
- BYOK encryption
- Single-tenant available
Three lines of code.
Production-ready output.
No proprietary file formats. No bespoke SDK quirks. A single endpoint that returns clean, typed JSON — every time.
from visionocr import VisionOCR
client = VisionOCR(api_key="sk_live_…")
result = client.extract(
file="invoice.pdf",
schema="invoice",
languages=["en", "hi"],
)
# typed JSON, ready for your DB
print(result.fields)
# {
# "invoice_number": "INV-2025-04471",
# "total": 7839.00,
# "currency": "USD",
# "confidence": 0.987
# }