Use AgentRouter to extract OCR text, layout, equations, and handwriting from scanned or visual documents through one routed document API.
3
Live Providers
12
Live Routes
10 credits ($0.01)
OCR Starts At
10 credits ($0.01)
Layout Extract
The live document catalog currently exposes 4 capabilities for AI agents. OCR Starts At currently starts at 10 credits ($0.01), and layout extract currently starts at 10 credits ($0.01).
Availability Note
All 12 document routes are live now across Mathpix, OpenAI, and Gemini.
What Is OCR & Document Understanding?
Agent document understanding is the operational layer that lets an AI agent recover text, layout, tables, equations, and handwriting from visual or scanned documents programmatically.
Document workflows often split across different providers. One route may be best for OCR, another for equations, another for layout, and another for handwriting recovery. That fragmentation forces the agent to carry provider-specific logic for tasks that should stay behind one contract.
AgentRouter keeps those jobs behind one routed document surface so the agent can recommend a route before execution and switch providers as cost or quality changes.
Top Scenarios
Recover readable text when the workflow starts with a scan, screenshot, or photographed document.
Gemini
MPP
10 credits ($0.01)
Mathpix
MPP
10 credits ($0.01)
OpenAI
MPP
10 credits ($0.01)
Receipts, invoices, and general scan ingestion
Coverage: 1 capabilities · 3 live routes
Start with OCR
Return structural blocks and tables when the agent needs more than plain OCR text.
Gemini
MPP
10 credits ($0.01)
Mathpix
MPP
10 credits ($0.01)
OpenAI
MPP
10 credits ($0.01)
Invoices, forms, and structured document parsing
Coverage: 1 capabilities · 3 live routes
Start with Layout Extract
Use the math-specific route when the document contains equations instead of general prose.
Gemini
MPP
10 credits ($0.01)
Mathpix
MPP
10 credits ($0.01)
OpenAI
MPP
10 credits ($0.01)
Math, research, and technical paper workflows
Coverage: 1 capabilities · 3 live routes
Start with Equation Parse
Recover handwritten content when generic OCR is not the right fit.
Gemini
MPP
10 credits ($0.01)
Mathpix
MPP
10 credits ($0.01)
OpenAI
MPP
10 credits ($0.01)
Notes, whiteboards, and handwritten forms
Coverage: 1 capabilities · 3 live routes
Start with Handwriting Parse
Provider Comparison
The rows below are derived from the live route catalog currently exposed through AgentRouter.
| Provider | Best For | Key Capabilities | Starting Price | Available Via |
|---|---|---|---|---|
G Gemini Covers OCR, Equation Parse, Layout Extract, Handwriting Parse through the live routed catalog. 4 live routes | OCR and Equation Parse | OCR, Equation Parse, Layout Extract, Handwriting Parse Modes: Managed | 10 credits ($0.01) | MPP |
M Mathpix Covers OCR, Equation Parse, Layout Extract, Handwriting Parse through the live routed catalog. 4 live routes | OCR and Equation Parse | OCR, Equation Parse, Layout Extract, Handwriting Parse Modes: Managed |
Quick Start
Get an API key, ask AgentRouter to recommend the best route for the job, and execute through one shared wallet instead of wiring each provider separately.
Step 1
Enable the API once
Create one AgentRouter API key that can be reused across all live domains and capabilities.
Step 2
Recommend the route
Let AgentRouter compare the current routes, prices, and route availability before the first execute call.
Step 3
Execute the capability
Run the selected workflow through the chosen route while AgentRouter handles pricing, wallet debits, and upstream settlement.
Works with
Pricing By Task
These examples translate the current live route prices into workflow questions operators usually evaluate before shipping a domain into an agent.
| Task | Route | Price | Notes |
|---|---|---|---|
| OCR one document image | Gemini (document.ocr.gemini.mpp) | 10 credits ($0.01) | Recover readable text from a scan or photo. |
| Extract layout and tables | Gemini (document.layout.extract.gemini.mpp) | 10 credits ($0.01) | Return higher-level structure instead of plain text only. |
| Parse one equation-heavy page | Gemini (document.equation.parse.gemini.mpp) | 10 credits ($0.01) | Best when mathematical notation matters. |
| Parse one handwritten note | Gemini (document.handwriting.parse.gemini.mpp) | 10 credits ($0.01) | Use when pen-written content is the primary input. |
Pricing note: 1000 credits = $1 USD. Raw API responses still return numeric credit fields such as creditsCharged.
Why AgentRouter
Recommendation happens before execution, so the agent can compare current route supply and pricing instead of hard-coding one provider forever. The same routed contract stays stable even as provider coverage or transport paths change underneath.
FAQ
It is the routed layer that lets an AI agent recover OCR text, page layout, equations, and handwriting from visual documents through one document domain.
Use OCR when you mainly need readable text. Use layout extract when the agent also needs tables, blocks, and document structure.
Use equation parse when the input contains mathematical notation and the goal is a machine-readable formula output rather than plain text recovery.
No. AgentRouter handles route recommendation and execution across the currently live document routes behind one wallet.
Reference
Use the capability browser below when you want contract-level detail: endpoints, live example modals, route breakdowns, and exact route prices.
| Capability | Endpoints | Price |
|---|---|---|
ocr Extract text from document images or scans. | GET /api/agentic-api/domains/document/capabilities/ocrRead the capability contract, current lifecycle status, and machine-readable metadata for this document workflow. | Free |
POST /api/agentic-api/domains/document/capabilities/ocr/recommendAsk AgentRouter to compare eligible routes inside this capability and return the best recommendation before execution. | Free | |
POST /api/agentic-api/domains/document/capabilities/ocr/executeExecute this capability through the selected route while AgentRouter handles pricing, wallet debits, and upstream settlement. | ||
equation.parse Extract mathematical notation and return LaTeX or comparable structured output. | GET /api/agentic-api/domains/document/capabilities/equation-parseRead the capability contract, current lifecycle status, and machine-readable metadata for this document workflow. | Free |
POST /api/agentic-api/domains/document/capabilities/equation-parse/recommendAsk AgentRouter to compare eligible routes inside this capability and return the best recommendation before execution. | Free | |
POST /api/agentic-api/domains/document/capabilities/equation-parse/executeExecute this capability through the selected route while AgentRouter handles pricing, wallet debits, and upstream settlement. | ||
layout.extract Extract higher-level document structure such as blocks, sections, tables, or fields. | GET /api/agentic-api/domains/document/capabilities/layout-extractRead the capability contract, current lifecycle status, and machine-readable metadata for this document workflow. | Free |
POST /api/agentic-api/domains/document/capabilities/layout-extract/recommendAsk AgentRouter to compare eligible routes inside this capability and return the best recommendation before execution. | Free | |
POST /api/agentic-api/domains/document/capabilities/layout-extract/executeExecute this capability through the selected route while AgentRouter handles pricing, wallet debits, and upstream settlement. | ||
handwriting.parse Extract text from handwriting or pen-like input. | GET /api/agentic-api/domains/document/capabilities/handwriting-parseRead the capability contract, current lifecycle status, and machine-readable metadata for this document workflow. | Free |
POST /api/agentic-api/domains/document/capabilities/handwriting-parse/recommendAsk AgentRouter to compare eligible routes inside this capability and return the best recommendation before execution. | Free | |
POST /api/agentic-api/domains/document/capabilities/handwriting-parse/executeExecute this capability through the selected route while AgentRouter handles pricing, wallet debits, and upstream settlement. |
Related Pages
AgentRouter landing page
Browse all live domains, top API cards, and route coverage across the platform.
Install and auth
Enable AgentRouter, create the API key, and connect the shared wallet flow.
Phone, SMS & Voice domain
Create managed phone agents, provision numbers, inspect conversations and calls, and keep telephony workflows behind one supplier-neutral phone domain.
Web Crawl & Extraction domain
Fetch, scrape, extract, crawl, map, browse, and screenshot public pages through one routed surface.
10 credits ($0.01)
| MPP |
O OpenAI Covers OCR, Equation Parse, Layout Extract, Handwriting Parse through the live routed catalog. 4 live routes | OCR and Equation Parse | OCR, Equation Parse, Layout Extract, Handwriting Parse Modes: Managed | 10 credits ($0.01) | MPP |