OCR API

An OCR API that returns data you can verify

One REST call returns structured JSON where every field carries a bounding box and a match score. Bearer auth, built-in templates, async jobs, signed webhooks.

Most OCR APIs hand you a wall of text and a confidence number for the whole page. You still have to find the invoice total, parse it, and hope it landed in the right place. The OCR API in space-ocr does the structuring for you: one POST with an image and a template, and you get back typed fields as JSON.

The part that matters for production is what rides along with each value. Every field comes back with the exact box on the page it was read from, the four corners of that box, and a match score. So your pipeline doesn't have to trust a model's word — it can check each value against where it actually sits on the document.

A real response you can inspect

The sample invoice on this page is a real parsed result, not a mock. Each field's box on the invoice marks where that value was read: the billing name ソジュハンザン海物語様, the amount due ¥84,263, the total ¥46,752, and each line item are all returned with their own box and match score.

Verified fields

Each value with a box carries a verified on-page location (bbox, 4-point vertices, and match_ratio) on a 0–1000 normalized grid, where (0,0) is the top-left corner and (1000,1000) the bottom-right. This is the same shape the live API returns, so any field can be traced back to the pixels it came from.

One call, JSON with boxes

POST /ocr/fields with one image and get typed fields back. Each value carries its bbox, so you skip the second pass of finding where things are.

bbox, vertices, match_ratio

Every field returns xmin/ymin/xmax/ymax on a 0–1000 grid, four oriented vertices that follow the page tilt, and a match_ratio you can threshold on.
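
To draw a box on the original image, the 0–1000 grid has to be scaled back to pixels. The following is a minimal sketch, assuming the bbox arrives as a mapping with the xmin/ymin/xmax/ymax keys named above. The helper name bbox_to_pixels and the sample numbers are illustrative and not part of the API.

```python
def bbox_to_pixels(bbox, width, height):
    """Scale a 0-1000 normalized bbox to integer pixel coordinates.

    Returns (left, top, right, bottom) for an image of the given size.
    """
    return (
        round(bbox["xmin"] * width / 1000),
        round(bbox["ymin"] * height / 1000),
        round(bbox["xmax"] * width / 1000),
        round(bbox["ymax"] * height / 1000),
    )


# Hypothetical field box on a 2000 x 3000 pixel page image.
box = {"xmin": 100, "ymin": 250, "xmax": 400, "ymax": 300}
print(bbox_to_pixels(box, 2000, 3000))  # (200, 750, 800, 900)
```

Because the grid is normalized, the same response can be mapped onto a thumbnail or a full-resolution scan just by changing width and height.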

Built-in templates

Pass a templateId — receipt, invoice, delivery, business_card, driver_license, and more — or send your own fields, including an array field for line items.

Async jobs + signed webhooks

POST /upload to queue images, get a job per file, and receive an HMAC-SHA256 signed webhook on completion — or poll GET /jobs/{jobId}.
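
A receiver should check the signature before trusting a webhook body. The sketch below shows the general HMAC-SHA256 check using Python's standard library. This page does not say which header carries the signature, how it is encoded, or exactly which bytes are signed, so the hex encoding and the "raw request body" input are assumptions; verify_webhook is a hypothetical helper name.

```python
import hashlib
import hmac


def verify_webhook(secret: bytes, raw_body: bytes, signature_hex: str) -> bool:
    """Return True if signature_hex is the HMAC-SHA256 of raw_body.

    Assumes the signature is a hex digest computed over the raw body bytes.
    """
    expected = hmac.new(secret, raw_body, hashlib.sha256).hexdigest()
    # compare_digest runs in constant time, which avoids leaking how many
    # leading characters of a forged signature were correct.
    return hmac.compare_digest(expected.encode(), signature_hex.strip().lower().encode())


# Self-contained demonstration with a made-up secret and body.
secret = b"example-secret"
body = b'{"jobId": "job_1"}'
good = hmac.new(secret, body, hashlib.sha256).hexdigest()
print(verify_webhook(secret, body, good))         # True
print(verify_webhook(secret, body + b" ", good))  # False
```

Verify against the bytes exactly as received; re-serializing parsed JSON can change whitespace or key order and break the comparison.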

CSV and JSON exports

JSON over REST, plus CSV export of a stored sheet with a UTF-8 BOM (Excel- and CJK-safe), where line items unfold into sub-rows.
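
The BOM that keeps Excel happy can trip up other CSV readers, which may treat it as part of the first column name. A small sketch of reading such an export in Python follows. The column names billing_name and total are hypothetical, and the sample row only borrows values from the demo invoice for illustration.

```python
import csv
import io


def read_export(raw_bytes):
    """Parse a BOM-prefixed UTF-8 CSV export into a list of dict rows."""
    # "utf-8-sig" strips a leading BOM if present; plain "utf-8" would leave
    # U+FEFF attached to the first header name.
    text = raw_bytes.decode("utf-8-sig")
    return list(csv.DictReader(io.StringIO(text, newline="")))


# Illustrative export bytes; real column names depend on the sheet.
sample = "\ufeffbilling_name,total\r\nソジュハンザン海物語様,46752\r\n".encode("utf-8")
rows = read_export(sample)
print(rows[0]["billing_name"], rows[0]["total"])
```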

Languages on autopilot

Japanese, Korean, Chinese, and English in one engine — no language hint to set, mixed scripts and full-width characters handled.

How the OCR API works in space-ocr

Authenticate with a Bearer token — your key is prefixed spocr_ — against the base URL https://api.space-ocr.com. Send one raster image to POST /ocr/fields as a URL or base64 (the public API takes images — JPEG, PNG, GIF, BMP, TIFF, WebP — so for a PDF you send page images). Pass a built-in templateId or your own fields, and you get back { status: 'success', data: {...} } with a value, bbox, vertices, and match_ratio per field.
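
For a local file rather than a URL, the request body carries the image as base64. The sketch below builds that body. The image, imageType, and templateId keys come from the examples on this page, but the imageType value "base64" is an assumption (only "url" is shown here), and build_payload is a hypothetical helper.

```python
import base64


def build_payload(image_bytes: bytes, template_id: str) -> dict:
    """Build a JSON body for POST /ocr/fields from raw image bytes."""
    return {
        # Plain base64 only: no "data:image/png;base64," prefix.
        "image": base64.b64encode(image_bytes).decode("ascii"),
        "imageType": "base64",  # assumed value; check the API docs
        "templateId": template_id,
    }


# In practice image_bytes would come from open(path, "rb").read().
payload = build_payload(b"\x89PNG\r\n\x1a\n", "invoice")
print(payload["imageType"], payload["templateId"])
```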

The coordinates aren't invented by the model. The LLM returns each value plus the word-token ids it used; a character matcher then aligns that value against the symbols Google Vision actually detected on the page and scores the coverage as the match_ratio. A score of 0.85 or higher is a confident match, and 1.0 means every character was located. Every response also carries an X-Request-Id header, and errors come back as { error: { code, message, requestId } }.
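
Since every error shares one envelope and every response carries a request id, client code can turn failures into a single exception type that is easy to log. This is a sketch under the shapes described above; OcrApiError and error_from_response are hypothetical names, and the sample code and message strings are made up.

```python
class OcrApiError(Exception):
    """Carries the fields of the { error: { code, message, requestId } } envelope."""

    def __init__(self, status, code, message, request_id):
        super().__init__(f"HTTP {status} {code}: {message} (request {request_id})")
        self.status = status
        self.code = code
        self.message = message
        self.request_id = request_id


def error_from_response(status, body, headers):
    """Build an OcrApiError from a decoded JSON body and response headers."""
    err = body.get("error") if isinstance(body, dict) else None
    if not isinstance(err, dict):
        err = {}  # body was missing or not in the expected shape
    # Prefer the id inside the envelope; fall back to the X-Request-Id header.
    # A plain dict is case-sensitive; requests' resp.headers is not.
    request_id = err.get("requestId") or headers.get("X-Request-Id")
    return OcrApiError(status, err.get("code", "unknown"), err.get("message", ""), request_id)


exc = error_from_response(
    401,
    {"error": {"code": "unauthorized", "message": "bad key", "requestId": "req_123"}},
    {"X-Request-Id": "req_123"},
)
print(exc.code, exc.request_id)  # unauthorized req_123
```

Keeping the request id on the exception means it ends up in logs and support tickets without extra plumbing.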

extract fields from an image

curl -s https://api.space-ocr.com/ocr/fields \
  -H "Authorization: Bearer $SPACE_OCR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "image": "https://example.com/invoice.png",
    "imageType": "url",
    "templateId": "invoice"
  }'

the same call in Python

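import os

import requests

# Read the key from the environment so it never lands in source control.
api_key = os.environ["SPACE_OCR_API_KEY"]

resp = requests.post(
    "https://api.space-ocr.com/ocr/fields",
    headers={"Authorization": f"Bearer {api_key}"},
    json={
        "image": "https://example.com/invoice.png",
        "imageType": "url",
        "templateId": "invoice",
    },
    timeout=60,
)
resp.raise_for_status()  # raise on 4xx/5xx instead of reading an error body as data

# Each entry in data is one extracted field with its value, box, and score.
for name, field in resp.json()["data"].items():
    print(name, field["value"], field["bbox"], field["match_ratio"])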

How to call the OCR API

1. Get an API key
   Sign in and create a key; it is prefixed spocr_. Send it as Authorization: Bearer <key> on every request to https://api.space-ocr.com.
2. Send an image
   POST /ocr/fields with image (a URL or pure base64) and imageType. For a PDF, send the page images; the API takes raster formats (JPEG, PNG, GIF, BMP, TIFF, WebP).
3. Pick a template or fields
   Pass a built-in templateId like 'invoice' or 'receipt', or supply your own fields, including an array field with children for line-item tables.
4. Read the structured result
   You get { status: 'success', data: {...} } where each value carries its bbox, vertices, match_ratio, and bbox_source. Threshold on match_ratio to flag anything below 0.85.
5. Scale out and query
   Queue many images with POST /upload (job per file, signed webhooks or GET /jobs/{jobId}), then read a stored sheet with GET /view using where, sort, and select. Reading it back involves no re-OCR and no extra charge.
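
The thresholding mentioned when reading the structured result can be sketched as a small triage step that accepts confident values and routes the rest to review. It assumes each entry in data is a mapping with value and match_ratio keys, as in the response shape described on this page; the triage helper and the sample field names are hypothetical.

```python
def triage(data, threshold=0.85):
    """Split fields into accepted values and fields that need review."""
    accepted, review = {}, {}
    for name, field in data.items():
        # Array fields or entries without a score are sent to review as-is.
        ratio = field.get("match_ratio") if isinstance(field, dict) else None
        if isinstance(ratio, (int, float)) and ratio >= threshold:
            accepted[name] = field["value"]
        else:
            review[name] = field
    return accepted, review


# Illustrative data; the scores here are made up.
data = {
    "total": {"value": "46752", "match_ratio": 1.0},
    "billing_name": {"value": "ソジュハンザン海物語様", "match_ratio": 0.6},
}
accepted, review = triage(data)
print(sorted(accepted), sorted(review))  # ['total'] ['billing_name']
```

Anything in review still carries its box, so a reviewer can be shown the exact region of the page to check.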

Simple, predictable pricing

Pay $0.05 per image (¥10 / ₩100), with a free tier of 100 scans a month and no credit card. Reading a stored sheet back with GET /view doesn't re-OCR and isn't charged. Flat plans add monthly scans, more sheets, and storage.

Plan      Price            Scans / month   Sheets   Storage
Free      Free (no card)   100             3        1 GB
Starter   $19/mo           400             10       10 GB

Ship OCR that returns checkable data

Free tier — 100 scans a month, no credit card. Every field comes back with its box and a match score.

