API Access

Integrate Roysa's traceable multimodal AI — documents, images, audio, and video — into your applications

API Credits

Price: 8.5¢ per credit
Minimum purchase: $10 (≈ 118 credits, or about 118 pages at 1 credit per page)
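
As a rough check on the pricing above, the arithmetic can be sketched in a few lines. The function names `credits_for` and `cost_of` are hypothetical helpers, not part of the Roysa API, and how fractional credits are rounded at purchase time is not stated here, so the result is left unrounded.

```python
from decimal import Decimal

PRICE_PER_CREDIT_USD = Decimal("0.085")  # 8.5 cents per credit, from the pricing above
MIN_PURCHASE_USD = Decimal("10")         # minimum purchase, from the pricing above


def credits_for(amount_usd: str) -> Decimal:
    """Return how many credits a dollar amount buys, unrounded."""
    amount = Decimal(amount_usd)
    if amount < MIN_PURCHASE_USD:
        raise ValueError(f"minimum purchase is ${MIN_PURCHASE_USD}")
    return amount / PRICE_PER_CREDIT_USD


def cost_of(credits: int) -> Decimal:
    """Return the dollar cost of a whole number of credits."""
    return credits * PRICE_PER_CREDIT_USD
```

For example, `credits_for("10")` comes to about 117.6 credits, which matches the "≈ 118 pages" figure at 1 credit per page.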

Your API Key

Beta

Quick Start

Works with documents, images, audio & video

Endpoint: POST /extract
Input: PDF, Images (JPG, PNG, WEBP)
Output: JSON (fields, confidence, bounding boxes)
Cost: 1 credit / page

Extract structured fields from a document or image:
curl -X POST https://api.roysa.net/extract \
  -H "X-API-Key: YOUR_API_KEY" \
  -F "file=@invoice.pdf" \
  -F 'fields=[{"name":"Vendor"},{"name":"Invoice Number"},{"name":"Total Amount"},{"name":"Due Date"}]'
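
The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: `build_extract_request` is a hypothetical helper, the multipart field names `file` and `fields` are taken from the curl example above, and the per-part content type is guessed from the filename.

```python
import json
import mimetypes
import urllib.request
import uuid

EXTRACT_URL = "https://api.roysa.net/extract"  # endpoint from the curl example


def build_extract_request(api_key, filename, file_bytes, field_names):
    """Build (but do not send) a multipart POST equivalent to the curl call."""
    boundary = uuid.uuid4().hex
    # Same shape as the curl example: a JSON list of {"name": ...} objects.
    fields_json = json.dumps([{"name": name} for name in field_names])
    # Assumption: the server accepts whatever type is guessed from the extension.
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    )
    fields_part = (
        f"\r\n--{boundary}\r\n"
        'Content-Disposition: form-data; name="fields"\r\n\r\n'
        f"{fields_json}\r\n"
        f"--{boundary}--\r\n"
    )
    body = file_header.encode() + file_bytes + fields_part.encode()
    return urllib.request.Request(
        EXTRACT_URL,
        data=body,
        method="POST",
        headers={
            "X-API-Key": api_key,
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To send it, pass the returned object to `urllib.request.urlopen` and parse the response body with `json.load`. The exact response layout beyond "fields, confidence, bounding boxes" is not described here, so inspect a real response before relying on specific keys.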

Endpoint Reference

| Task | Endpoint | Accepts | Returns | Cost |
|---|---|---|---|---|
| Extract | POST /extract | PDF, Images | JSON: fields, confidence, bboxes | 1 credit / page |
| Ask / Q&A | POST /document-ask | PDF, Images, transcript_text | JSON: answer, references, session_id | 1 credit / request |
| Geo | POST /document-ask | PDF, Images, transcript_text | JSON: locations with bboxes | 1 credit / request |
| Redact | POST /document-ask | PDF, Images, transcript_text | JSON: sensitive items with bboxes | 1 credit / request |
| Review | POST /document-ask | PDF, Images, transcript_text | JSON: score, findings, recommendations | 1 credit / request |
| Transcribe | POST /transcribe | Audio, Video | PDF: speaker-labeled transcript | 1 credit / request |
| Translate | POST /translate-document | PDF, Images | PDF: translated document | 1 credit / request |
| Process Media | POST /process-media | Audio, Video | JSON: transcript_segments, video_intelligence | 1 credit / request |
| Extract from Media | POST /extract-media | transcript_segments (JSON body) | JSON: fields, confidence, timestamps | 1 credit / request |

Authentication: Pass your key in either of two headers: X-API-Key: rk_... or Authorization: Bearer rk_...
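
The two header styles can be sketched as a small helper. `auth_headers` is a hypothetical name, and the `rk_` prefix check assumes every key starts the way the examples above do.

```python
def auth_headers(api_key: str, scheme: str = "x-api-key") -> dict:
    """Return the HTTP header that carries the API key."""
    # Assumption: all keys start with "rk_", as in the examples above.
    if not api_key.startswith("rk_"):
        raise ValueError("expected a key starting with 'rk_'")
    if scheme == "x-api-key":
        return {"X-API-Key": api_key}
    if scheme == "bearer":
        return {"Authorization": f"Bearer {api_key}"}
    raise ValueError(f"unknown scheme: {scheme!r}")
```

Either form should authenticate the same way; pick whichever fits your HTTP client.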

Audio/Video with Ask / Geo / Redact / Review: Call POST /process-media first to get transcript_segments, join the text, then send it as transcript_text to POST /document-ask.
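
The "join the text" step might look like the sketch below. It assumes each entry in `transcript_segments` is an object with a `text` key; the segment schema is not spelled out here, so check a real /process-media response first. `join_transcript` is a hypothetical helper name.

```python
def join_transcript(segments) -> str:
    """Join segment texts into one string for the transcript_text parameter."""
    # Assumption: each segment is a dict with a "text" key; empty ones are skipped.
    texts = (seg.get("text", "").strip() for seg in segments)
    return " ".join(text for text in texts if text)


# Example with made-up segments in the assumed shape.
segments = [{"text": " Hello "}, {"text": ""}, {"text": "world."}]
transcript_text = join_transcript(segments)
```

The resulting string is what you would send as `transcript_text` to POST /document-ask.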

Sessions: Every /document-ask response includes a session_id. Reuse it for follow-up questions on the same document; no re-upload is needed, and each question costs 1 credit.

Supported audio: MP3, WAV, FLAC, OGG, M4A, AAC, WMA, OPUS  ·  Video: MP4, MOV, AVI, MKV, WEBM, WMV, FLV, M4V

Error codes: 402 Insufficient credits  ·  400 Bad request  ·  500 Processing error
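
A client might translate these status codes into exceptions along the following lines. `RoysaAPIError` and `raise_for_status` are hypothetical names; only the three codes listed above are documented, so anything else is reported as undocumented rather than guessed at.

```python
DOCUMENTED_ERRORS = {
    400: "Bad request",
    402: "Insufficient credits",
    500: "Processing error",
}


class RoysaAPIError(Exception):
    """Raised for any non-2xx status returned by the API."""

    def __init__(self, status: int, meaning: str):
        super().__init__(f"{status} {meaning}")
        self.status = status
        self.meaning = meaning


def raise_for_status(status: int) -> None:
    """Do nothing on 2xx; otherwise raise with the documented meaning."""
    if 200 <= status < 300:
        return
    meaning = DOCUMENTED_ERRORS.get(status, "Undocumented error")
    raise RoysaAPIError(status, meaning)
```

Catching `RoysaAPIError` and checking `err.status == 402` gives you a single place to prompt for a credit top-up.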

Rate Limits

Requests per minute: 100
Requests per day: 1M
Max file size: 50 MB
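
To stay under these limits on the client side, one option is a sliding-window throttle plus a file size check before upload. This is a sketch under stated assumptions: how the server actually counts its window is not documented here, 1 MB is taken to mean 1024 × 1024 bytes, and `MinuteThrottle` and `fits_size_limit` are hypothetical names.

```python
import time
from collections import deque

MAX_REQUESTS_PER_MINUTE = 100        # from the limits above
MAX_FILE_BYTES = 50 * 1024 * 1024    # assumes 1 MB = 1024 * 1024 bytes


class MinuteThrottle:
    """Track request times and report how long to wait before the next one."""

    def __init__(self, limit=MAX_REQUESTS_PER_MINUTE, window=60.0, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock
        self.sent = deque()  # timestamps of requests inside the window

    def wait_time(self) -> float:
        """Return seconds to wait until another request fits in the window."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            return 0.0
        return self.window - (now - self.sent[0])

    def record(self) -> None:
        """Note that a request was just sent."""
        self.sent.append(self.clock())


def fits_size_limit(num_bytes: int) -> bool:
    """Return True if a file of this size is within the upload limit."""
    return num_bytes <= MAX_FILE_BYTES
```

Before each call, sleep for `throttle.wait_time()` seconds, send the request, then call `throttle.record()`. The daily limit is not tracked in this sketch.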

Need higher limits? Contact us for custom plans.