Now in production - API keys available

Voice in.
Intelligence out.

Transcription, diarization, PCI compliance detection, financial extraction, and sentiment analysis - one API call, $0.49 per audio hour.

97+
Languages supported
$0.49
Per audio hour
216×
Faster than realtime
12s
For a 46-min call
Everything included

One API call.
Full intelligence.

Upload audio, get a structured JSON response with transcription, diarization, compliance flags, financial data, sentiment, and action items - all for $0.49/hr.

AI Diarization

Automatically identifies Agent vs. Customer and labels every line. Handles cross-talk, hold music, and accented speech.

included

PCI Compliance

Detects credit card numbers, CVVs, and expiration dates spoken during calls. Flags sensitive data for automatic redaction or audit.

included

Financial Extraction

Pulls payment amounts, recurring charges, pending balances, card types, and billing dates directly from conversation context.

included

Sentiment Analysis

Customer sentiment (positive/negative/neutral) and agent performance scoring on every call. Track quality at scale.

included

Word-Level Timestamps

Every word gets a precise start/end timestamp. Power keyword search, compliance audits, and QA review with millisecond accuracy.

included

Structured JSON Output

Every response is a structured JSON object - call summary, type, outcome, customer info, agent info, agreements, action items, and key issues.

included
How it works

Three steps.
No infrastructure required.

Upload your audio

Send any audio file - WAV, MP3, OGG, FLAC. We auto-detect language. Files up to 4 hours, up to 25 MB. Stereo or mono.

Get everything back - instantly

In one synchronous response: diarized transcript, word timestamps, call summary, financial data, compliance flags, sentiment, action items. A 46-minute call returns in ~12 seconds.

Build on top

Pipe the structured JSON into your CRM, dashboard, coaching tool, or compliance system. Every field is machine-readable and ready to store.

Developer experience

One API call. Full intelligence.

Standard REST. Upload a file, get structured JSON. No SDKs, no webhooks, no polling. Works with cURL, Python, Node, anything.

python
import requests

response = requests.post(
    "https://api.voxparse.com/v1/transcribe",
    headers={"X-API-Key": "vxp_..."},
    files={"file": open("call_recording.mp3", "rb")},
)

data = response.json()
ai = data["ai_analysis"]

print(f"Summary: {ai['call_summary']}")
print(f"Customer: {ai['customer']['name']}")
print(f"Sentiment: {ai['sentiment']['customer_sentiment']}")
print(f"PCI flags: {ai['compliance']['sensitive_data_shared']}")
print(f"Payment today: {ai['financial']['payment_today']}")
Simple pricing

One plan. Everything included.
No subscriptions.

Prepay any amount. Usage is deducted at $0.49/hr. Every feature - diarization, compliance, financial extraction, sentiment - included in every call.

$0.49 / audio hour
  • 97+ languages, 216x real-time speed
  • AI speaker diarization (Agent / Customer)
  • Word-level timestamps on every word
  • PCI compliance detection (cards, CVV, expiry)
  • Financial extraction (payments, balances, billing)
  • Sentiment analysis & agent performance scoring
  • Call summary, type, outcome classification
  • Action items & agreements extraction
  • Custom AI instructions (up to 2,000 chars)
  • Structured JSON output - ready for your CRM

No subscriptions. No tiers. No feature gates. Everything above is included in every single API call.

Estimate your cost
$
Transcription + Full AI
111
audio hours
$0.49 / hr - everything included

Balances valid for 6 months. $200+ top-ups receive bonus balance.

How we compare

Provider Base Price All Features AI Analysis PCI Compliance Custom Instructions Speed (46-min call)
VoxParse $0.49/hr $0.49/hr Included Included Included ~12 seconds
AssemblyAI $0.21/hr $0.51+/hr* +$0.28/hr add-ons Extra (PII Redaction) LeMUR (token cost) ~30 seconds
Deepgram $0.46/hr $0.60+/hr Extra cost Not available Not available ~15 seconds
Google Cloud STT $0.96/hr $0.96/hr Not available Not available Not available ~60 seconds
AWS Transcribe $1.44/hr $1.60+/hr Extra cost Extra cost Not available ~120 seconds
Head-to-head

VoxParse vs AssemblyAI

Same 46-minute customer service call. Same day. Real results.

VoxParse Pro Winner
Processing time 12.1 seconds
Total cost (all features) $0.49/hr
Output format Structured JSON
Speaker diarization ✓ AI-powered
Name accuracy ✓ "Jesús" (accent preserved)
Email correction ✓ Auto-fixed
PCI masking ✓ Included
Sentiment analysis ✓ Included
Financial extraction ✓ Included
Custom instructions ✓ Included
AssemblyAI Universal-3 Pro
Processing time ~30 seconds
Total cost (all features) $0.51+/hr
Output format Raw text
Speaker diarization ✓ Built-in
Name accuracy ⚠ "Jus" (truncated)
Email correction ✗ Not available
PCI masking $ PII Redaction add-on
Sentiment analysis $ +$0.02/hr
Financial extraction ✗ Not available
Custom instructions $ LeMUR (token cost)
Processing Speed (46-min call)
VoxParse
12s
AssemblyAI
~30s
Cost per 1,000 Audio Hours (all features)
VoxParse
$450
AssemblyAI
$510+

Benchmark conducted April 2026 on a 46-minute English-language customer service recording. Both providers tested with the same audio file within the same hour.

Real output

Here's what you get back.

Actual response from a 46-minute customer service call. Processed in 12 seconds.

json - ai_analysis (excerpt)
{
  "call_summary": "Customer called about a billing discrepancy on March invoice. Agent issued a $75 credit and adjusted recurring rate to $149.99/mo.",
  "call_type": "billing",
  "call_outcome": "resolved",
  "customer": { "name": "James Rivera", "company": "Greenfield Dental Group", "email": "jrivera@greenfielddental.com" },
  "financial": {
    "credit_issued": "$75.00",
    "recurring_amount": "$149.99",
    "pending_balance": "$0.00",
    "payment_method": "Visa ending in 8831"
  },
  "compliance": {
    "recording_disclosure": true,
    "sensitive_data_shared": ["Credit card 4532 **** **** 8831", "CVV ***"]
  },
  "sentiment": { "customer_sentiment": "neutral", "agent_performance": "excellent" }
}

Start processing audio today.

Start with just $10. No commitments, no subscriptions. Get your API key in under a minute.

Get your API key