
Simulate real users.
Ship reliable agents.

Watch a synthetic driver call your voice agent in Spanish, switch to English mid-call, and trip an edge case before it ever reaches a customer.

Voice · Text · Headless
10k+

edges found this week

120+ languages

parallel drivers

drv_8c2a · bilingual_switcher · 🇪🇸 es-MX → en-US · Live
Eval: judge · bilingual_switcher · 4.6/5
Works with your stack
  • CrewAI
  • Fireworks AI
  • Groq
  • GitHub
  • Google Cloud
  • Hugging Face
  • Cohere
  • CircleCI
  • Anthropic
synthetic users

Drivers act like real users. Not like scripts.

A Driver is a synthetic user with personality, context, and a goal. They interact with your agent across long, multi-turn conversations and expose edges your QA scripts won't — at a scale your team can't match by hand.

Drivers · 12 active · 3 found edges
  • Frustrated returner: "It's been 31 days. ONE day past your policy. Are you serious?" (en-US · 22 turns · edge)
  • Confused senior: "Hello? Is anyone there? I think I pressed something." (en-US · 14 turns · passed)
  • Power user: "Skip the intro. I need invoice history for Q3 in CSV." (en-US · 9 turns · passed)
  • Bilingual switcher: "Hola, my husband's on the line. Can we switch to English?" (es→en · 17 turns · edge)
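
As a rough illustration of what "personality, context, and a goal" could look like as data, here is a minimal Python sketch. The `Driver` class, its field names, and `to_prompt` are hypothetical and are not taken from the Okareo SDK. The personality text is made up for the example. The scenario details echo the bilingual switcher shown on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Driver:
    """Hypothetical synthetic-user profile; field names are assumptions."""
    name: str
    personality: str
    context: str
    goal: str
    locale: str = "en-US"

    def to_prompt(self) -> str:
        """Render the profile as instructions for a model that plays the user."""
        return (
            f"You are role-playing a user ({self.locale}). "
            f"Personality: {self.personality}. "
            f"Context: {self.context}. "
            f"Goal: {self.goal}. "
            "Stay in character until the goal is met or clearly blocked."
        )


# Illustrative values only; the personality wording is invented for this sketch.
bilingual_switcher = Driver(
    name="bilingual_switcher",
    personality="polite, slightly rushed",
    context="order has not arrived; husband joins the call partway through",
    goal="get a delivery update, switching from Spanish to English mid-call",
    locale="es-MX",
)
print(bilingual_switcher.to_prompt())
```

Keeping the profile declarative, with no scripted turns, is what lets a driver wander. The same profile can be replayed against a new build without rewriting a test script.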
🇪🇸 es-MX · scenario: bilingual_switcher · duration: 1:08
Transcript
Caller: Hola, llamo porque mi pedido no ha llegado. [traffic noise, intermittent]
Agent: Lo siento mucho. ¿Me puede dar el número de pedido?
Caller: Sure, it's... actually can we switch to English? My husband is on the line.
Agent: Of course. Could you share the order number whenever you're ready?
Trace: stt.transcribe · router.classify · lang.detect · policy.lookup · order.fetch · tts.synthesize · vad.segment · switch.detect ✗ · prompt.build · db.query · audio.encode · auth · llm.haiku · llm.sonnet · eval.judge · stream.flush
Timeline: 00:00 to 01:36 · current position 00:24
120+ languages & locales · noise · crosstalk · clipping · one timeline, three views
Voice

Voice agents, tested like real calls.

Synthetic callers in 120+ languages, with the noise, clipping, and crosstalk of the environments your real customers call from. Audio, transcript, and trace land on the same timeline, so Conversation Designers and Engineers debug the same call in the same view.

Exploration
Tests verify.

Drivers explore.

Traditional QA writes scripts and asserts on outcomes you already know to check. Drivers don't have a script — they have personality and a goal, and they wander. What you get back is a map of every behavior your agent actually has, including the ones nobody could have written down beforehand.

"In the first run we found 47 edges across our 12 happiest paths. Six of them were already shipping in production."

Figure: paths from start to resolve, showing the happy path and the edges discovered by drivers, not by your team.
Eval in CI

Gate every PR on real conversations.

Run synthetic-user simulations on every commit. No more vibe testing. Catch regressions in tone, policy, tool use, or routing before they ship — with judge, symbolic, and audio-based evals on the same scorecard.

PR #284 · feat(voice): add bilingual switch handling
  • build / typecheck · 42s
  • okareo / sim · text agent (40 drivers) · 1m 48s
  • okareo / sim · voice agent (12 drivers) · 2m 41s · warning
  • okareo / eval · bilingual_switcher · switch.detect failed
  • okareo / eval · refund_path · judge: pass · 4.6/5
  • eval gate: blocking · 1 of 14 scenarios regressed, merge blocked.
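
To make the gate logic in the PR panel above concrete, here is a minimal Python sketch of a blocking check: it counts regressed scenarios and blocks the merge if there are any. `ScenarioResult` and `eval_gate` are hypothetical names, not Okareo APIs. The `scenario_N` entries are filler that brings the total to the 14 scenarios shown in the panel.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class ScenarioResult:
    """Outcome of one simulated scenario (hypothetical shape, not an Okareo type)."""
    name: str
    passed: bool
    detail: str = ""


def eval_gate(results: List[ScenarioResult]) -> Tuple[bool, str]:
    """Return (blocked, summary); the gate blocks if any scenario regressed."""
    regressed = [r for r in results if not r.passed]
    summary = f"{len(regressed)} of {len(results)} scenarios regressed"
    if regressed:
        names = ", ".join(r.name for r in regressed)
        return True, f"{summary} ({names}), merge blocked."
    return False, f"{summary}, merge allowed."


# Illustrative data: 12 filler scenarios plus the two named in the panel.
results = [ScenarioResult(f"scenario_{i}", True) for i in range(12)]
results.append(ScenarioResult("refund_path", True, "judge: pass, 4.6/5"))
results.append(ScenarioResult("bilingual_switcher", False, "switch.detect failed"))

blocked, summary = eval_gate(results)
print(summary)  # 1 of 14 scenarios regressed (bilingual_switcher), merge blocked.
```

In a CI job, the `blocked` flag would typically be turned into a non-zero exit code so the check fails and the merge is held.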
PRODUCTION (failed call · 2.4k) → TEST LIBRARY (+1 scenario) → NEXT PR (eval gate) → DEVELOPMENT (fix → ship) · closed loop.
Closed Loop

Production failures become tests.

Every escaped failure in production becomes a synthetic scenario that runs on your next PR. Your test library gets stronger every week — without anyone hand-writing new tests. The agents that survive Okareo don't just ship; they get harder to break.

Your AI shouldn't be a black box.
Unlock visibility today.

What You Get

Why teams pick Okareo.

Six things Okareo does that the rest of the category doesn't. Or can't.

01 / Modality

Voice and text, one platform

Stop switching tools when modalities change.

02 / Coverage

Find edges, not just bugs

Synthetic users explore behaviors humans wouldn't think to try.

03 / Debug Surface

Trace and transcript, one view

Designers and engineers debug the same call.

04 / Workflow

Eval on every PR

Wire Okareo into CI. Gate releases on real conversation quality.

05 / Loop

Production feeds development

Failed real calls become tomorrow's test scenarios.

06 / Team

Built for cross-functional teams

Conversation Designers, Engineers, ML, and QA in one workspace.
