Question 1

Why ground on a captured AI surface instead of a search API?

Accepted Answer

A search API returns links and snippets; your pipeline still has to read, rank and synthesize them. A captured Perplexity or ChatGPT answer has already done live retrieval and synthesis on the real surface, and arrives with its citations attached — one hop to grounded, source-backed context. Many pipelines use both: google_search for breadth, an answer surface for synthesis.
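As a rough illustration of the "one hop" point, the sketch below turns a captured answer and its citations into a prompt for a generation step. The field names `answer` and `sources` (with `title` and `url`) and the function `build_grounded_prompt` are assumptions made for this example, not the documented Envelope schema.

```python
from typing import Any


def build_grounded_prompt(question: str, envelope: dict[str, Any]) -> str:
    """Format a captured answer and its citations as generation context.

    Assumed (hypothetical) shape:
    {"answer": str, "sources": [{"title": str, "url": str}, ...]}
    """
    answer = envelope.get("answer", "")
    sources = envelope.get("sources", [])

    lines = ["Context (captured answer):", answer, "", "Sources:"]
    # Number the sources so the model can cite them as [1], [2], ...
    for i, src in enumerate(sources, start=1):
        lines.append(f"[{i}] {src.get('title', 'untitled')} - {src.get('url', '')}")

    lines += [
        "",
        f"Question: {question}",
        "Answer using only the context above and cite sources by number.",
    ]
    return "\n".join(lines)
```

Snippets from a search API could be appended to the same source list if the pipeline uses both.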

Question 2

Is sync mode fast enough for interactive use?

Accepted Answer

Sync mode runs the same real-browser acquisition inline, so it is bounded by genuine surface latency: captures take longer than an index lookup, and we will not quote a fake number. The honest engineering answer: use sync for the interactive path where the grounding is the product, cache aggressively, and push everything else through async + webhooks.
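One way to read "cache aggressively" is a small time-to-live cache in front of the sync call. This is a minimal in-memory sketch, not part of the product: `CaptureCache` and the `fetch` callable are hypothetical, and `fetch` stands in for whatever function performs the real sync capture.

```python
import time
from typing import Any, Callable


class CaptureCache:
    """Small in-memory TTL cache in front of a slow capture call."""

    def __init__(
        self,
        fetch: Callable[[str], Any],  # hypothetical: performs the sync capture
        ttl_seconds: float = 300.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._store: dict[str, tuple[float, Any]] = {}

    def get(self, query: str) -> Any:
        # Normalize case and whitespace so trivially different queries share a key.
        key = " ".join(query.lower().split())
        now = self._clock()
        hit = self._store.get(key)
        if hit is not None and now - hit[0] < self._ttl:
            return hit[1]  # still fresh: skip the slow capture
        value = self._fetch(query)
        self._store[key] = (now, value)
        return value
```

A production setup would more likely use a shared store, and would probably avoid caching failed captures; the TTL should reflect how quickly the underlying answers go stale.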

Question 3

How do I avoid grounding on a hallucinated empty?

Accepted Answer

That failure mode is designed out: an absent answer is an explicit surfacePresent:false with a surface_absent warning (and costs nothing), and a failed lane is an explicit failed status with an acquisition_failed reason. Your pipeline branches on contract fields instead of guessing whether an empty string means "no answer" or "broken scraper".
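Branching on contract fields might look like the sketch below. The names `surfacePresent`, `surface_absent`, `failed` and `acquisition_failed` come from the answer above; where they sit in the Envelope (a top-level `status` key and a top-level `surfacePresent` key) is an assumption for illustration, as is the function name `classify_envelope`.

```python
from typing import Any


def classify_envelope(envelope: dict[str, Any]) -> str:
    """Return 'failed', 'absent', or 'grounded' for a captured Envelope.

    Assumed (hypothetical) layout: top-level "status" and "surfacePresent".
    """
    # A failed lane is explicit: retry or fall back, never ground on it.
    if envelope.get("status") == "failed":
        return "failed"
    # An absent answer is explicit too: the surface returned nothing to ground on.
    if envelope.get("surfacePresent") is False:
        return "absent"
    return "grounded"
```

The point of the pattern is that an empty string never has to be interpreted: the pipeline retries or falls back on `failed`, skips grounding on `absent`, and proceeds only on `grounded`.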

Question 4

Can I use this to build evaluation datasets instead of live grounding?

Accepted Answer

Yes. The same captures work offline: batch-submit query sets, store the Envelopes, and you have dated, source-attributed, provenance-stamped answer data for evals and fine-tuning corpora. The durable raw payloads (job.artifacts.rawKey) preserve the verbatim upstream response for reproducibility.
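Storing Envelopes for offline use can be as simple as appending them to a JSON Lines file. The sketch below assumes each Envelope is a JSON-serializable dict and that `job.artifacts.rawKey` is a nested path inside it; the function `append_envelopes` and the row layout (`query`, `stored_at`, `raw_key`, `envelope`) are hypothetical.

```python
import json
from datetime import datetime, timezone
from typing import Any, Iterable


def append_envelopes(path: str, records: Iterable[tuple[str, dict[str, Any]]]) -> int:
    """Append (query, envelope) pairs to a JSONL file; return rows written."""
    stored_at = datetime.now(timezone.utc).isoformat()
    written = 0
    with open(path, "a", encoding="utf-8") as fh:
        for query, envelope in records:
            # Assumed nesting for the raw payload key; None if it is missing.
            raw_key = envelope.get("job", {}).get("artifacts", {}).get("rawKey")
            row = {
                "query": query,
                "stored_at": stored_at,
                "raw_key": raw_key,
                "envelope": envelope,  # kept whole so nothing is lost
            }
            fh.write(json.dumps(row, ensure_ascii=False) + "\n")
            written += 1
    return written
```

Keeping the whole Envelope alongside the extracted key means an eval set can be re-derived later without re-running the captures.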

Live search grounding for RAG, from real AI surfaces.

The workflow, end to end.

Query synchronously at request time

Take the flat view when you only need text + sources

Feed answer + sources into your generation step

Handle absence and failure explicitly

The Envelope fields that do the work.

Honest limits

Build it on the capture layer.