Deepfield: a modular assessment platform
An eleven-stage strategic assessment pipeline that turns a query into a defensible course of action with full evidence lineage.
We build drafters that read source materials, fill a typed template schema, and return each field as a draft with a confidence score and a citation, a flagged low-confidence guess, or an explicit gap. Reviewers accept, reject, or rewrite one field at a time. The system drafts. The human decides.
The same pattern recurs wherever a structured document has to be drafted from a stack of source materials. Research protocols. Grant applications. Compliance filings. Underwriting memos. RFP responses. People spend hours pulling information out of source documents to populate template fields. The process is slow. It's error-prone. Different drafters interpret the same source material differently and end up with different documents for the same underlying facts. Whoever reviews those documents has to chase inconsistencies that came from the input stage, not the substance.
Our client, a clinical research alliance preparing human subjects research protocols for IRB review, wanted a first draft a researcher could start from, not a finished document a researcher had to fight with. The underlying pattern generalizes well past IRBs.
The deliverable is a template auto-population API. The caller uploads a set of context documents and selects a template schema. The system parses the documents, runs LLM extraction with constrained structured outputs, and returns a draft with each field populated from the source material. Every field comes back as one of three things: a draft with a confidence score and a source citation, a flagged low-confidence guess, or an explicit gap.
The extraction uses constrained decoding into a typed Pydantic schema, so every field conforms to a declared contract. The model can't invent a field shape. When information is missing, the schema forces an explicit gap rather than letting the model paper over the hole with fluent prose.
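As a concrete illustration, here is a minimal sketch of what such a contract can look like in Pydantic. The class and field names are hypothetical; the real schemas are built per template library.

```python
from typing import Literal, Optional, Union

from pydantic import BaseModel


class SourceCitation(BaseModel):
    """Where in the uploaded sources a drafted value came from."""
    document: str
    page: int
    paragraph: int
    passage: str  # the quoted span the draft was derived from


class DraftedField(BaseModel):
    status: Literal["drafted"]
    value: str
    confidence: float  # 0.0 to 1.0, scored against the engagement's rubric
    citation: SourceCitation


class FlaggedGuess(BaseModel):
    status: Literal["flagged"]  # low-confidence guess, surfaced for extra scrutiny
    value: str
    confidence: float
    citation: Optional[SourceCitation]


class Gap(BaseModel):
    status: Literal["gap"]  # not enough source material to draft this field
    reason: str


# Every template field resolves to exactly one of the three outcomes.
FieldResult = Union[DraftedField, FlaggedGuess, Gap]


class ProtocolDraft(BaseModel):
    """One entry per template field; the model cannot return anything outside this shape."""
    inclusion_criteria: FieldResult
    exclusion_criteria: FieldResult
    recruitment_procedure: FieldResult
    data_and_safety_monitoring: FieldResult
```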
Three things carry the defensibility load.
First, structured outputs with constrained decoding. Every extracted field is typed. Every field is a draft with a confidence score and a source passage, a flagged low-confidence guess, or an explicit gap. There's no unstructured middle ground where the model hides an unsupported claim.
Second, source attribution on every field. When a reviewer reads a drafted "inclusion criteria" field, they see the specific page and paragraph in the source documents that the draft came from. Accepting the draft is a judgment call with all the evidence in front of them. Rejecting it is a one-click action, not an investigation.
Third, explicit gap surfacing. The system is designed so that "I don't have enough information to draft this field" is a normal output, not a failure. A reviewer would much rather see ten fields drafted and five marked as gaps than see fifteen fields drafted with uniform confidence, because the second case hides which fields they need to scrutinize.
The point of all three is that the human reviewer stays in charge. The system drafts. The human decides. A first draft a reviewer can trust beats a finished draft they can't.
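To show how the pieces fit, here is a hedged sketch of the extraction call, reusing the ProtocolDraft schema sketched above and assuming an OpenAI-style structured-outputs client; any backend that can constrain decoding to a JSON schema fills the same role. The model name, prompt wording, and placeholder input are illustrative, not the production configuration.

```python
from openai import OpenAI  # assumption: OpenAI Python SDK with structured-outputs support

client = OpenAI()


def draft_protocol(source_text: str) -> ProtocolDraft:
    """Draft every template field from the parsed sources, under the ProtocolDraft contract."""
    completion = client.beta.chat.completions.parse(
        model="gpt-4o-2024-08-06",  # illustrative; any model with structured-outputs support
        messages=[
            {
                "role": "system",
                "content": (
                    "Draft each template field only from the provided sources. "
                    "Cite the document, page, and paragraph each draft came from. "
                    "If the sources do not support a field, return a gap, not a guess."
                ),
            },
            {"role": "user", "content": source_text},
        ],
        response_format=ProtocolDraft,  # constrained decoding into the typed schema
    )
    return completion.choices[0].message.parsed


# Hypothetical usage: the string stands in for the parsed text of the uploaded documents.
draft = draft_protocol("...parsed text of the uploaded source documents...")

for name in ProtocolDraft.model_fields:
    field = getattr(draft, name)
    if field.status == "gap":
        print(f"{name}: GAP ({field.reason})")
    else:
        print(f"{name}: {field.value!r}  confidence={field.confidence:.2f}  source={field.citation}")
```

The loop at the end is only a stand-in for the review surface: each field is accepted, rejected, or rewritten by a human before anything lands in the protocol.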
It replaces manual extraction and reformatting, a process that took a researcher hours per document and produced inconsistent outputs across drafters and projects.
An engagement runs 8 to 12 weeks to build a drafter for a new template library. We need the templates, a reference set of completed documents for schema design, and access to typical source document shapes. You get the deployed API, the template schemas, the extraction prompts tuned for the domain, and the confidence-scoring rubric.
It's a fit for IRBs, grant offices, compliance teams, regulatory affairs, underwriting, audit, due diligence, and any setting where a structured document has to be drafted from a set of source materials and a human reviewer needs to see exactly where every piece of the draft came from.
We've written a two-page business case for this engagement shape. Executive summary, problem statement, deliverables, risks, success metrics, investment range. Read it in the browser or print it to PDF and forward it.
Read the business case
Per-program analytical sites where every quantitative claim is backed by a reproducible query and a confidence level.
A knowledge-graph research console that opens Andrew Marshall's Office of Net Assessment tradition to a new generation of strategists.
Tell us about the decision you're trying to improve. We'll schedule a briefing with our principals to understand your environment and explore a potential fit.
Schedule a Briefing