Question 1

What is execution integrity for healthcare AI?

Accepted Answer

Execution integrity verifies that what a healthcare AI agent writes to clinical systems (EHR notes, prescriptions, referrals) faithfully reflects the actual patient conversation. It goes beyond checking what the agent said to verify what it actually wrote.

Question 2

How does clinical artifact verification work?

Accepted Answer

Lithrim ingests the conversation transcript and the artifact the AI agent produced (SOAP note, referral letter, etc.), then runs faithfulness, completeness, and safety checks. Each artifact receives a verdict (PASS, WARN, or BLOCK) based on alignment with the source conversation.

Question 3

What safety flags does Lithrim detect?

Accepted Answer

Lithrim detects critical clinical safety issues including: WRONG_DOSAGE (medication dosage doesn’t match transcript), FABRICATED_HISTORY (medical history not mentioned in conversation), MISSED_ALLERGY (allergy mentioned but omitted from note), WRONG_LATERALITY (left/right body side errors), WRONG_CODE (incorrect billing or diagnostic codes), and MISSED_ESCALATION (urgent findings not flagged for follow-up).
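
As a rough illustration, the six flags above could be modeled on the client side as an enumeration. The flag names and descriptions come from this answer; the `SafetyFlag` class and the `parse_flags` helper are hypothetical and are not part of any published Lithrim SDK.

```python
from enum import Enum


class SafetyFlag(Enum):
    """Flag names are from the answer above; the Enum wrapper is an assumption."""

    WRONG_DOSAGE = "medication dosage does not match transcript"
    FABRICATED_HISTORY = "medical history not mentioned in conversation"
    MISSED_ALLERGY = "allergy mentioned but omitted from note"
    WRONG_LATERALITY = "left/right body side error"
    WRONG_CODE = "incorrect billing or diagnostic code"
    MISSED_ESCALATION = "urgent finding not flagged for follow-up"


def parse_flags(names: list[str]) -> list[SafetyFlag]:
    """Convert flag names to enum members; an unknown name raises KeyError."""
    return [SafetyFlag[name] for name in names]


flags = parse_flags(["WRONG_DOSAGE", "MISSED_ALLERGY"])
for flag in flags:
    print(f"{flag.name}: {flag.value}")
```

Rejecting unknown names with an error, rather than ignoring them, keeps a misspelled or newly introduced flag from passing silently.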

Question 4

Can Lithrim enforce my hospital's specific FHIR profile or custom validation rules?

Accepted Answer

Yes. The structural validator runs against a per-organization artifact profile, which can bind to FHIR US Core, HL7v2 segment definitions, ICD-10-CM, or a custom JSON schema you provide. Custom profiles are onboarded with the Lithrim team during the design partner phase and live alongside our standard healthcare profiles. The combined verdict is the worse of the semantic (council) verdict and the structural (validator) verdict, so a schema violation blocks regardless of LLM judge consensus.
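
The "worst of" rule can be sketched in a few lines. This is a minimal illustration, assuming the severity ordering PASS < WARN < BLOCK; the `Verdict` class and the `combine` function are hypothetical names, not Lithrim's implementation.

```python
from enum import IntEnum


class Verdict(IntEnum):
    # Assumed severity ordering: a higher value is a worse outcome.
    PASS = 0
    WARN = 1
    BLOCK = 2


def combine(semantic: Verdict, structural: Verdict) -> Verdict:
    """Return the worse of the council verdict and the validator verdict."""
    return max(semantic, structural)


# The judge council passes the note, but the schema check fails.
print(combine(Verdict.PASS, Verdict.BLOCK).name)
```

Because the combination is a plain maximum, neither layer can override a failure raised by the other.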

Question 5

Is Lithrim open source?

Accepted Answer

The core is. The evaluation harness under Lithrim (the judge council, the deterministic verification floor, and the audit trail) ships as an Apache-2.0 Community Edition you can run self-hosted with your own model keys: github.com/lithrim-dev/lithrim. The commercial platform adds curated clinical packs, custom artifact profiles, calibration, and support. The research behind the verification floor is published with a DOI and is reproducible from the public repository.

Question 6

Who is Lithrim built for?

Accepted Answer

Lithrim is built for companies deploying AI agents in healthcare: clinical scribe companies, telehealth platforms, health plans, and any organization whose AI agents write to electronic health records. It provides the verification layer between the AI agent and the patient record.

Question 7

What errors can AI scribes make in clinical notes?

Accepted Answer

AI scribes can introduce several categories of errors into clinical notes: wrong medication dosages, fabricated medical history not mentioned by the patient, omitted allergies, left/right body side errors (wrong laterality), incorrect billing or diagnostic codes, and missed urgent findings that need escalation. A 2024 study found 127 errors across 44 AI-generated clinical notes.

Question 8

How do you verify what an AI agent writes to the EHR?

Accepted Answer

Clinical artifact verification compares the AI-generated document (SOAP note, referral, prescription) against the source patient conversation, checking three dimensions: faithfulness (does every claim have transcript support?), completeness (are all clinically relevant details captured?), and safety (are there dangerous errors?). Each artifact receives a PASS, WARN, or BLOCK verdict.
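
To make the faithfulness question concrete, here is a deliberately naive sketch that treats a claim as supported only if every one of its tokens appears in the transcript. Token overlap is a stand-in chosen for illustration, not Lithrim's method, and every function name below is hypothetical.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercase alphanumeric tokens of a text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def unsupported_claims(claims: list[str], transcript: str) -> list[str]:
    """Return the claims that contain at least one token absent from the transcript."""
    transcript_tokens = tokens(transcript)
    return [claim for claim in claims if not tokens(claim) <= transcript_tokens]


def faithfulness_score(claims: list[str], transcript: str) -> float:
    """Fraction of claims with full token support; 1.0 when there are no claims."""
    if not claims:
        return 1.0
    return 1 - len(unsupported_claims(claims, transcript)) / len(claims)


transcript = "I take lisinopril 10 mg every morning. I am allergic to penicillin."
claims = ["lisinopril 10 mg", "allergic to penicillin", "lisinopril 20 mg"]

print(unsupported_claims(claims, transcript))
print(faithfulness_score(claims, transcript))
```

A check this simple misses paraphrases and negations, which is one reason a production system would need semantic judgment rather than string matching.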

Question 9

What is the difference between AI evaluation and artifact verification?

Accepted Answer

Traditional AI evaluation measures whether the agent said the right thing: task completion rate, response accuracy, latency. Artifact verification measures whether the agent wrote the right thing, comparing the clinical document it produced against the actual patient conversation. An agent can score 100% on conversation quality while writing a clinical note with fabricated medical history.

Question 10

What is artifact lineage in healthcare AI?

Accepted Answer

Artifact lineage is the end-to-end traceability chain from patient conversation to clinical document to EHR write. It tracks: (1) the raw transcript, (2) HIPAA compliance check results, (3) per-artifact faithfulness, completeness, and safety scores, (4) safety flags like WRONG_DOSAGE or FABRICATED_HISTORY, and (5) the final verdict that gates whether the artifact can be written to the patient record.
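
The five tracked items could be held in a single immutable record, sketched below. The field names, the 0-to-1 score scale, and the choice to let WARN artifacts through the gate are all assumptions made for illustration; only the five tracked items themselves come from the answer above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArtifactLineage:
    """One lineage record per artifact; field names are hypothetical."""

    transcript: str                     # (1) raw transcript
    hipaa_check_passed: bool            # (2) HIPAA compliance check result
    faithfulness: float                 # (3) per-artifact scores, assumed 0..1
    completeness: float
    safety: float
    safety_flags: tuple[str, ...] = ()  # (4) e.g. "WRONG_DOSAGE"
    verdict: str = "PASS"               # (5) PASS, WARN, or BLOCK

    def gate_open(self) -> bool:
        """Assumed gate rule: only a BLOCK verdict stops the EHR write."""
        return self.verdict != "BLOCK"


record = ArtifactLineage(
    transcript="I take lisinopril 10 mg every morning.",
    hipaa_check_passed=True,
    faithfulness=0.62,
    completeness=0.90,
    safety=0.40,
    safety_flags=("WRONG_DOSAGE",),
    verdict="BLOCK",
)
print(record.gate_open())
```

Freezing the dataclass means a record cannot be altered after the verdict is issued, which suits an audit trail.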

Question 11

How does Lithrim work with AI scribe companies?

Accepted Answer

Lithrim integrates via API or webhook. Scribe companies send the conversation transcript and the AI-generated clinical note to Lithrim’s /v1/analyze endpoint. Lithrim returns a verdict (PASS/WARN/BLOCK), safety flags, faithfulness score, and evidence spans showing exactly where the note diverges from the conversation. This runs before the note is written to the EHR, acting as a release gate.
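
A client-side release gate around this call might look like the sketch below. Only the `/v1/analyze` path and the four response elements (verdict, safety flags, faithfulness score, evidence spans) come from the answer above. The JSON field names, the sample response, and both helper functions are hypothetical, and no network request is made.

```python
import json

ANALYZE_PATH = "/v1/analyze"  # endpoint path from the answer above


def build_analyze_body(transcript: str, note: str) -> str:
    """Serialize a request body; the field names are assumptions."""
    return json.dumps({"transcript": transcript, "artifact": note})


def may_write_to_ehr(response: dict) -> bool:
    """Fail closed: allow the write only on an explicit PASS or WARN verdict."""
    return response.get("verdict") in {"PASS", "WARN"}


# Hand-written response for illustration only; not real Lithrim output.
sample_response = {
    "verdict": "BLOCK",
    "safety_flags": ["WRONG_DOSAGE"],
    "faithfulness": 0.62,
    "evidence_spans": [
        {"note": "lisinopril 20 mg", "transcript": "lisinopril 10 mg"}
    ],
}

body = build_analyze_body("I take lisinopril 10 mg.", "Lisinopril 20 mg daily.")
print(ANALYZE_PATH, body)
print(may_write_to_ehr(sample_response))
```

Treating a missing or unrecognized verdict as a refusal keeps a timeout or malformed response from turning into an unverified EHR write.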

Question 12

What is a BLOCK verdict in clinical artifact verification?

Accepted Answer

A BLOCK verdict means the AI-generated clinical artifact has a faithfulness score below 70%, indicating critical divergence from the source conversation. BLOCK artifacts should not be written to the EHR without human review and correction. Common causes include fabricated medical history, wrong medication dosages, and missed allergies.
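
The threshold stated above reduces to a one-line check. This sketch assumes the score is expressed as a fraction between 0 and 1 and that "below 70%" is a strict inequality; it does not model the PASS/WARN boundary, which this answer does not specify.

```python
BLOCK_THRESHOLD = 0.70  # "below 70%" from the answer above, as a fraction


def is_block(faithfulness: float) -> bool:
    """True when the faithfulness score falls strictly below the threshold."""
    return faithfulness < BLOCK_THRESHOLD


for score in (0.69, 0.70, 0.95):
    print(score, is_block(score))
```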

Metric	Current	Target	Delta (percentage points)
Identity Verification	73.40%	99.00%	-25.6
PHI Boundary	100.00%	99.50%	+0.5
Escalation	95.57%	97.00%	-1.4
Scope Safety	100.00%	99.50%	+0.5
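
The Delta column is consistent with Current minus Target, rounded to one decimal place. A short check under that assumption, with the rows copied from the table above:

```python
# (metric, current %, target %) copied from the table above.
rows = [
    ("Identity Verification", 73.40, 99.00),
    ("PHI Boundary", 100.00, 99.50),
    ("Escalation", 95.57, 97.00),
    ("Scope Safety", 100.00, 99.50),
]


def delta(current: float, target: float) -> float:
    """Gap to target in percentage points; negative means below target."""
    return round(current - target, 1)


deltas = [delta(current, target) for _, current, target in rows]
for (metric, _, _), d in zip(rows, deltas):
    print(f"{metric}: {d:+.1f}")
```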
