The Complete Guide to AI Medical Scribes (2026): How They Work & Which to Choose

How AI medical scribes work, what to evaluate, and what clinicians actually notice first.

Medically Reviewed by Donald Lazure

Written by the Commure Scribe Team

Published: March 11, 2026

8 min read

Try the #1 AI Scribe for Free

No Credit Card Required

Join 20,000+ Clinicians

Try for Free

TABLE OF CONTENTS

Try the #1 AI Scribe.

99.4% accuracy. 43-second charts. $59/month.

Try Commure Scribe for Free

What You Need to Know

  • A 2025 JAMA study found ambient AI scribes reduced physician burnout by 21% across two major health systems.4 
  • The most common finding from early adopters: the AI captures clinical context they would have missed or compressed under time pressure.

You finish your last patient at 5:15 pm but you have another two hours of charts to write. For most clinicians, that's just Tuesday. The documentation burden is a second job.

AI medical scribes exist so you leave the building with every chart closed. The best ones capture the full context of the visit so you can focus on the patient in front of you, not the note you'll write after.

This guide explains how they work and what to look for.

Why Is Clinical Documentation Still Eating Your Day?

Time on charts. For every hour a physician spends with patients, they spend roughly two hours documenting.1 Across solo practices and group practices alike, that ratio plays out every day.

Burnout is the outcome. Nearly half of US physicians now report at least one symptom of burnout, and documentation load is the most cited driver, ranking above patient volume and clinical complexity.2,7 A 2025 study across Mass General Brigham and Emory found that ambient AI documentation reduced physician burnout by 21% and improved wellbeing scores by 31%.3

Small practices feel this most. A 2025 survey of small primary care practices found 41% less documentation time and 60% less burnout after adopting an ambient AI scribe.9

Where the Time Goes

The breakdown happens at the same point in every practice. The encounter ends, the patient leaves. The clinician now has 15 minutes or more of conversation that needs to become a SOAP note with ICD-10 codes before the next patient walks in. That gap is where documentation debt builds.

After-hours debt. By end of day, that debt is three, five, sometimes ten notes long. The AMA estimates physicians spend 86 minutes on after-hours EHR work per day on average.7 For a practice without extended admin support, that time comes out of the clinician's personal life.

Why Current Workarounds Fall Short

Human scribes cost $30,000 to $45,000 per year per provider. For a practice without dedicated admin support, that is a fixed overhead line that doubles when a provider joins. They also require training, scheduling, and HIPAA agreements or could be absent due to illness or vacation. Note quality depends on the individual scribe's familiarity with each clinician's style.

Traditional dictation tools require the clinician to mentally compose the note while speaking. That adds cognitive load during the visit rather than removing it afterward. The output is usually unstructured text that needs heavy reformatting before it can enter the chart.

Neither workaround solves the core problem: getting a structured, coded, EHR-ready note done without it following the clinician home.

How Does an AI Medical Scribe Actually Work?

An AI medical scribe listens to a patient encounter and builds a structured clinical note from the conversation. Unlike dictation software, which transcribes what the clinician says, an ambient scribe listens to both speakers and pulls out the clinically relevant content.

The Technology Behind the Note

Three systems work in order during every visit:

  • Speech recognition converts the audio stream into text in real time. Medical-grade systems are trained on clinical terms, drug names, and specialty-specific language. Accuracy numbers only mean something if they hold in real clinical environments, not controlled demo recordings.
  • Speaker diarization separates the conversation by speaker. The system knows which voice is delivering the history, narrating the exam, or reporting symptoms. This is what makes multi-speaker recognition work in a noisy clinic room.
  • Clinical NLP (natural language processing) maps the transcript to clinical structure. It sorts content into the right SOAP note sections and applies specialty-specific patterns. A system trained on family medicine behaves differently from one trained on behavioral health. That gap shows up most in ICD-10 selection and plan organization.

The output is a structured, coded note built to match the clinician's specialty and workflow.

What Happens When You Click End Recording

Within seconds of ending the recording, a structured SOAP note appears with ICD-10 and CPT codes already filled in. The plan section typically reflects the full scope of the visit, including differentials discussed, treatment rationale, verbal follow-up instructions, and patient education points that often get cut when charting under time pressure.

The value is not that the AI writes a better note. It is that the clinician, freed from composing the note during or right after the visit, can stay present in the room. Eye contact instead of a keyboard. Listening instead of typing. The record then reflects what actually happened, because the clinician was paying attention when it did. In a TPMG study of 7,260 physicians, 47% of patients reported that their doctor spent less time on the computer during the visit.8

What a Good AI-Generated Note Actually Looks Like

Where AI scribes earn or lose credibility is in the sections clinicians cut short when time runs out.

In the Subjective section, a clinician charting quickly might write: 'Patient reports knee pain, worse with stairs.' A system that captured the full visit would produce: 'Patient reports right knee pain starting about six weeks ago, getting worse over time, sharp going down stairs, dull at rest, with occasional swelling after long walks. Denies locking or giving way.' The second entry is not the AI's clinical judgment. It is the patient's account, captured fully rather than summarized.

In the Assessment and Plan section, the fuller record includes the differentials discussed, the treatment rationale, and the follow-up instructions given verbally. It also captures patient education points mentioned in passing. These are the sections compressed most when a clinician is charting visit #9 at 7pm. They also matter most for continuity of care, billing accuracy, and medico-legal records.

Suggested ICD-10 and CPT codes are filled in automatically on paid tiers. For practices without a dedicated coder, this cuts coding review time on routine visits without needing a dedicated coder.

Does the Note Sound Like You, or Like a Generic Template?

A well-configured AI scribe should produce notes that match your documentation style, not a one-size-fits-all format that requires heavy rewriting to feel like yours.

If routine visit notes still require significant editing after the first week, either the system is not capturing accurately or the template does not match your specialty. Custom templates let you specify the sections you want, the format you prefer, and the level of detail expected in each part.

The question is whether the tool holds up on a difficult day not on an easy morning. That is the benchmark worth tracking during any trial period.

What Should a Medical Practice Look for in an AI Medical Scribe?

Most practices want something that runs on any device, connects to the EHR already in use, and does not require an IT project to get started.

Accuracy in Real Clinical Conditions

Real clinic rooms are noisy. Medical practices should see how AI medical scribes work in real patient encounters with multiple speakers. See how they hold up with overlapping voices, background noise, and children in the room. Accuracy numbers only matter if they hold in those conditions.

What AI scribes tend to miss was examined directly in a 2025 randomised controlled trial published in NEJM AI. The finding: omissions are the most common error type, not inaccuracies.4 The AI is more likely to leave something out than to state something wrong. In practice, this means the review step matters most for completeness, not correction. Medication names spoken quickly, dosage changes mentioned in passing, and patient-reported symptoms buried in conversational language are the categories most prone to omission. Review the note after the visit, while the encounter is still fresh, to catch any omissions.

Telehealth visits introduce the same variables that affect any audio call: connection stability and device microphone quality. Both visit types are supported by leading AI scribes. If accuracy feels lower on telehealth visits, audio quality is the first variable to check before assuming a system limitation.

Test it on a real visit before committing, not on a controlled demo recording.

EHR Compatibility

The note needs to land in the chart, not in a separate document that requires manual transfer. EHR integration is one of the most frequently cited concerns when clinicians evaluate AI scribes.

The handoff between note to EHR varies by tier. At all tiers, a well-integrated scribe produces a structured, copyable note that transfers into any EHR in a single paste. Enterprise tiers typically offer one-click sync that populates chart sections directly without the manual step.

What breaks most often is the handoff between the note and the chart. EHR session timeouts, chart field structure changes after a version upgrade, and clipboard conflicts after software updates are the failure modes worth testing before going live with patients. Run the full workflow, from recording through chart entry, on a non-patient visit first.

Languages, Visit Types, and Devices

The most useful AI scribes support multiple languages with automatic detection. Both in-person and telehealth visits should be supported. The tool should run on any device the clinician already uses: mobile, tablet, or desktop. Session recording length matters for extended or complex visits. Look for at least 60 to 90 minutes of continuous audio per session.

HIPAA and Data Practices

HIPAA compliance is a baseline for any tools that touches patient conversations . The questions that matter beyond the compliance checkbox:

  • What is the data retention policy for clinical notes?
  • Who has access to the data, including employees, contractors, and third-party processors?
  • Is a Business Associate Agreement standard, or does it require a separate negotiation?

Tools that keep audio recordings create a separate category of PHI with its own access controls and breach exposure. When audio is processed and discarded, the only data left is the structured clinical note, governed by the same policies as any EHR record.

As for disclosures, the standard approach is to do so at the start of the visit. Let the patient know that an AI tool is being used to help with documentation. Give them the option to decline. In a 2025 randomised controlled trial, only 7% of patients declined when offered the option.4 Most patients, when given a straightforward explanation, have no objection. 

Something like: 'I use an AI tool to help me document our visit. It listens during our conversation and generates the notes afterward. Is that okay with you?' That covers the material disclosure, gives the patient agency, and takes about ten seconds.

What Specialties Can Use an AI Medical Scribe?

A general-purpose transcription tool doesn't distinguish between a SOAP note for family medicine and a DAP note for behavioral health. Specialty-trained systems do, and the gap shows up most in ICD-10 selection, note structure, and plan organization.

How Documentation Needs Differ by Specialty

Family Medicine and Internal Medicine: Visits often cover more than one problem in a single encounter. The challenge is capturing the full scope of each: the discussion, the decision rationale, the return instructions, without the note collapsing into a summary by visit #10. When documentation is handled, the clinician can give each problem the attention it needs in the room rather than trying to reconstruct it later.

Psychiatry and Behavioral Health: The subjective section carries particular clinical weight. Mood, affect, ideation, and response to treatment are often described by the patient in their own words. Those words matter for longitudinal care. A scribe that paraphrases loses that detail. Because the clinician is not typing during the session, the therapeutic relationship stays intact.

Physical Therapy: Documentation has distinct structural needs: functional assessments, ROM measurements, treatment units, and progress toward goals.

Pediatrics and Dentistry: Each specialty has its own conventions. Developmental milestone tracking, vaccine visit templates, dental charting nomenclature. Specialty-specific templates handle these without requiring the clinician to adapt a generic SOAP format.

Do AI Scribes Actually Save Time and Money?

A 2025 UCSF study published in JAMA Network Open found that physicians using AI scribes generated an average of $3,044 more revenue per year and saw 0.8 more patients per week compared to non-users.5 For most practices, the productivity gain covers the subscription cost well within the first year.

Time savings are real but modest in controlled conditions. The first randomised controlled trial of an ambient AI scribe, published in NEJM AI in 2025, found a 9.5% reduction in time spent in the note compared to control.4 That is not the transformative figure vendor marketing often suggests. But it compounds across 20 visits a day, and it does not account for the reduction in after-hours charting, which is where clinicians in smaller practices feel the burden most.

Note quality is more nuanced. A human scribe who has worked with a specific clinician for two years will know their documentation style well. An AI scribe performs consistently from session one, handles multiple languages, and needs no HIPAA retraining or scheduling. For most practices, the challenge of finding and keeping a quality scribe is as real as the cost.

Adoption of ambient AI documentation is growing across health systems. The Permanente Medical Group reported 15,791 hours saved across 7,260 physicians in a single year.6 UCSF found a productivity and revenue gain in a large cohort study.5 Mass General Brigham and Emory reported significant burnout reduction.3 Practices that build this into their workflow early are setting up documentation infrastructure that scales as they grow.

Commure Scribe: How It Works in Practice

Commure Scribe is an AI medical scribe built for solo clinicians and group practices. Here is what it does, and how it does it.

Workflow: Capture, Edit, Finalize

  • Capture: Open the app on any device and press Record. The AI listens in real time. No structured speaking required.
  • Edit: Within seconds of pressing End Recording, a structured SOAP note appears with suggested ICD-10 and CPT codes already filled in. The clinician reviews and edits.
  • Finalize: The clinician approves the note. It syncs to the EHR via copy-paste or one-click sync. The clinician always has the option to review before the note enters the chart.

Accuracy and Languages

99.4% transcription accuracy across real clinic environments, not controlled demo recordings. Multi-speaker recognition separates clinician and patient voices automatically.

90+ languages with automatic detection. Sessions record up to 2 hours of continuous audio per session.

EHR Compatibility

Commure Scribe connects with the 60+ EHRs, including eClinicalWorks, Athenahealth, Tebra, AdvanceMD, Practice Fusion, most common in solo and group practices.

Specialties

Commure Scribe works across a wide range of specialties, including Family Medicine, Internal Medicine, Psychiatry, Pediatrics, Behavioral Health, Dentistry, and Physical Therapy, among others, with specialty-specific templates.

Data Practices

  • Notes are securely stored. Clinical notes are kept with enterprise-grade security.
  • No third-party data sharing. Patient data is not shared with outside parties.
  • HIPAA compliant and SOC 2 certified. A BAA is standard.

Pricing

  • 7-day free trial. No credit card required.
  • Solo clinicians and small practices (1–5 clinicians): $89/month or $59/month billed annually. Unlimited transcription, custom templates, AI Copilot, ICD-10/CPT coding, copy/paste.

Medium and large practices (6–100+ clinicians): Custom pricing. Deep EHR integration, custom AI workflows, live onboarding, ROI analytics.

How Do You Evaluate an AI Medical Scribe?

The most reliable way to evaluate an AI scribe is to use it on real patient visits. Watching a vendor demo or comparing feature lists won't tell you what you need to know. The variables that determine whether a tool fits your practice and specialty aren't visible in a product tour.

A structured one-week trial gives enough data to make a real assessment. Here is a practical protocol:

  1. Start the free trial. No credit card required.
  2. Record one real patient visit using normal conversation. No changes to how you speak.
  3. Read the note before editing. Assess what the AI captured from the visit.
  4. Edit the note and track how long it takes.
  5. Transfer the note to your EHR. Confirm it landed correctly.
  6. Record five more visits across different visit types. Compare after-hours charting time to your normal baseline.
  7. Assess: does the note more fully reflect what happened in the room? If yes, consider upgrading to Pro for ICD-10/CPT coding and custom templates.

The most useful benchmark after a one-week trial is not how much time was saved. It's whether the note more fully reflects what happened in the room. A complete note is a better clinical record, a better billing document, and a better reference at the next visit.

What makes clinicians stick with a scribe past the first month comes down to whether the tool holds up on difficult days, not just easy ones. The trial period is most useful when you test it across a range of visit types, including complex multi-problem encounters, not just routine follow-ups.

For multi-provider practices, run the trial with one clinician first. Let the results speak before asking the rest of the team to change their workflow.

Common Questions About AI Medical Scribes

How accurate are AI medical scribes?

Commure Scribe achieves 99.4% transcription accuracy across real clinical environments, not controlled recordings. That includes background noise and multi-speaker conversations. No AI scribe is 100% accurate. The clinician reviews and approves every note before it enters the chart.

Are AI scribes HIPAA compliant?

HIPAA compliance is a baseline requirement, not a differentiator. Every credible AI scribe vendor should be able to provide a signed Business Associate Agreement. Commure Scribe is HIPAA compliant and SOC 2 certified. Notes are securely stored and not shared with third parties. A BAA is standard.

How much does an AI medical scribe cost?

Commure Scribe offers a 7-day free trial, no credit card required. After the trial, paid tiers scale by provider count. A human scribe typically costs $30,000 to $45,000 per year per provider. See the Commure Scribe pricing page for current plan details.

Can an AI scribe integrate with my EHR?

Commure Scribe connects with 60+ EHRs, including eClinicalWorks, Athenahealth, SimplePractice, WebPT, Elation, AdvancedMD, Cerbo, Tebra, Practice Fusion, and Kipu via Chrome Extension on all tiers, with one-click EHR sync on Enterprise.

What happens if the AI misses something during the visit?

The workflow is Capture, Edit, Finalize. The clinician reviews the note before it enters the chart. Commure Scribe records up to 2 hours of continuous audio per session, which reduces the chance of missed context. The clinician edits and approves before the note is finalized.

How much charting time do clinicians actually save after the first few weeks?

A major 2025 NEJM AI trial found that doctors spent only about 10% less time writing notes when using an AI scribe in a controlled setting. In everyday practice, bigger time savings usually come from less after‑hours “pajama time” and faster chart closing, but this trial was not designed to measure those outcomes.

How do I get the note to sound like me and not like a generic AI template?

Custom templates are the main lever. In Commure Scribe Pro, you can specify your preferred note structure, the sections you want included, and the level of detail expected in each part. If the default output requires the same edits on every note, the template does not match your style yet. Adjusting the template before continuing the trial is more useful than running more visits on a format that does not fit.

Do I need patient consent to use an AI medical scribe?

What breaks most often with AI medical scribes?

The handoff between the note and the chart is where most friction occurs, not the recording or note quality. EHR session timeouts, chart field changes after version upgrades, and browser conflicts after software updates are the common failure points. Test the full workflow on a non-patient visit before going live.

Which specialties work best with AI medical scribes?

Commure Scribe supports Family Medicine, Internal Medicine, Psychiatry, Pediatrics, Behavioral Health, Dentistry, and Physical Therapy with specialty-specific templates. Each specialty uses a different note structure and terminology. Specialty-specific templates ensure the output matches the format clinicians in that discipline actually use.

Sources

1 American Medical Association / RAND Corporation. Physician Time Spent Using the Electronic Health Record During Outpatient Encounters. 2022. https://www.rand.org/pubs/research_reports/RRA1030-1.html

2 American Medical Association. AMA Organizational Biopsy: Physician Burnout Rate and Pajama Time. 2024. https://www.ama-assn.org/practice-management/physician-health/burnout-way-down-pajama-time-stands-still

3 UCLA Health / Sinsky et al. Physicians Spend 2 Hours Documenting for Every 1 Hour of Patient Care. 2025. https://www.uclahealth.org/news/release/ucla-study-finds-ai-scribes-may-reduce-documentation-time

4 You et al. Ambient Documentation Technologies Reduce Physician Burnout. JAMA Network Open. 2025. https://www.massgeneralbrigham.org/en/about/newsroom/press-releases/ambient-documentation-technologies-reduce-physician-burnout

5 Lukac et al. Randomized Controlled Trial of Ambient AI Scribe on Documentation Time and Patient Acceptance. NEJM AI. 2025. https://pmc.ncbi.nlm.nih.gov/articles/PMC12768499/

6 Holmgren et al. AI Scribes Associated With Increased Physician Productivity and Revenue. JAMA Network Open. 2025. https://docit.ucsf.edu/news/ucsf-study-finds-ai-scribes-associated-increased-physician-productivity-and-revenue

7 The Permanente Medical Group / NEJM Catalyst. AI Scribes Save 15,000 Hours and Restore Human Side of Medicine. 2025. https://www.ama-assn.org/practice-management/digital-health/ai-scribes-save-15000-hours-and-restore-human-side-medicine

8 American Medical Association. Physician Burnout and Administrative Burden Survey. 2023. https://www.ama-assn.org/practice-management/physician-health/physician-burnout-survey

9 The Permanente Medical Group / NEJM Catalyst. AI Scribes Save Physicians Time, Improve Patient Interactions and Work Satisfaction. 2025. https://permanente.org/analysis-ai-scribes-save-physicians-time-improve-patient-interactions-and-work-satisfaction/

10 Phyx Primary Care / Rise Health. Ambient AI Scribe Cuts Small Primary Care Providers Burnout by 60%. Survey. 2025. https://www.risehealth.org/insights-articles/ambient-ai-scribe-cuts-small-primary-care-providers-burnout-by-60-new-report-reveals/

Try the #1 AI Scribe for Free

No Credit Card Required. Join 20,000 Clinicians.