ClinicalTrials.Veeva

Menu

Data-driven Identification for Substance Misuse

University of Wisconsin (UW) logo

University of Wisconsin (UW)

Status

Completed

Conditions

Substance Use
Substance-Related Disorders
Substance Abuse

Treatments

Other: Processing of clinical notes in the EHR data collected during routine care

Study type

Interventional

Funder types

Other
NIH

Identifiers

NCT03833804
SMPH/MEDICINE (Other Identifier)
A534285 (Other Identifier)
1R01DA051464 (U.S. NIH Grant/Contract)
2022-0983

Details and patient eligibility

About

The investigators propose to develop an open-source, publicly available machine learning model that health systems could download and apply to their electronic health record data marts to screen for substance misuse in their patients. The investigators hypothesize that the natural language processing algorithm can provide a standardized and interoperable approach for an automated daily screen on all hospitalized patients and provide better implementation fidelity for screening, brief intervention, and referral to treatment.

Full description

In 2016, nearly 30% hospital discharges in the United States (US) had a major diagnostic category for a substance-use related condition. Substance misuse ranks second among principal diagnoses for unplanned 7-day hospital readmission rates. Despite the availability of Screening, Brief Intervention, and Referral to Treatment (SBIRT) interventions, substance misuse is not part of the admission routine and only a minority of patients are screened for substance misuse in the hospital setting. This is particularly problematic, since among hospitalized inpatients, the prevalence of substance misuse is estimated to be as high as 25%, greater than either the general population or outpatient setting. Practical screening methods tailored for the hospital setting are needed.

In the advent of Meaningful Use in the electronic health record (EHR), efficiency for alcohol detection may be improved by leveraging data collected during usual care. Documentation of substance use is common and occurs in over 96% of provider admission notes, but their free text format renders them difficult to mine and analyze. Natural Language Processing (NLP) and machine learning are subfields of artificial intelligence (AI) that provide a solution to analyze text data in the EHR to identify substance misuse. Modern NLP has fused with machine learning, another sub-field of artificial intelligence focused on learning from data. In particular, the most powerful NLP methods rely on supervised learning, a type of machine learning that takes advantage of current reference standards to make predictions about unseen cases

In the earlier version of an NLP and machine learning tool, the investigators successfully used data from clinical notes collected in the first 24 hours of hospital admission to reach a sensitivity and specificity above 70% for identifying alcohol misuse. With nearly 36 million hospital admissions in 2016, a substance misuse classifier has potential to impact millions.

In this study, the aim is to prospectively implement a substance misuse classifier to examine its effectiveness against current practice of all hospitalized adult patients at a tertiary health system. The health system has a mature screening system to examine substance misuse classifier performance against current practice of questionnaire screening.

The hypothesis is that the substance misuse classifier may provide a standardized, interoperable, and accurate approach to screen hospitalized patients. Successful implementation of the classifier in hospitalized patients is a step towards an automated and comprehensive universal screening system for substance misuse.

Enrollment

64,996 patients

Sex

All

Ages

18 to 89 years old

Volunteers

No Healthy Volunteers

Inclusion criteria

  • Ages 18 years old to 89 years old
  • Inpatient status during hospitalization
  • Length of stay greater than 24 hours

Exclusion criteria

  • Cannot participate in the usual care SBIRT intervention
  • Death or obtunded during first 24 hours of admission
  • Discharged against medical advice
  • Transferred from another acute care hospital
  • Transferred to another acute care hospital

Trial design

Primary purpose

Screening

Allocation

N/A

Interventional model

Sequential Assignment

Masking

None (Open label)

64,996 participants in 2 patient groups

SMART-AI: NLP (natural language processing) pre-screen
Experimental group
Description:
Automated processing of clinical notes collected during routine care in first 24 hours of hospital admission to identify individuals at-risk for substance misuse to receive standard-of-care full screening and assessment, brief intervention, or referral to treatment (SBIRT) intervention.
Treatment:
Other: Processing of clinical notes in the EHR data collected during routine care
Usual Care
No Intervention group
Description:
Data collected before the intervention began

Trial documents
1

Trial contacts and locations

1

Loading...

Central trial contact

Majid Afshar, MD

Data sourced from clinicaltrials.gov

Clinical trials

Find clinical trialsTrials by location
© Copyright 2026 Veeva Systems