What question did this study set out to answer?

To evaluate the CigStopper algorithm's efficiency in predicting billing eligibility for tobacco cessation counseling.

March 16, 2026Open Access

Real-time automated billing for tobacco treatment: performance evaluation of the CigStopper machine learning framework

Key Points

To evaluate the CigStopper algorithm's efficiency in predicting billing eligibility for tobacco cessation counseling.
Trained on a 40,000-note corpus of clinical notes and synthetic data.
Used Random Forest models for flat multiclass and hierarchical classification.
Evaluated performance with a 20% holdout set using metrics like accuracy and F1 score.
Achieved F1 scores of ≥ 0.97 for billing eligibility and ≥ 0.90 for CPT code 99406.
Limited performance for intensive counseling (99407) with F1 ≤ 0.56.
Improved eligibility detection and billing accuracy with hierarchical classification over flat models.

Abstract

Abstract Objective To evaluate CigStopper, a machine learning algorithm designed to predict billing eligibility for tobacco cessation counseling (CPT 99406/99407), addressing persistent underbilling and documentation gaps in health systems. Materials and Methods We trained CigStopper on a 40,000-note corpus comprising real-world de-identified clinical notes, synthetically generated notes, and a blended dataset. Notes were categorized by billing eligibility and smoking documentation. Random Forest models were trained and evaluated using both flat multiclass and hierarchical classification approaches. Performance was assessed on a 20% holdout set of real notes using standard metrics (accuracy, precision, recall, F1). Results Models trained on real or blended datasets achieved high performance for billing eligibility (F1 ≥ 0.97) and 99406 prediction (F1 ≥ 0.90). Prediction for intensive counseling (99407) remained limited (F1 ≤ 0.56). Synthetic-only training resulted in overfitting, with poor generalization to real-world data. Hierarchical classification improved eligibility detection and CPT code prediction compared with flat multiclass models. Discussion Findings demonstrate that blended datasets mitigate class imbalance and improve generalizability, while hierarchical architectures enhance performance on billing tasks. Persistent gaps in 99407 prediction were related to low training volume, likely reflecting documentation and coding culture rather than model limitations, underscoring systemic issues in clinical note content. Conclusion CigStopper demonstrates feasibility as a scalable NLP-based billing validation tool. By automating tobacco cessation CPT coding, the algorithm can improve data integrity, reduce missed reimbursement, and support health systems in aligning clinical care with financial and population health priorities.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Derek J Baughman

Layth Qassem

Lina Sulieman

Journals

JAMIA Open

Actions

Institutions

Icahn School of Medicine at Mount Sinai

Vanderbilt University Medical Center

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Real-time automated billing for tobacco treatment: performance evaluation of the CigStopper machine learning framework

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study