What question did this study set out to answer?

The aim is to compare the performance of machine learning models with physics-based models in predicting atmospheric rivers on the U.S. West Coast.

February 17, 2026Open Access

Physics‐Based Versus AI Weather Prediction Models: A Comparative Performance Assessment of Atmospheric River Prediction

Key Points

The aim is to compare the performance of machine learning models with physics-based models in predicting atmospheric rivers on the U.S. West Coast.
Evaluated five machine learning models and one physics-based model.
Analyzed 152 daily forecast cycles from November 2023 to March 2024.
Compared variable-specific root mean square error and AR detection skill.
Machine learning models show lower RMSE for specific variables.
HRES outperforms ML models in AR detection for the first four days.
PanguWeather matches HRES skill beyond day four; other ML models are slightly less effective.
Aurora displays the lowest AR detection performance despite good RMSE metrics.

Abstract

Abstract Machine learning (ML) poses a potential paradigm shift in weather forecasting, but critical questions arise regarding its ability to predict high‐impact weather events. This study evaluates five state‐of‐the‐art ML models—Aurora, GraphCast, PanguWeather, FourCastNetV2, FourCastNet—in forecasting U.S. West Coast atmospheric rivers (ARs), compared to the high‐performing physics‐based European Center for Medium‐Range Weather Forecasts' high‐resolution system (HRES) model. Analysis of 152 daily forecast cycles (November 2023–March 2024) reveals significant performance differences between the systems. While ML models often show better variable‐specific root mean square error (RMSE), HRES has superior AR detection skill for the first four forecast days. PanguWeather matches HRES skill beyond day four; other ML models lag slightly. Aurora consistently exhibits the lowest AR detection performance, despite strong variable‐specific RMSE metrics, highlighting a disconnect between RMSE performance and its ability to predict AR events. These findings underscore the need for phenomenon‐specific metrics for ML‐based numerical weather prediction model assessment and operational implementation.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Isaac Davis

Aneesh C. Subramanian

Timothy B. Higgins

Journals

Geophysical Research Letters

Actions

Institutions

University of California, San Diego

University of Colorado Boulder

Scripps Institution of Oceanography

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Physics‐Based Versus AI Weather Prediction Models: A Comparative Performance Assessment of Atmospheric River Prediction

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study