Ryan Lagerquist: Using Deep Learning to Improve Prediction and Understanding of High-Impact Weather

Members - Faculty, students, and collaborators
News - Recent news and publicty from members of the IDEA lab
Theses and Dissertations - Publications and code releases for student theses and disserations
Publications - Recent technical papers and presentations
Software - Recent software releases
 

Abstract

This dissertation describes the application of convolutional neural networks (CNN), a type of deep-learning method, to high-impact weather. CNNs are specially designed to learn directly from spatial grids, which improves both skill and interpretability.

Specifically, I develop and test CNNs for two tasks. The first is tornado prediction, where the two CNNs predict next-hour tornado occurrence for a given storm, using datasets similar to those used by forecasters in real-time operations. The tornado models achieve an area under the receiver-operating-characteristic curve (AUC) of 0.94 and critical success index (CSI) of 0.30. This is competitive with a machine-learning (ML) model currently used in operations, which suggests that the CNNs would also be suitable for operations. Novel ML-interpretation methods highlight the importance of a deep reflectivity core and strong mesocyclone, as well as low-level instability and wind shear in the surrounding environment. Also, interpretation methods suggest that a rear-flank downdraft with too much precipitation and negative buoyancy can lead to tornadogenesis failure, which corroborates some previous literature.

The second task is front detection, where the CNN draws warm and cold fronts in reanalysis data. When evaluated on human-identified fronts, the CNN achieves a CSI of 0.29 and frequency bias of 2.03. Although this is a high frequency bias, visual analysis suggests that the humans often miss fronts, so this ``overdetection'' is considered acceptable. I use the CNN-detected fronts to create a 40-year climatology over North America and the surrounding oceans. To my knowledge, this is the only climatology that separates fronts by type (warm or cold), one of only two climatologies that investigate variability with respect to the El Nino Southern Oscillation (ENSO) over a continent-scale or larger domain, and one of only three that investigate long-term trends over a continent-scale or larger domain. I find that fronts are most common in the mid-latitude cyclone track, which migrates poleward from winter to summer, equatorward during El Nino, and poleward during La Nina. Also, I find that the cyclone track appears to be migrating poleward as a consequence of global warming. These results are broadly consistent with previous climatologies, although there are some discrepancies and interesting results that warrant future investigation.

To summarize, my contributions to atmospheric and data science include creating tornado models that perform competitively with the state of the art, using novel interpretation methods to understand physical relationships learned by the tornado models, creating a front-detection model that appears to overcome some deficiencies of human labels, using the front-detection model to create a climatology, and identifying similarities and discrepancies to the few pre-existing climatologies. Overall, this dissertation demonstrates that deep learning can be used to advance both the prediction and understanding of high-impact weather.

Dissertation

Ryan Lagerquist (2020). Using Deep Learning to Improve Prediction and Understanding of High-Impact Weather. PhD Thesis, School of Meteorology, University of Oklahoma.

Related publications and presentations

  • Lagerquist, Ryan; McGovern, Amy; Homeyer, Cameron R; Gagne II, David John; Smith, Travis. (2020) Deep Learning on Three-dimensional Multiscale Data for Next-hour Tornado Prediction. Monthly Weather Review. [https://doi.org/10.1175/MWR-D-19-0372.1]
  • Lagerquist, Ryan; Allen, John T; McGovern, Amy. (2020) Climatology and Variability of Warm and Cold Fronts over North America from 1979-2019. Journal of Climate. [https://doi.org/10.1175/JCLI-D-19-0680.1]
  • McGovern, A., R. Lagerquist, and D. Gagne (2020) Using machine learning and model interpretation and visualization techniques to gain physical insights in atmospheric science. Proceedings of the International Conference on Learning Representations, [electronically published].
  • McGovern, A., D.J. Gagne II, R. Lagerquist, K. Elmore, and G.E. Jergensen (2019) Making the black box more transparent: Understanding the physical implications of machine learning. Bulletin of the American Meteorological Society, Volume 100, Number 11, Pages 2175-2199. [paper on BAMS website]
  • Lagerquist, R., A. McGovern, and D.J. Gagne II. (2019) Deep learning for spatially explicit prediction of synoptic-scale fronts. Weather and Forecasting, Volume 34, Number 4, Pages 1137-1160. [paper on WAF website]

Code

  • GewitterGefahr is an end-to-end machine-learning library for predicting thunderstorm hazards, primarily tornadoes and damaging straight-line wind. The machine-learning methods are storm-centered, which means that each case is one storm object (one storm cell at one time step). "End-to-end" means that this library includes code for data acquisition and pre-processing; training, validation, and testing of machine-learning models; and post-processing of machine-learning output.

Data

  • Data from the dissertation is available upon request from the authors. Once OURRstore is online, we will make a link to the public facing webpage.

 


Created by amcgovern [at] ou.edu.

Last modified June 12, 2020 11:32 AM