severe%hail%forecasting%evaluation:% machine%learning%and ... · summary fi f1 o1 oj o2...

13
Severe Hail Forecasting Evaluation: Machine Learning and Severe Weather Proxy Variables David John Gagne II 1,2 Amy McGovern 3 , Nate Snook 1 , Ryan Sobash 2 John Williams 4 , Sue Haupt 2 , Ming Xue 1 1. OU CAPS, 2. NCAR, 3. OU Computer Science 4. The Weather Company/WSI AMS Annual Meeting January 12, 2016

Upload: others

Post on 20-Mar-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Severe  Hail  Forecasting  Evaluation:  Machine  Learning  and  Severe  

Weather  Proxy  Variables

David  John  Gagne  II1,2

Amy  McGovern3,  Nate  Snook1,  Ryan  Sobash2

John  Williams4,  Sue  Haupt2,  Ming  Xue1

1.  OU  CAPS,  2.  NCAR,  3.  OU  Computer  Science  4.  The  Weather  Company/WSI

AMS  Annual  MeetingJanuary  12,  2016

Page 2: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

MotivationGoals

• Estimate  hail  size  distributions  with  machine  learning  models  using  input  from  CAM  ensembles  

• Determine  value  added  by  ML  over  hailstorm  proxies

Hailstorm  Proxies• Updraft  Helicity

• Vertically  integrated  product  of  vertical  velocity  and  vorticity• Associated  with  strong,  rotating  updrafts

• Column  Integrated  Graupel• Total  mass  of  graupel in  a  vertical  column• Large  hail  mass  not  always  associated  with  large  hail  diameters

• HAILCAST• Hail  size  estimated  from  growing  simulated  hail  embryos• Triggered  when  vertical  velocity  exceeds  a  threshold

HAILCAST  output  example.

Page 3: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Ensemble  and  Observation  DataSystem CAPS  Ensemble

Dynamical Model WRF-­‐ARW  3.5.1  (2014),  3.6.1  (2015)

Run  Period May-­‐June  2014  and  2015

Grid  Spacing 4 km  in  2014,  3  km  in  2015

Microphysics Thompson,  Morrison,  P3,  Milbrandt and  Yau

PBL MYJ,MYNN,  YSU

Data  Assimilation 3DVAR  with  radar  data  assimilation

Observation  Data

• NOAA  NSSL  Multi-­‐Radar  Multi-­‐Sensor  (MRMS)

• 1  km  radar  mosaic  over  US

• Maximum  Expected  Size  of  Hail  (MESH)

• Used  as  observations  for  machine  learning  models

Page 4: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Potential  Hailstorm  Identification  • Hailstorm  Proxy:  Hourly  Max  Column  Integrated  Graupel

• Enhanced  watershed  (Lakshmanan et  al.  2009)  used  to  identify  storm  “objects”

• Objects  must  have  area  within  specified  range

Page 5: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Hailstorm  TrackingEstimate  Storm  Motion Translate  Objects  and  Match

Use  cross-­‐correlation  filter  to  find  direction  of  motion

Calculate  translated  centroid  distances  and  optimally  match  nearest  pairs

Page 6: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Hailstorm  Matching

fif1

o1oj

o2

• Forecast  and  observed  storm  tracks  are  matched

• Weighted  average  of  space,  time  and  track  properties• 50%:  Start  centroid  Euclidean  distance• 30%:  Start  time  absolute  difference• 10%:  Duration  absolute  difference• 10%:  Mean  area  absolute  difference

• Distance  and  time  differences  are  tightly  constrained

• Duration  and  area  are  loosely  constrained

Page 7: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Gamma  Hail  Size  Distribution

Wide  range  of  hail  sizes  within  MESH  objectLarge  hail  occurs  within  small  area

Distribution  of  MESH  pixelsFit  gamma  PDF  to  MESH  valuesCompress  distribution  information  into  3  values

Page 8: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Machine  Learning  Procedure

Hail:  Yes  or  NoBased  on  track  

matching

Estimate  gamma  shape,  location,  scale  parameters

Hailstorm  object  

removed  from  grid

Draw  N samples  from  forecast  distribution  and  apply  to  proxy  

object  grid  in  rank  order  

• Machine  Learning:  scikit-­‐learn• Open  source  Python  library• Contains  parallelization  options

• Models• Random  Forest• Elastic  Net

• How  to  increase  training  set  size?• Train  one  model  for  full  domain• Group  similar  ensemble  members  into  

same  training  set• Neighborhood  Ensemble  Probability• Spatial  smoothing  and  amplification• Applied  to  ML  and  storm  proxies

Generate  neighborhood  probabilities

Page 9: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Neighborhood  Probability  Performance

Page 10: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Neighborhood  Probability  Performance

Page 11: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Neighborhood  Probability  Performance

Page 12: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Case  Study:  May  27,  2015• Observed  50  mm  or  larger  hail  

within  84  km  is  shown  with  the  blue  contours

• Column-­‐integrated  graupelprobabilities  highest  in  areas  with  non-­‐hail-­‐producing  storms

• ML  models  match  highest  probabilities  with  largest  hail

• Updraft  helicity has  smallest  false  alarm  area

Page 13: Severe%Hail%Forecasting%Evaluation:% Machine%Learning%and ... · Summary fi f1 o1 oj o2 Forecast’and’observed’hail’ stormsare’tracked’ and’matched Machine’learning’predicts’hail’size’

Summaryfi

f1

o1oj

o2

Forecast  and  observed  hail  storms  are  tracked  and  matched

Machine  learning  predicts  hail  size  distribution  parameters ML  methods  outperform  

HAILCAST  but  not  updraft  helicity

ML  methods  are  more  reliable ML  methods  can  discriminate  very  large  hail  regions

Email:  [email protected]:  @DJGagneDos

Work  funded  by  the  Severe  Hail  Analysis,  Representation  and  

Prediction  project  (NSF  AGS-­‐1261776)