Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/geoscienceaustralia/nhi-tsed

Extraction and analysis of weather station observations for thunderstorm events
https://github.com/geoscienceaustralia/nhi-tsed

extremes hazards thunderstoms timeseries timeseries-analysis weather

Last synced: 4 days ago
JSON representation

Extraction and analysis of weather station observations for thunderstorm events

Awesome Lists containing this project

README

        

# Thunderstorm event database

To classify events into convective or non-convective storms based on analysis of evolution of wind speed, temperature, station pressure and other meteorlogical variables.

This code uses `sktime`[^1] to perform a supervised classification of the time series of weather variables. A training event set is developed from visual classification of wind gust events over 90 km/h at high-quality stations. The full event set comprises all storm events with a maximum wind gust in excess of 60 km/h. The approach is based on previous concepts from Spassiani and Mason (2021)[^2] and Cook (2023)[^3].

Quality flags are used to eliminate spurious gusts, but there still remain some anomalous events (instrument spikes).

Data from this analysis is used to create a geodatabase for publication and use in GIS applications.

This work has been supported by Queensland Fire and Emergency Services.

## Data source

1-minute weather observations from all available weather stations across Australia.

Variables required:
- gust wind speed
- wind speed
- wind direction
- temperature
- dew point
- station pressure
- rainfall
- quality flags for all variables

Original source data can be requested from the Bureau of Meteorology, Climate Data Services team,

## Requirements

- python 3.9
- jupyter
- numpy
- matplotlib
- pandas
- pytz
- lxml
- prov
- seaborn
- sktime-all-extras
- gitpython
- fiona
- geopandas
- cartopy
- scikit-learn
- metpy
- pycodestyle

## Installation

1. Build the conda environment
2. Install additional code from https://github.com/GeoscienceAustralia/nhi-pylib
3. Add the path to the additional code to the `PYTHONPATH` environment variable

## Process

1. `extractStationDetails.py` - extract station details from the raw data. Creates a geojson file of station locations used in subsequent scripts
2. `extractStationData.py` - extract all events from the raw data. This should be executed twice, initially with a threshold of 90 km/h and again with a threshold of 60 km/h. The outputs for each execution need to be stored in different folders. Users will need to check the path to the original source files (`OriginDir` in the configuration files.)
3. `selectHQStations.ipynb` - use this to select high-quality stations that will form the training dataset for the ML classification process
4. `classifyGustEvents.py` - classifies all daily maximum wind gusts using El Rafei et al. (2023)[^5]
5. `ClassifyEvents.ipynb` - interactive notebook to visually classify storms with maximum gust > 90 km/h at high-quality stations
6. `classifyTimeSeries.py` - use ML approach in `sktime` to classify all storm events (> 60 km/h) [^4]
7. `convertStormCounts.py` - convert classified storms to counts of storm type at each station
8. `joinStormRatesToStationDetails.py` - Join storm rate data to station details and store output in GeoJSON file
9. `joinStormClassToMaxDailyData.py` - Join storm classification data to daily maximum gust data
10. `joinDailyWeatherObservations.py`- Join the daily weather observations to the storm data

## Analysis
1. `AnalyseClassifiedStorms.ipynb` - interactive notebook to compare this classification against the El Rafei _et al._[^5] gust classification
2. `analyseStormEventTypes.ipynb` - interactive notebook to examine the full classified storm event set, e.g. median and 99th percentile wind gusts for each storm type, seasonal distribution of storm type, comparison against other metrics (e.g. gust ratio, emergence). Still a work in progress.

## Products

_Link to eCat record for the data_

## Licence

_Add a licence_

## References

[^1]: http://www.sktime.net/en/latest/index.html
[^2]: Spassiani, A. C., and M. S. Mason, 2021: Application of Self-organizing Maps to classify the meteorological origin of wind gusts in Australia. _Journal of Wind Engineering and Industrial Aerodynamics_, **210**, 104529, https://doi.org/10.1016/j.jweia.2021.104529.
[^3]: Cook, N. J., 2023: Automated classification of gust events in the contiguous USA. _Journal of Wind Engineering and Industrial Aerodynamics_, **234**, 105330, https://doi.org/10.1016/j.jweia.2023.105330.
[^4]: Dempster, A., F. Petitjean, and G. I. Webb, 2020: ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels. Data Min Knowl Disc, 34, 1454–1495, https://doi.org/10.1007/s10618-020-00701-z.
[^5]: El Rafei, M., S. Sherwood, J. Evans, and A. Dowdy, 2023: Analysis and characterisation of extreme wind gust hazards in New South Wales, Australia. _Nat Hazards_, **117**, 875–895, https://doi.org/10.1007/s11069-023-05887-1.