8. Extreme Value Analysis using Extremes.jl¶

This module provides an easy-to-use wrapper for the Extremes.jl Julia package, enabling seamless integration with xarray for extreme value analysis. However, do note that juliacall is not installed by default when installing xHydro. Consult the installation page for instructions.

The Extremes.jl package is specifically designed for analyzing extreme values and offers a variety of powerful features:

Block Maxima and Threshold Exceedance methods, including popular distributions such as genextreme, gumbel_r, and genpareto.
Flexible parameter estimation techniques, supporting methods like Probability-Weighted Moments (PWM), Maximum Likelihood Estimation (MLE), and Bayesian Estimation.
Compatibility with both stationary and non-stationary models for flexible modeling of future extreme events.
Return level estimation for quantifying the risk of extreme events over different return periods.

For further information on the Extremes.jl package, consult the following resources:

[1]:

import os

os.environ["PYTHON_JULIACALL_AUTOLOAD_IPYTHON_EXTENSION"] = (
    "no"  # To prevent random crashes with GitHub's testing interface
)

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import pooch
from IPython.display import clear_output

import xhydro.extreme_value_analysis as xhe
from xhydro.testing.helpers import deveraux

clear_output(wait=False)

8.5. Working with `dask.array` Chunks¶

Currently, the Python-to-Julia interaction is not thread-safe. To mitigate potential issues, it is recommended to use the dask.scheduler="processes" option when computing results. This ensures that tasks are executed in separate Python processes, providing better isolation and avoiding thread-related conflicts.

[ ]:

ds_c = ds.chunk({"time": -1, "station_num": 1})

fit_stationary_c = xhe.fit(
    ds_c,
    dist="genextreme",
    method="ml",
    variables=["total_precip"],
    confidence_level=0.95,
)
fit_stationary_c = fit_stationary_c.compute(scheduler="processes")
clear_output(wait=False)

[14]:

fit_stationary_c

[14]:

<xarray.Dataset> Size: 460B
Dimensions:             (station_num: 5, dparams: 3)
Coordinates:
  * station_num         (station_num) int64 40B 1001 1004 1008 1009 1012
  * dparams             (dparams) <U5 60B 'shape' 'loc' 'scale'
Data variables:
    total_precip        (station_num, dparams) float64 120B dask.array<chunksize=(1, 3), meta=np.ndarray>
    total_precip_lower  (station_num, dparams) float64 120B dask.array<chunksize=(1, 3), meta=np.ndarray>
    total_precip_upper  (station_num, dparams) float64 120B dask.array<chunksize=(1, 3), meta=np.ndarray>

8. Extreme Value Analysis using Extremes.jl¶

8.1. Data acquisition¶

8.2. Parameter estimation¶

8.3. Return levels¶

8.4. Non-stationary model¶

8.4.1. Comparison of the return level using the stationary and non-stationary model¶

8.5. Working with `dask.array` Chunks¶

8. Extreme Value Analysis using Extremes.jl¶

8.1. Data acquisition¶

8.2. Parameter estimation¶

8.3. Return levels¶

8.4. Non-stationary model¶

8.4.1. Comparison of the return level using the stationary and non-stationary model¶

8.5. Working with dask.array Chunks¶

8.5. Working with `dask.array` Chunks¶