Book: Python Data Analysis Cookbook
Author: Ivan Idris
Chapter word count: 245
Updated: 2021-07-14 11:05:43

Fitting aggregated data to the gamma distribution

The gamma distribution can be used to model the size of insurance claims, rainfall, and the distribution of inter-spike intervals in the brain. The PDF of the gamma distribution is defined by shape k and scale θ as follows:

$$f(x; k, \theta) = \frac{x^{k-1} e^{-x/\theta}}{\Gamma(k)\,\theta^{k}}, \quad x > 0,\ k > 0,\ \theta > 0$$
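As a quick sanity check, the shape-and-scale density x^(k-1) e^(-x/θ) / (Γ(k) θ^k) can be compared numerically with SciPy's implementation. This is a minimal sketch: the helper name gamma_pdf and the values of k and θ are illustrative assumptions, not part of the recipe.

```python
import math

import numpy as np
from scipy.stats import gamma


def gamma_pdf(x, k, theta):
    """Gamma density with shape k and scale theta, valid for x > 0."""
    return x ** (k - 1) * np.exp(-x / theta) / (math.gamma(k) * theta ** k)


x = np.linspace(0.1, 20, 50)
k, theta = 2.5, 1.8  # arbitrary illustrative values (assumption)

# SciPy calls the shape parameter `a`; `scale` plays the role of theta.
matches = np.allclose(gamma_pdf(x, k, theta), gamma.pdf(x, a=k, scale=theta))
print(matches)
```

If the two agree, the hand-written formula and SciPy's shape/scale arguments describe the same density.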

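There is also a definition that uses an inverse scale (rate) parameter β = 1/θ. SciPy's gamma takes the shape as a together with loc and scale arguments, so a rate β corresponds to scale=1/β. The mean and variance of the gamma distribution are described by (3.3) and (3.4):

$$\operatorname{E}[X] = k\theta \qquad (3.3)$$

$$\operatorname{Var}[X] = k\theta^{2} \qquad (3.4)$$

Dividing the square of (3.3) by (3.4) eliminates θ, so the shape parameter can be estimated from the mean and variance as k = E[X]² / Var[X].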
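The algebra can be illustrated on synthetic data. The sketch below draws samples from a gamma distribution with known parameters (chosen arbitrarily for illustration) and recovers the shape and scale from the sample mean and variance, using mean = kθ and variance = kθ².

```python
import numpy as np

rng = np.random.default_rng(42)
true_k, true_theta = 3.0, 2.0  # illustrative values (assumption)
sample = rng.gamma(shape=true_k, scale=true_theta, size=100_000)

mean, var = sample.mean(), sample.var()

# mean**2 / var = (k*theta)**2 / (k*theta**2) = k
k_hat = mean ** 2 / var
# var / mean = (k*theta**2) / (k*theta) = theta
theta_hat = var / mean

print(k_hat, theta_hat)
```

With a large sample the estimates should land close to the true values; with only a handful of monthly aggregates they are better treated as rough starting points.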

How to do it...

Let's fit aggregates for the rain data for January to the gamma distribution:

Start with the following imports:

from scipy.stats.distributions import gamma
import matplotlib.pyplot as plt
import dautil as dl
import pandas as pd
from IPython.display import HTML

Load the data and select aggregates for January:

# Monthly means of the rainfall series (older pandas versions applied the mean by default)
rain = dl.data.Weather.load()['RAIN'].resample('M').mean().dropna()
rain = dl.ts.groupby_month(rain)
rain = rain.get_group(1)
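The selection step can be tried without the dautil weather data. The sketch below uses synthetic daily rainfall, takes the mean as the monthly aggregate, and assumes that dl.ts.groupby_month groups observations by the calendar month of their index; plain pandas stands in for the dautil helpers.

```python
import numpy as np
import pandas as pd

# Synthetic daily rainfall; a stand-in for the weather data used in the recipe.
rng = np.random.default_rng(0)
days = pd.date_range("2000-01-01", "2004-12-31", freq="D")
daily = pd.Series(rng.gamma(shape=2.0, scale=1.5, size=len(days)), index=days)

# One mean per calendar month (the mean is an assumed choice of aggregate).
monthly = daily.groupby(daily.index.to_period("M")).mean().dropna()

# Keep only the January aggregates, one value per year.
january = monthly[monthly.index.month == 1]
print(len(monthly), len(january))
```

Grouping by monthly periods is used here instead of resample so the sketch does not depend on a particular frequency alias.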

Derive a value for k from the mean and variance of the distribution, and use it to fit the data:

dist = dl.stats.Distribution(rain, gamma)

a = (dist.mean() ** 2)/dist.var()
shape, loc, scale = dist.fit(a)
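For readers without dautil, a comparable fit can be sketched with SciPy alone. This assumes that dl.stats.Distribution.fit forwards its argument to SciPy's fit method as a starting value for the shape, which the recipe does not state; the data here is synthetic.

```python
import numpy as np
from scipy.stats import gamma

# Synthetic positive data standing in for the January rainfall aggregates.
rng = np.random.default_rng(1)
data = rng.gamma(shape=4.0, scale=0.5, size=5_000)

# Method-of-moments estimate of the shape, used as the starting guess.
a0 = data.mean() ** 2 / data.var()

# Positional arguments after the data are starting values for the shape;
# fit returns the estimated shape, location, and scale.
shape, loc, scale = gamma.fit(data, a0)
print(shape, loc, scale)
```

Passing floc=0 to fit would pin the location at zero, which is a common choice for strictly positive data; the sketch leaves it free so that three values come back, as in the recipe.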

The rest of the code is similar to the code in Fitting data to the exponential distribution. Refer to the following screenshot for the end result (the code is in the fitting_gamma.ipynb file in this book's code bundle):
