Statistics

Design

Contains utilities for the design of experiments.

This module defines utility functions for various experimental design methods.

calisim.statistics.design.get_full_factorial_design(parameter_spec: ParameterSpecification) ndarray[source]

Get a full factorial design from a parameter specification.

Parameters:

parameter_spec (ParameterSpecification) – The parameter specification.

Returns:

The full factorial design.

Return type:

np.ndarray

Distance Metrics

Contains utilities for calculating discrepancy values.

This module defines utility functions for discrepancies that can be used to compare simulated and observational data.

class calisim.statistics.discrepancy.BrayCurtisDistance[source]

The Bray-Curtis distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.CanberraDistance[source]

The Canberra distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.ChebyshevDistance[source]

The Chebyshev distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.CorrelationDistance[source]

The correlation distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.CosineDistance[source]

The Cosine distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.DistanceMetricBase[source]

The distance metric abstract class.

abstract calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

Raises:

NotImplementedError – Error raised for the unimplemented abstract method.

class calisim.statistics.discrepancy.EnergyDistance[source]

The energy distance distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.GaussianLogLikelihood[source]

The Gaussian log likelihood.

calculate(observed: ndarray, simulated: ndarray, sigma: float | ndarray | None = None) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. sigma (float | np.ndarray | None, optional): The standard

    deviation. Defaults to None.

class calisim.statistics.discrepancy.JensenShannonDistance[source]

The Jensen-Shannon distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.KlDivergence[source]

The K-L divergence.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.L1Norm[source]

The L1 norm distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.L2Norm[source]

The L2 norm distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.MeanAbsoluteError[source]

The mean absolute error distance.

calculate(observed: ndarray, simulated: ndarray, multioutput: str = 'uniform_average') float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. multioutput (str, optional): Defines aggregating of multiple output values.

    Defaults to ‘uniform_average’.

class calisim.statistics.discrepancy.MeanAbsolutePercentageError[source]

The mean absolute percentage error distance.

calculate(observed: ndarray, simulated: ndarray, multioutput: str = 'uniform_average') float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. multioutput (str, optional): Defines aggregating of multiple output values.

    Defaults to ‘uniform_average’.

class calisim.statistics.discrepancy.MeanPinballLoss[source]

The mean pinball loss distance.

calculate(observed: ndarray, simulated: ndarray, multioutput: str = 'uniform_average') float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. multioutput (str, optional): Defines aggregating of multiple output values.

    Defaults to ‘uniform_average’.

class calisim.statistics.discrepancy.MeanSquaredError[source]

The mean squared error distance.

calculate(observed: ndarray, simulated: ndarray, multioutput: str = 'uniform_average') float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. multioutput (str, optional): Defines aggregating of multiple output values.

    Defaults to ‘uniform_average’.

class calisim.statistics.discrepancy.MedianAbsoluteError[source]

The median absolute error distance.

calculate(observed: ndarray, simulated: ndarray, multioutput: str = 'uniform_average') float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. multioutput (str, optional): Defines aggregating of multiple output values.

    Defaults to ‘uniform_average’.

class calisim.statistics.discrepancy.MinkowskiDistance[source]

The Minkowski distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.MultivariateNormalLogLikelihood[source]

The multivariate normal log likelihood.

calculate(observed: ndarray, simulated: ndarray, cov_matrix: ndarray | None = None, jitter: float = 1e-07) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. cov_matrix (np.ndarray | None, optional): The covariance

    matrix. Defaults to None.

    jitter (float) Jitter to stabilise the inversion of

    the covariance matrix.

class calisim.statistics.discrepancy.PoissonLogLikelihood[source]

The Poisson log likelihood.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

class calisim.statistics.discrepancy.RootMeanSquaredError[source]

The root mean squared error distance.

calculate(observed: ndarray, simulated: ndarray, multioutput: str = 'uniform_average') float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. multioutput (str, optional): Defines aggregating of multiple output values.

    Defaults to ‘uniform_average’.

class calisim.statistics.discrepancy.StudentsTLogLikelihood[source]

The Student’s t log likelihood.

calculate(observed: ndarray, simulated: ndarray, nu: int = 1, sigma: float | ndarray | None = None) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) –

    The simulated data. nu (int, optional): The degrees of freedom for the

    t distribution. Defaults to 1.

    sigma (float | np.ndarray | None, optional): The standard

    deviation. Defaults to None.

class calisim.statistics.discrepancy.WassersteinDistance[source]

The Wasserstein ID distance.

calculate(observed: ndarray, simulated: ndarray) float | ndarray[source]

Calculate the distance between observed and simulated data.

Parameters:
  • observed (np.ndarray) – The observed data.

  • simulated (np.ndarray) – The simulated data.

calisim.statistics.discrepancy.get_distance_metric_func(distance_metric: str) Callable[source]

Get the distance metric function by name.

Parameters:

distance_metric (str) – The distance metric name.

Returns:

The distance metric function.

Return type:

Callable