E-statistics (energy statistics)

Research and software related to E-statistics

Gabor J. Szekely, National Science Foundation
Maria L. Rizzo, Bowling Green State University, email:

Software for R: energy

This software is distributed under the GNU General Public License, version 2 or later. See COPYING for the license.
Energy statistics are implemented in the contributed package energy for R.
R is a free software environment for statistical computing and graphics, available at CRAN.

Questions or comments on software: Maria Rizzo, email address above

Current version energy_1.2-0 released 27-Sept-2010.

Description of energy package

Package: energy
Title: E-statistics (energy statistics)
Version: 1.2-0
Date: 2010-09-27
Author: Maria L. Rizzo and Gabor J. Szekely
Description: E-statistics (energy) tests and statistics for comparing distributions: multivariate normality, multivariate distance components and k-sample test for equal distributions, hierarchical clustering by e-distances, multivariate independence tests, distance correlation, goodness-of-fit tests. The energy-statistics concept, based on a generalization of Newton's potential energy, is due to Gabor J. Szekely.
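
As an illustration of the kind of statistic the description refers to, the following is a minimal sketch of a two-sample energy statistic, written in plain Python rather than R. It is not the package's code: the function names (euclid, mean_dist, energy_statistic) are hypothetical, and the formula used, (n1*n2/(n1+n2)) * (2*A - B - C) with A the mean between-sample distance and B, C the mean within-sample distances, is assumed here as the usual form of the statistic for testing equal distributions.

```python
import math
from itertools import product


def euclid(a, b):
    """Euclidean distance between two points given as equal-length tuples."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def mean_dist(xs, ys):
    """Mean of all pairwise distances between points of xs and points of ys.

    When xs is ys, the zero self-distances are included (V-statistic form).
    """
    total = sum(euclid(a, b) for a, b in product(xs, ys))
    return total / (len(xs) * len(ys))


def energy_statistic(xs, ys):
    """Two-sample energy statistic (illustrative sketch, not the R package)."""
    n1, n2 = len(xs), len(ys)
    between = mean_dist(xs, ys)    # A: mean distance across the two samples
    within_x = mean_dist(xs, xs)   # B: mean distance inside the first sample
    within_y = mean_dist(ys, ys)   # C: mean distance inside the second sample
    return n1 * n2 / (n1 + n2) * (2 * between - within_x - within_y)


if __name__ == "__main__":
    x = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
    y = [(3.0, 3.0), (4.0, 3.0), (3.0, 4.0)]
    print(energy_statistic(x, y))
```

The statistic is zero when the two samples coincide and grows as the samples separate. In practice a test based on it would need a reference distribution, for example one obtained by permuting the pooled sample; that step is not shown here.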

References

  1. Maria L. Rizzo and Gabor J. Szekely (2010). DISCO Analysis: A Nonparametric Extension of Analysis of Variance, Annals of Applied Statistics, Vol. 4, No. 2, 1034-1055.
  2. Gabor J. Szekely and Maria L. Rizzo (2009). Brownian Distance Covariance,
    Annals of Applied Statistics, Vol. 3, No. 4, 1236-1265. doi:10.1214/09-AOAS312
  3. Gabor J. Szekely and Maria L. Rizzo (2009). Rejoinder: Brownian Distance Covariance, Annals of Applied Statistics, Vol. 3, No. 4, 1303-1308. doi:10.1214/09-AOAS312REJ
  4. Maria L. Rizzo (2009). New Goodness-of-Fit Tests for Pareto Distributions, ASTIN Bulletin: Journal of the International Association of Actuaries, 39/2, 691-715.
  5. G. J. Szekely, M. L. Rizzo, and N. K. Bakirov (2007). Measuring and Testing Independence by Correlation of Distances, Annals of Statistics, Vol. 35, No. 6, 2769-2794. http://dx.doi.org/10.1214/009053607000000505.
  6. Bakirov, N. K., Rizzo, M. L., and Szekely, G. J. (2006). A Multivariate Nonparametric Test of Independence, Journal of Multivariate Analysis, Volume 97, Issue 8, September 2006, Pages 1742-1756. http://dx.doi.org/10.1016/j.jmva.2005.10.005.
  7. Szekely, G. J. and Rizzo, M. L. (2005) Hierarchical Clustering via Joint Between-Within Distances: Extending Ward's Minimum Variance Method,
    Journal of Classification, 22(2) 151-183. http://dx.doi.org/10.1007/s00357-005-0012-9.
  8. Szekely, G. J. and Rizzo, M. L. (2005) A New Test for Multivariate Normality,
    Journal of Multivariate Analysis, 93/1, 58-80. http://dx.doi.org/10.1016/j.jmva.2003.12.002.
  9. Szekely, G. J. and Rizzo, M. L. (2004b) Mean Distance Test of Poisson Distribution,
    Statistics and Probability Letters, 67/3, 241-247. http://dx.doi.org/10.1016/j.spl.2004.01.005.
  10. Rizzo, M. L. (2003) Hierarchical Clustering Based on a Generalized Measure of Homogeneity,
    2003 Proceedings of the Joint Statistical Meetings, American Statistical Association, Section for Physical and Engineering Sciences [CD-ROM], Alexandria, VA: American Statistical Association.
  11. Szekely, G. J. and Rizzo, M. L. (2004) Testing for Equal Distributions in High Dimension, InterStat, Nov. (5).
  12. Rizzo, M. L. (2005) Minimum Energy Clustering, Proceedings of Interface/Classification Society of North America, Joint Annual Meeting, 2005.
  13. Rizzo, M. L. (2002a). A Test of Homogeneity for Two Multivariate Populations,
    2002 Proceedings of the American Statistical Association, Physical and Engineering Sciences Section [CD-ROM], Alexandria, VA: American Statistical Association.
  14. Rizzo, M. L. (2002b). A New Rotation Invariant Goodness-of-Fit Test, Ph.D. dissertation, Bowling Green State University.
  15. Szekely, G. J. (2000) E-statistics: Energy of Statistical Samples, Bowling Green State University, Department of Mathematics and Statistics Technical Report No. 03-05.
  16. Szekely, G. J. (1989) Potential and Kinetic Energy in Statistics, Lecture Notes, Budapest Institute of Technology (Technical University).

disco

The disco (DIStance COmponents) function and test were added in energy version 1.2-0 (27-Sept-2010). disco provides a nonparametric approach to the analysis of structured data, using distance components rather than variance components. The statistic is related to, but not equivalent to, the ksample statistic. A disco method has also been added to the eqdist.etest function and the corresponding eqdist.e statistic.
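
To make the idea of distance components concrete, here is a minimal sketch in plain Python of a one-way decomposition of total dispersion into between-sample and within-sample parts. It is an assumption-laden illustration, not the package's disco function: the names (dist, g, disco_decomposition) and the exponent parameter alpha are hypothetical, and the definitions used (total = (N/2) g(A, A) for the pooled sample A, within = sum over groups of (n_j/2) g(A_j, A_j), between = total - within, where g is a mean of pairwise distances raised to the power alpha) are taken to follow reference 1 above.

```python
import math


def dist(a, b, alpha=1.0):
    """Euclidean distance between two tuples, raised to the power alpha."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b))) ** alpha


def g(xs, ys, alpha=1.0):
    """Mean of dist(a, b, alpha) over all pairs a in xs, b in ys."""
    total = sum(dist(a, b, alpha) for a in xs for b in ys)
    return total / (len(xs) * len(ys))


def disco_decomposition(groups, alpha=1.0):
    """Split total dispersion into between and within distance components.

    groups: list of samples, each a list of equal-length tuples.
    Returns a dict with total, within, between and the ratio
    (between / (K - 1)) / (within / (N - K)).
    """
    pooled = [point for grp in groups for point in grp]
    n_total, k = len(pooled), len(groups)
    total = n_total / 2 * g(pooled, pooled, alpha)
    within = sum(len(grp) / 2 * g(grp, grp, alpha) for grp in groups)
    between = total - within
    if within > 0:
        ratio = (between / (k - 1)) / (within / (n_total - k))
    else:
        ratio = float("inf")  # all groups are degenerate (no within spread)
    return {"total": total, "within": within, "between": between, "ratio": ratio}


if __name__ == "__main__":
    samples = [[(0.0,), (2.0,)], [(4.0,), (6.0,)]]
    print(disco_decomposition(samples, alpha=1.0))
    print(disco_decomposition(samples, alpha=2.0))
```

With alpha = 2 and univariate data, the three components reduce to the familiar ANOVA sums of squares, which is one way to check the sketch. Significance of the ratio would be assessed separately, for instance by permuting group labels; that step is omitted here.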

Distance Correlation

The dcov package has been merged into the energy package as of version 1.1-0 (available on CRAN 07-Apr-2008). Please update energy and uninstall dcov.

MATLAB

Some functions in energy have been translated to MATLAB.
Minimum energy clustering is implemented for MATLAB in elinkage.m (Example 1, Example 2).
