Detecting bimodality in astronomical datasets

Abstract

We discuss statistical techniques for detecting and quantifying bimodality in astronomical datasets. We concentrate on the KMM algorithm, which estimates the statistical significance of bimodality in such datasets and objectively partitions data into sub-populations. By simulating bimodal distributions with a range of properties we investigate the sensitivity of KMM to datasets with varying characteristics. Our results facilitate the planning of optimal observing strategies for systems where bimodality is suspected. Mixture-modeling algorithms similar to the KMM algorithm have been used in previous studies to partition the stellar population of the Milky Way into subsystems. We illustrate the broad applicability of KMM by analysing published data on globular cluster metallicity distributions, velocity distributions of galaxies in clusters, and burst durations of gamma-ray sources. PostScript versions of the tables and figures, as well as FORTRAN code for KMM and instructions for its use, are available by anonymous ftp from kula.phsx.ukans.edu.

Reference

Keith A. Ashman, Christina M. Bird, Stephen E. Zepf “Detecting bimodality in astronomical datasets” (1994) DOI: 10.1086/117248

@Article{ashman1994,
  title = {Detecting bimodality in astronomical datasets},
  abstract = {We discuss statistical techniques for detecting and quantifying bimodality in astronomical datasets. We concentrate on the KMM algorithm, which estimates the statistical significance of bimodality in such datasets and objectively partitions data into sub-populations. By simulating bimodal distributions with a range of properties we investigate the sensitivity of KMM to datasets with varying characteristics. Our results facilitate the planning of optimal observing strategies for systems where bimodality is suspected. Mixture-modeling algorithms similar to the KMM algorithm have been used in previous studies to partition the stellar population of the Milky Way into subsystems. We illustrate the broad applicability of KMM by analysing published data on globular cluster metallicity distributions, velocity distributions of galaxies in clusters, and burst durations of gamma-ray sources. PostScript versions of the tables and figures, as well as FORTRAN code for KMM and instructions for its use, are available by anonymous ftp from kula.phsx.ukans.edu.},
  volume = {108},
  issn = {0004-6256},
  doi = {10.1086/117248},
  journal = {The Astronomical Journal},
  publisher = {American Astronomical Society},
  author = {Ashman, Keith A. and Bird, Christina M. and Zepf, Stephen E.},
  year = {1994},
  month = {dec},
  pages = {2348}
}