Skip to main content
U.S. flag

An official website of the United States government

Manual hierarchical clustering of regional geochemical data using a Bayesian finite mixture model

June 29, 2016

Interpretation of regional scale, multivariate geochemical data is aided by a statistical technique called “clustering.” We investigate a particular clustering procedure by applying it to geochemical data collected in the State of Colorado, United States of America. The clustering procedure partitions the field samples for the entire survey area into two clusters. The field samples in each cluster are partitioned again to create two subclusters, and so on. This manual procedure generates a hierarchy of clusters, and the different levels of the hierarchy show geochemical and geological processes occurring at different spatial scales. Although there are many different clustering methods, we use Bayesian finite mixture modeling with two probability distributions, which yields two clusters. The model parameters are estimated with Hamiltonian Monte Carlo sampling of the posterior probability density function, which usually has multiple modes. Each mode has its own set of model parameters; each set is checked to ensure that it is consistent both with the data and with independent geologic knowledge. The set of model parameters that is most consistent with the independent geologic knowledge is selected for detailed interpretation and partitioning of the field samples.

Publication Year 2016
Title Manual hierarchical clustering of regional geochemical data using a Bayesian finite mixture model
DOI 10.1016/j.apgeochem.2016.05.016
Authors Karl J. Ellefsen, David Smith
Publication Type Article
Publication Subtype Journal Article
Series Title Applied Geochemistry
Index ID 70170935
Record Source USGS Publications Warehouse
USGS Organization Crustal Geophysics and Geochemistry Science Center