Skip to contents

Fits a Gaussian mixture model (GMM) to Minor allele frequency (MAF) values and returns the most prominent cluster's mean and weight. Clusters with similar means (within a threshold) are merged. If there are insufficient values or nearly no variance, a simple mean is returned.

Usage

EstimateMAFbyGMM(maf_values)

Arguments

maf_values

Numeric vector of MAF values.

Value

Named numeric vector with elements:

gmm_mean

The mean of the most prominent cluster (capped at 0.5).

gmm_weight

The mixture weight of the most prominent cluster.

gmm_G

Number of clusters after merging adjacent means.

Details

Uses Mclust for Gaussian mixture modeling.