TY - JOUR
T1 - A Faster and More Accurate Algorithm for Calculating Population Genetics Statistics Requiring Sums of Stirling Numbers of the First Kind
JF - G3: Genes|Genomes|Genetics
SP - 3959
LP - 3967
DO - 10.1534/g3.120.401575
VL - 10
IS - 11
AU - Chen, Swaine L.
AU - Temme, Nico M.
Y1 - 2020/11/01
UR - http://www.g3journal.org/content/10/11/3959.abstract
N2 - Ewen’s sampling formula is a foundational theoretical result that connects probability and number theory with molecular genetics and molecular evolution; it was the analytical result required for testing the neutral theory of evolution, and has since been directly or indirectly utilized in a number of population genetics statistics. Ewen’s sampling formula, in turn, is deeply connected to Stirling numbers of the first kind. Here, we explore the cumulative distribution function of these Stirling numbers, which enables a single direct estimate of the sum, using representations in terms of the incomplete beta function. This estimator enables an improved method for calculating an asymptotic estimate for one useful statistic, Fu’s . By reducing the calculation from a sum of terms involving Stirling numbers to a single estimate, we simultaneously improve accuracy and dramatically increase speed.
ER -