An Empirical Approach to Determine a Threshold for Assessing Overdispersion in Poisson and Negative Binomial Models for Count Data
Year of Publication
Payne, Elizabeth H.; Gebregziabher, Mulugeta; Hardin, James W.; Ramakrishnan, Viswanathan; Egede, Leonard E.
Communications in Statistics - Simulation and Computation
count data; outliers; overdispersion; Pearson chi-square; zero inflation
Overdispersion is a problem encountered in the analysis of count data that can lead to invalid inference if unaddressed. Decision about whether data are overdispersed is often reached by checking whether the ratio of the Pearson chi-square statistic to its degrees of freedom is greater than one; however, there is currently no fixed threshold for declaring the need for statistical intervention. We consider simulated cross-sectional and longitudinal datasets containing varying magnitudes of overdispersion caused by outliers or zero inflation, as well as real datasets, to determine an appropriate threshold value of this statistic which indicates when overdispersion should be addressed.