Group testing pools a number of units together and performs a single test on the resulting group. It is an appealing option when few individual units are thought to be infected, because it reduces testing costs compared with testing each unit individually. Group testing aims either to identify the positive groups among all groups tested or to estimate the proportion of positives (p) in a population. Interval estimation methods for proportions in group testing with unequal group sizes, adjusted for overdispersion, have been examined previously. Recent improvements in statistical methods allow the construction of highly accurate confidence intervals (CIs). The aim here is to apply group testing for estimation and, in particular, to generate highly accurate bootstrap CIs for the proportion of defective or positive units. This study compares several established methods of constructing CIs for a binomial proportion after adjusting for overdispersion in group testing with groups of unequal sizes. Bootstrap resampling was applied to data simulated from a binomial distribution, and confidence intervals with high coverage probabilities were produced. The data were assumed to be overdispersed, independent between groups but correlated within groups. Interval estimation methods based on the Wald, logit and complementary log-log (CLL) functions were considered. The main criterion used in the comparisons is the coverage probability attained by nominal 95% CIs, though interval width is also considered. Bootstrapping produced CIs with high coverage probabilities for each of the three interval methods.
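To make the estimation idea concrete, the following is a minimal sketch in Python and not the authors' procedure. It assumes a perfect test, so that a group of size s tests positive with probability 1 - (1 - p)^s. It estimates p by maximum likelihood for groups of unequal sizes and forms a percentile bootstrap interval by resampling whole groups. The overdispersion adjustment and the Wald, logit and CLL intervals studied in the paper are deliberately omitted. The function names (`simulate_groups`, `mle_proportion`, `bootstrap_ci`) and all parameter values are hypothetical choices made for illustration.

```python
import math
import random


def simulate_groups(p, sizes, seed=0):
    """Simulate one 0/1 test result per group; a group is positive if any unit is."""
    rng = random.Random(seed)
    return [int(any(rng.random() < p for _ in range(s))) for s in sizes]


def mle_proportion(sizes, results, tol=1e-10):
    """MLE of p when group i tests positive with probability 1 - (1 - p)**sizes[i].

    The log-likelihood is concave in p, so its derivative (the score) is
    decreasing and the root can be found by bisection.
    """
    if not any(results):
        return 0.0  # no positive groups: likelihood is maximised at p = 0
    if all(results):
        return 1.0  # all groups positive: likelihood is maximised at p = 1

    def score(p):
        q = 1.0 - p
        total = 0.0
        for s, y in zip(sizes, results):
            if y:
                total += s * q ** (s - 1) / (1.0 - q ** s)
            else:
                total -= s / q
        return total

    lo, hi = 1e-12, 1.0 - 1e-12
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if score(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def bootstrap_ci(sizes, results, n_boot=2000, level=0.95, seed=0):
    """Percentile bootstrap CI for p, resampling (size, result) pairs with replacement."""
    rng = random.Random(seed)
    pairs = list(zip(sizes, results))
    estimates = []
    for _ in range(n_boot):
        sample = [rng.choice(pairs) for _ in pairs]
        s, y = zip(*sample)
        estimates.append(mle_proportion(s, y))
    estimates.sort()
    alpha = 1.0 - level
    lo_idx = int(math.floor(alpha / 2.0 * (n_boot - 1)))
    hi_idx = int(math.ceil((1.0 - alpha / 2.0) * (n_boot - 1)))
    return estimates[lo_idx], estimates[hi_idx]


if __name__ == "__main__":
    # Hypothetical design: 60 groups with three different sizes, true p = 0.05.
    sizes = [5] * 20 + [10] * 20 + [20] * 20
    results = simulate_groups(0.05, sizes, seed=1)
    print("estimate:", mle_proportion(sizes, results))
    print("95% percentile bootstrap CI:", bootstrap_ci(sizes, results, n_boot=500))
```

Resampling whole groups rather than individual units keeps each group's size paired with its test result, which matters when group sizes differ. A fuller treatment along the lines of the abstract would replace the plain binomial model with one that allows within-group correlation.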
Cite this paper
Mwangi, O., Islam, A. and Luke, O. (2015) Bootstrap Confidence Intervals for Proportions of Unequal Sized Groups Adjusted for Overdispersion. Open Journal of Statistics, 502-510. doi: 10.4236/ojs.2015.56052