Abstract
In this paper, we provide some new perspectives on sequential experimental designs for statistical inference in the context of big data. A fine-tuned parallel piecewise sequential procedure is developed for estimating the mean of a normal population having an unknown variance. With the help of such fine-tuning, asymptotic unbiasedness of the terminal sample size can be achieved along with the added operational efficiency as a result of utilizing the parallel processing or distributed computing. Theory and methodology will go hand-in-hand followed by illustrations from large-scale data analyses based on simulated data as well as real data from a health study.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aerts, M., Geertsema, J.C.: Bounded length confidence intervals in nonparametric regression. Seq. Anal. 9, 171–192 (1990)
Anscombe, F.J.: Large-sample theory of sequential estimation. Proc. Camb. Philol. Soc. 48, 600–607 (1952)
Anscombe, F.J.: Sequential estimation. J. R. Stat. Soc. Ser. B 15, 1–29 (1953)
Aoshima, M., Mukhopadhyay, N., Kobayashi, Y.: Two-stage procedures for estimating the difference of means when the sampling cost is different. Seq. Anal. 30, 160–171 (2011)
Chow, Y.S., Robbins, H.: On the asymptotic theory of fixed width confidence intervals for the mean. Ann. Math. Stat. 36, 457–462 (1965)
Dantzig, G.B.: On the non-existence of tests of student’s hypothesis having power functions independent of σ. Ann. Math. Stat. 11, 186–192 (1940)
Geertsema, J.C.: Sequential confidence intervals based on rank tests. Ann. Math. Stat. 41, 1016–1026 (1970)
Ghosh, M., Mukhopadhyay, N.: On two fundamental problems of sequential estimation. Sankhyā Ser. B 38, 203–218 (1976)
Ghosh, M., Mukhopadhyay, N.: Consistency and asymptotic efficiency of two-stage and sequential procedures. Sankhyā Ser. A 43, 220–227 (1981)
Ghosh, M., Mukhopadhyay, N., Sen, P.K.: Sequential Estimation. Wiley, New York (1997)
Ghosh, B.K., Sen, P.K.: Handbook of Sequential Analysis, edited volume. Dekker, New York (1991)
Henke, N., Bughin, J., Chui, M., Manyika, J., Saleh, T., Wiseman, B., Sethupathy, G.: The age of analytics: competing in a data-driven world. McKinsey Global Institute report (2016). https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insights/the-age-of-analytics-competing-in-a-data-driven-world
Jurečkovā, J., Sen, P.K.: Robust Statistical Procedures. Wiley, New York (1996)
Lai, T.L., Siegmund, D.: A nonlinear renewal theory with applications to sequential analysis I. Ann. Stat. 5, 946–954 (1977)
Lai, T.L., Siegmund, D.: A nonlinear renewal theory with applications to sequential analysis II. Ann. Stat. 7, 60–76 (1979)
Lombard, F., Swanepoel, J.W.H.: On finite and infinite confidence sequences. South African Stat. J. 12, 1–24 (1978)
Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., Byers, A.H.: Big data: the next frontier for innovation, competition, and productivity. McKinsey Global Institute report (2011). https://www.mckinsey.com/business-functions/digital-mckinsey/our-insights/big-data-the-next-frontier-for-innovation
Mukhopadhyay, N.: Sequential estimation of location parameters in exponential distributions. Calcutta Statist. Assoc. Bull. 23, 85–93 (1974)
Mukhopadhyay, N.: A consistent and asymptotically efficient two-stage procedure to construct fixed-width confidence intervals for the mean. Metrika 27, 281–284 (1980)
Mukhopadhyay, N.: A study of the asymptotic regret while estimating the location of an exponential distribution. Calcutta Statist. Assoc. Bull. 31, 207–213 (1982)
Mukhopadhyay, N.: Sequential estimation problems for negative exponential populations. Commun. Stat. Theory Methods Ser. A 17, 2471–2506 (1988)
Mukhopadhyay, N.: Two-stage and multi-stage estimation. In: Balakrishnan, N., Basu, A.P. (eds.) The Exponential Distribution: Theory, Methods and Application, Chapter 26, pp. 429–452. Gordon and Breach, Amsterdam (1995)
Mukhopadhyay, N.: Probability and Statistical Inference. Dekker, New York (2000)
Mukhopadhyay, N., Datta, S.: On fine-tuning a purely sequential procedure and the associated second-order properties. Sankhyā Ser. A 57, 100–117 (1995)
Mukhopadhyay, N., de Silva, B.M.: Sequential Methods and Their Applications. Chapman & Hall/CRC, Boca Raton (2009)
Mukhopadhyay, N., Sen, P.K.: Replicated piecewise stop** numbers and sequential analysis. Seq. Anal. 12, 179–197 (1993)
Mukhopadhyay, N., Solanky, T.K.S.: Multistage Selection and Ranking Procedures. Dekker, New York (1994)
Mukhopadhyay, N., Vik, G.: Asymptotic results for stop** times based on U-statistics. Seq. Anal. 4, 83–110 (1985)
Mukhopadhyay, N., Zhang, C.: EDA on the asymptotic normality of the standardized sequential stop** times, part-I: parametric models. Seq. Anal. 37, 342–374 (2018)
Mukhopadhyay, N., Zhang, C.: EDA on the asymptotic normality of the standardized sequential stop** times, part-II: distribution-free models. Seq. Anal. 39, 367–398 (2020)
Ray, W.D.: Sequential confidence intervals for the mean of a normal population with unknown variance. J. R. Stat. Soc. Ser. B 19, 133–143 (1957)
Robbins, H.: Sequential estimation of the mean of a normal population. In: Cramér volume, H., Grenander, U. (eds.) Probability and Statistics, pp. 235–245. Almquist and Wiksell, Uppsala (1959)
Sen, P.K.: Sequential Nonparametrics. Wiley, New York (1981)
Siegmund, D.: Sequential Analysis: Tests and Confidence Intervals. Springer, New York (1985)
Starr, N.: The performance of a sequential procedure for fixed-width interval estimate. Ann. Math. Stat. 36, 36–50 (1966)
Starr, N.: On the asymptotic efficiency of a sequential procedure for estimating the mean. Ann. Math. Stat. 37, 1173–1185 (1966)
Starr, N., Woodroofe, M.: Remarks on sequential point estimation. Proc. Natl. Acad. Sci. USA 63, 285–288 (1969)
Stein, C.: A Two sample test for a linear hypothesis whose power is independent of the variance. Ann. Math. Stat. 16, 243–258 (1945)
Stein, C.: Some problems in sequential estimation (abstract). Econometrica 17, 77–78 (1949)
Steland, A., Chang, Y.-T.: High-confidence nonparametric fixed-width uncertainty intervals and applications to projected high-dimensional data and common mean estimation. Seq. Anal. 40, 97–124 (2021)
Swanepoel, J.W.H., van Wyk, J.W.J.: Fixed width confidence intervals for the location parameter of an exponential distribution. Commun. Stat. Theory Methods 11, 1279–1289 (1982)
Woodroofe, M.: Second order approximations for sequential point and interval estimation. Ann. Stat. 5, 984–995 (1977)
Woodroofe, M.: Nonlinear Renewal Theory in Sequential Analysis, CBMS #39. SIAM, Philadelphia (1982)
Acknowledgements
We remain indebted to Professor Ansgar Steland and the referees for critically evaluating this invited contribution. Their feedback has improved an original version of our work. We take this opportunity to thank them.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mukhopadhyay, N., Zhang, C. (2022). Fine-Tuned Parallel Piecewise Sequential Confidence Interval and Point Estimation Strategies for the Mean of a Normal Population: Big Data Context. In: Steland, A., Tsui, KL. (eds) Artificial Intelligence, Big Data and Data Science in Statistics. Springer, Cham. https://doi.org/10.1007/978-3-031-07155-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-07155-3_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-07154-6
Online ISBN: 978-3-031-07155-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)