Big Data Analytics: Views from Statistical and Computational Perspectives

Pyne, Saumyadipta; Prakasa Rao, B. L. S.; Rao, S. B.

doi:10.1007/978-81-322-3628-3_1

Abstract

Big data is, without doubt, the most discussed current trend in computer science and statistics. It means different things to different people. For the statistician, the issue is how to extract usable information from datasets that are too large and complex for many traditional or classical methods to handle. For the computer scientist, big data poses problems of data storage and management, communication, and computation. For the citizen, big data raises questions of privacy and confidentiality. This introductory chapter touches on some key aspects of big data and its analysis. Far from being an exhaustive overview of this fast-emerging field, it is a discussion of statistical and computational views that the authors owe to many researchers, organizations, and online sources.

Author information

Authors and Affiliations

Indian Institute of Public Health, Hyderabad, India
Saumyadipta Pyne
C.R. Rao Advanced Institute of Mathematics, Statistics and Computer Science, Hyderabad, India
B. L. S. Prakasa Rao & S. B. Rao

Corresponding author

Correspondence to Saumyadipta Pyne.

Editor information

Editors and Affiliations

Indian Institute of Public Health, Hyderabad, India
Saumyadipta Pyne
CRRao AIMSCS, University of Hyderabad Campus, Hyderabad, India
B. L. S. Prakasa Rao
CRRao AIMSCS, University of Hyderabad Campus, Hyderabad, India
S. B. Rao

About this chapter

Cite this chapter

Pyne, S., Prakasa Rao, B.L.S., Rao, S.B. (2016). Big Data Analytics: Views from Statistical and Computational Perspectives. In: Pyne, S., Rao, B., Rao, S. (eds) Big Data Analytics. Springer, New Delhi. https://doi.org/10.1007/978-81-322-3628-3_1

DOI: https://doi.org/10.1007/978-81-322-3628-3_1
Published: 13 October 2016
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-3626-9
Online ISBN: 978-81-322-3628-3
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
