Case Study of Handling Scientific Queries on Very Large Datasets: The SDSS Science Archive

Thakar, Aniruddha R.; Kunszt, Peter Z.; Szalay, Alexander S.

doi:10.1007/10849171_82

Part of the book series: ESO Astrophysics Symposia (ESO)

Abstract.

The SDSS Science Archive (SX) was designed to enable scientific data mining and interactive data exploration on the terabyte scale. It consists of a distributed object-oriented database that is accessible via a client-server interface. The lightweight SX GUI client can be run on any platform. SDSS queries are formulated in SXQL, an SQL-like query language with some object-oriented and astronomy extensions. The SX server combines a fully multithreaded query engine with a distributed parallel architecture, splitting the data among multiple hosts and allowing for parallel, scalable I/O and parallel data analysis. Each query is parsed into a query execution tree which is executed in parallel. Data on remote partitions are accessed in parallel locally by remote slave servers. This distributed and multithreaded design allows query execution to be optimized and dynamically load-balanced for any type of multi-processor architecture, from SMP machines to Beowulf-type clusters.
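
The execution model described in the abstract (a query turned into a tree whose leaves scan separate data partitions in parallel) can be illustrated with a minimal Python sketch. This is not the SX server's implementation and it does not parse SXQL: the names PARTITIONS, Scan, Union and run_query are hypothetical, the sample rows are made up for illustration, and in-memory lists with a thread pool stand in for remote hosts and their slave servers.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict, List

Row = Dict[str, float]

# Hypothetical partitions: each list stands in for the data held by one host.
# The values are invented sample rows, not SDSS data.
PARTITIONS: List[List[Row]] = [
    [{"objid": 1, "ra": 10.0, "mag": 17.2}, {"objid": 2, "ra": 11.5, "mag": 21.0}],
    [{"objid": 3, "ra": 200.3, "mag": 16.1}],
    [{"objid": 4, "ra": 310.9, "mag": 22.4}, {"objid": 5, "ra": 311.0, "mag": 15.8}],
]


@dataclass
class Scan:
    """Leaf node: filters one partition, as a server local to that data would."""
    partition: List[Row]
    predicate: Callable[[Row], bool]

    def execute(self) -> List[Row]:
        return [row for row in self.partition if self.predicate(row)]


@dataclass
class Union:
    """Inner node: runs its child scans in parallel and concatenates the results."""
    children: List[Scan]

    def execute(self, pool: ThreadPoolExecutor) -> List[Row]:
        # Submit every leaf scan at once so the partitions are read concurrently.
        futures = [pool.submit(child.execute) for child in self.children]
        rows: List[Row] = []
        for future in futures:  # collect in partition order for a stable result
            rows.extend(future.result())
        return rows


def run_query(predicate: Callable[[Row], bool], workers: int = 4) -> List[Row]:
    """Build a two-level execution tree (one scan per partition) and run it."""
    tree = Union([Scan(partition, predicate) for partition in PARTITIONS])
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return tree.execute(pool)


if __name__ == "__main__":
    bright = run_query(lambda row: row["mag"] < 18.0)
    print([row["objid"] for row in bright])
```

In this sketch each leaf touches only its own partition, so adding partitions adds independent units of work; a real distributed system would additionally need network transport, query parsing, and load balancing, none of which are modelled here.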

Author information

Authors and Affiliations

The Johns Hopkins University, Baltimore, MD 21218, USA
Aniruddha R. Thakar, Peter Z. Kunszt & Alexander S. Szalay

Editor information

Anthony J. Banday, Saleem Zaroubi, Matthias Bartelmann

About this paper

Cite this paper

Thakar, A.R., Kunszt, P.Z., Szalay, A.S. Case Study of Handling Scientific Queries on Very Large Datasets: The SDSS Science Archive. In: Banday, A.J., Zaroubi, S., Bartelmann, M. (eds) Mining the Sky. ESO ASTROPHYSICS SYMPOSIA. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10849171_82

DOI: https://doi.org/10.1007/10849171_82
Published: 08 October 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42468-0
Online ISBN: 978-3-540-44665-1
eBook Packages: Springer Book Archive
