Case Study of Handling Scientific Queries on Very Large Datasets: The SDSS Science Archive

  • Conference paper
  • First Online:
Mining the Sky

Part of the book series: ESO ASTROPHYSICS SYMPOSIA ((ESO))

  • 57 Accesses

Abstract.

The SDSS Science Archive (SX) was designed to enable scientific data mining and interactive data exploration on the terabyte scale. It consists of a distributed object-oriented database that is accessible via a client-server interface. The lightweight SX GUI client can be run on any platform. SDSS queries are formulated in SXQL, an SQL-like query language with some object-oriented and astronomy extensions. The SX server combines a fully multithreaded query engine with a distributed parallel architecture, splitting the data among multiple hosts and allowing for parallel, scalable I/O and parallel data analysis. Each query is parsed into a query execution tree which is executed in parallel. Data on remote partitions are accessed in parallel locally by remote slave servers. This distributed and multithreaded design allows query execution to be optimized and dynamically load-balanced for any type of multi-processor architecture, from SMP machines to Beowulf-type clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Editor information

Anthony J. Banday Saleem Zaroubi Matthias Bartelmann

Rights and permissions

Reprints and permissions

About this paper

Cite this paper

Thakar, A.R., Kunszt, P.Z., Szalay, A.S. Case Study of Handling Scientific Queries on Very Large Datasets: The SDSS Science Archive. In: Banday, A.J., Zaroubi, S., Bartelmann, M. (eds) Mining the Sky. ESO ASTROPHYSICS SYMPOSIA. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10849171_82

Download citation

  • DOI: https://doi.org/10.1007/10849171_82

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42468-0

  • Online ISBN: 978-3-540-44665-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Navigation