AstroServ: Distributed Database for Serving Large-Scale Full Life-Cycle Astronomical Data

11/27/2018
by   Chen Yang, et al.
0

In time-domain astronomy, STLF (Short-Timescale and Large Field-of-view) sky survey is the latest way of sky observation. Compared to traditional sky survey who can only find astronomical phenomena, STLF sky survey can even reveal how short astronomical phenomena evolve. The difference does not only lead the new survey data but also the new analysis style. It requires that database behind STLF sky survey should support continuous analysis on data streaming, real-time analysis on short-term data and complex analysis on long-term historical data. In addition, both insertion and query latencies have strict requirements to ensure that scientific phenomena can be discovered. However, the existing databases cannot support our scenario. In this paper, we propose AstroServ, a distributed system for analysis and management of large-scale and full life-cycle astronomical data. AstroServ's core components include three data service layers and a query engine. Each data service layer serves for a specific time period of data and query engine can provide the uniform analysis interface on different data. In addition, we also provide many applications including interactive analysis interface and data mining tool to help scientists efficiently use data. The experimental results show that AstroServ can meet the strict performance requirements and the good recognition accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/27/2018

Cloud based Real-Time and Low Latency Scientific Event Analysis

Astronomy is well recognized as big data driven science. As the novel ob...
research
12/10/2019

Ledgerdata Refiner: A Powerful Ledger Data Query Platform for Hyperledger Fabric

Blockchain is one of the most popular distributed ledger technologies. I...
research
09/30/2021

A Survey of Selected Algorithms Used in Military Applications from the Viewpoints of Dataflow and GaAs

This is a short survey of ten algorithms that are often used for militar...
research
08/22/2017

Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management

Spreadsheet software is the tool of choice for interactive ad-hoc data m...
research
02/21/2019

A Comprehensive Survey of Interface Protocols for Software Defined Networks

Software Defined Networks has seen tremendous growth and deployment in d...
research
07/22/2020

Detecting Quality Problems in Research Data: A Model-Driven Approach

As scientific progress highly depends on the quality of research data, t...
research
05/19/2023

Channel Cycle Time: A New Measure of Short-term Fairness

This paper puts forth a new metric, dubbed channel cycle time, to measur...

Please sign up or login with your details

Forgot password? Click here to reset