Hydra – A Federated Data Repository over NDN

by   Justin Presley, et al.

Today's big data science communities manage their data publication and replication at the application layer. These communities utilize myriad mechanisms to publish, discover, and retrieve datasets - the result is an ecosystem of either centralized, or otherwise a collection of ad-hoc data repositories. Publishing datasets to centralized repositories can be process-intensive, and those repositories do not accept all datasets. The ad-hoc repositories are difficult to find and utilize due to differences in data names, metadata standards, and access methods. To address the problem of scientific data publication and storage, we have designed Hydra, a secure, distributed, and decentralized data repository made of a loose federation of storage servers (nodes) provided by user communities. Hydra runs over Named Data Networking (NDN) and utilizes the State Vector Sync (SVS) protocol that lets individual nodes maintain a "global view" of the system. Hydra provides a scalable and resilient data retrieval service, with data distribution scalability achieved via NDN's built-in data anycast and in-network caching and resiliency against individual server failures through automated failure detection and maintaining a specific degree of replication. Hydra utilizes "Favor", a locally calculated numerical value to decide which nodes will replicate a file. Finally, Hydra utilizes data-centric security for data publication and node authentication. Hydra uses a Network Operation Center (NOC) to bootstrap trust in Hydra nodes and data publishers. The NOC distributes user and node certificates and performs the proof-of-possession challenges. This technical report serves as the reference for Hydra. It outlines the design decisions, the rationale behind them, the functional modules, and the protocol specifications.


page 1

page 2

page 3

page 4


Adaptive Group-based Zero Knowledge Proof-Authentication Protocol (AGZKP-AP) in Vehicular Ad Hoc Networks

Vehicular Ad Hoc Networks (VANETs) are a particular subclass of mobile a...

Cheat-Proof Communication through Cluster Head (C3H) in Mobile Ad Hoc Network

The mobile ad hoc network (MANET) is a wireless network based on a group...

Socially-Aware Congestion Control in Ad-Hoc Networks: Current Status and The Way Forward

Ad-hoc social networks (ASNETs) represent a special type of traditional ...

Automatic Detection of Node-Replication Attack in Vehicular Ad-hoc Networks

Recent advances in smart cities applications enforce security threads su...

BATMAN : plate-forme blockchain pour l'authentification et la confiance dans les WSNs

Wireless Sensor networks (WSN) today suffer from a lack of security adap...

On Data-centric Forwarding in Mobile Ad-hoc Networks: Baseline Design and Simulation Analysis

IP networking deals with end-to-end communication where the network laye...

Dataset Factory: A Toolchain For Generative Computer Vision Datasets

Generative AI workflows heavily rely on data-centric tasks - such as fil...

Please sign up or login with your details

Forgot password? Click here to reset