Skip to main content

DIANA: Data Interface All-iN-A-place for Big Data

Wang, Frank Z. (2014) DIANA: Data Interface All-iN-A-place for Big Data. In: IEEE Big Data Science and Engineering, 24-26 September 2014, Tsinghua.


“Variety” in Big Data means we have a wide range of data types and sources: e.g. file systems and database systems co-exist for decades as two popular data-accessing interfaces. This work is to unify these two interfaces by presenting a Data Interface All-iN-A-place (DIANA). The first challenge lies in distinguishing structured and un-structured data and diverting them to different underlying platforms. It is demonstrated that a speedup of 5000 in indexing has been achieved at the expense of a slowdown of 100 in extracting attributes. A DIANA-based cloud storage system is constructed for versatile, long distance and large volume big data accessing operations to address “Volume” and “Velocity” in Big Data. It encapsulates a dynamic multi-stream/multi-path engine at the socket level, which conforms to Portable Operating System Interface (POSIX).

Item Type: Conference or workshop item (Paper)
Uncontrolled keywords: big data; variety; volume; velocity; file systems; database systems; service-oriented architecture
Subjects: Q Science
Q Science > QA Mathematics (inc Computing science)
Divisions: Faculties > Sciences > School of Computing > Data Science
Depositing User: Frank Wang
Date Deposited: 12 Aug 2014 13:32 UTC
Last Modified: 06 Feb 2020 04:09 UTC
Resource URI: (The current URI for this page, for reference purposes)
Wang, Frank Z.:
  • Depositors only (login required):


Downloads per month over past year