Projects

Interactive Graph

Interactive Graph provides a web-based interactive framework for operating on large graph data, which may come from a GSON file or an online Neo4j graph database.


PandaDB Graph Database

PandaDB is a high-performance graph database that provides Cypher support, allowing efficient querying and manipulation of large, complex graph datasets.


Lynx Graph Query Framework

Lynx Graph Query Engine

Lynx is a general graph query framework to simplify querying graph data by converting complex statements into basic graph operations.


PiFlow

PiFlow is an easy-to-use, highly scalable, and powerful big data pipeline system built on the distributed computing framework Spark.


Publications

Lynx: A Graph Query Framework for Multiple Heterogeneous Data Sources [VLDB 2023]

Lynx is a flexible graph query framework designed to streamline the process of querying graph data across diverse data sources. By abstracting complex query statements into basic graph operations, Lynx allows developers to retrieve data through user-defined interfaces rather than direct connections to data sources. This approach simplifies the integration of heterogeneous data sources, offering a robust and generic foundation for building graph query engines.
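The idea of answering queries through a user-defined interface rather than a direct connection can be illustrated with a minimal sketch. Everything below is hypothetical and written for illustration only: `GraphSource`, `InMemorySource`, `scan`, `expand`, and `friends_of` are assumed names, not Lynx's actual API.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, Iterable, List, Tuple

Node = Dict[str, Any]        # e.g. {"id": 1, "label": "Person", "name": "Ann"}
Edge = Tuple[int, str, int]  # (source id, relationship type, target id)


class GraphSource(ABC):
    """Hypothetical user-defined interface; the query layer sees nothing else."""

    @abstractmethod
    def nodes(self) -> Iterable[Node]: ...

    @abstractmethod
    def edges(self) -> Iterable[Edge]: ...


class InMemorySource(GraphSource):
    """One possible backend; a CSV file or a remote store could implement the same interface."""

    def __init__(self, nodes: List[Node], edges: List[Edge]) -> None:
        self._nodes = list(nodes)
        self._edges = list(edges)

    def nodes(self) -> Iterable[Node]:
        return iter(self._nodes)

    def edges(self) -> Iterable[Edge]:
        return iter(self._edges)


def scan(source: GraphSource, label: str) -> List[Node]:
    """Basic operation 1: return all nodes carrying the given label."""
    return [n for n in source.nodes() if n["label"] == label]


def expand(source: GraphSource, start: List[Node], rel_type: str) -> List[Node]:
    """Basic operation 2: follow outgoing edges of one type from the start nodes."""
    by_id = {n["id"]: n for n in source.nodes()}
    start_ids = {n["id"] for n in start}
    return [
        by_id[dst]
        for src, rel, dst in source.edges()
        if rel == rel_type and src in start_ids and dst in by_id
    ]


def friends_of(source: GraphSource, name: str) -> List[str]:
    """Roughly MATCH (p:Person {name: $name})-[:KNOWS]->(f) RETURN f.name,
    expressed only through the two basic operations above."""
    people = [p for p in scan(source, "Person") if p["name"] == name]
    return sorted(f["name"] for f in expand(source, people, "KNOWS"))
```

Because `friends_of` depends only on `scan` and `expand`, swapping `InMemorySource` for another `GraphSource` implementation would leave the query logic unchanged.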



The structure of PandaDB.

PandaDB: An AI-Native Graph Database for Unified Managing Structured and Unstructured Data [DASFAA 2023]

The paper addresses the increasing demand for handling hybrid queries in graph databases, especially in applications like social networks and smart cities. PandaDB, the proposed AI-native graph database, offers a unified approach to managing both structured and unstructured data. Key features include the online extraction and indexing of semantic information from unstructured data, as well as optimization strategies for hybrid queries.


A Key-Value Based Approach to Scalable Graph Database [DEXA 2023]

This research introduces KVGDB, a lightweight and scalable graph database designed to handle varying scales of graph data, from small to large-scale datasets.


Built on key-value storage and implemented using RocksDB, KVGDB efficiently maps graph data to key-value structures, enabling effective management and query processing across different scales.


The database supports both embedded usage and distributed deployment across environments, making it adaptable to a wide range of applications.
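As a rough illustration of mapping graph data to key-value structures, the sketch below stores nodes and edges under string keys and answers a neighbor lookup with a prefix scan. It is an assumption-laden toy: a plain Python dict stands in for RocksDB, and the key layout (`n:<id>`, `e:<src>:<type>:<dst>`) and the `KVGraph` class are hypothetical, not KVGDB's actual design.

```python
import json
from typing import Any, Dict, List, Optional


class KVGraph:
    """Toy graph store on top of a key-value map (a dict stands in for RocksDB)."""

    def __init__(self) -> None:
        self._kv: Dict[bytes, bytes] = {}

    def put_node(self, node_id: int, props: Dict[str, Any]) -> None:
        # Hypothetical key layout: n:<id> -> JSON-encoded properties
        self._kv[f"n:{node_id}".encode()] = json.dumps(props).encode()

    def put_edge(self, src: int, rel_type: str, dst: int,
                 props: Optional[Dict[str, Any]] = None) -> None:
        # Hypothetical key layout: e:<src>:<type>:<dst> -> JSON-encoded properties
        key = f"e:{src}:{rel_type}:{dst}".encode()
        self._kv[key] = json.dumps(props or {}).encode()

    def get_node(self, node_id: int) -> Dict[str, Any]:
        return json.loads(self._kv[f"n:{node_id}".encode()])

    def _prefix_scan(self, prefix: bytes) -> List[bytes]:
        # An ordered store would seek to the prefix and iterate;
        # here the keys are simply sorted and filtered.
        return [k for k in sorted(self._kv) if k.startswith(prefix)]

    def neighbors(self, src: int, rel_type: str) -> List[int]:
        """Outgoing neighbors of src over one relationship type, via a prefix scan."""
        prefix = f"e:{src}:{rel_type}:".encode()
        return [int(k[len(prefix):]) for k in self._prefix_scan(prefix)]
```

Keys here are compared as text, so results come back in lexicographic rather than numeric order; a real store would more likely use fixed-width binary ids so that key order matches id order.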

A Model and Query Language for Multi-modal Hybrid Query [SSDBM 2024]

This research tackles the challenge of querying multimodal data, which includes both structured data and unstructured forms such as audio, images, and videos.

The work makes the following contributions:

  • Data Model: We design an extended property graph model that supports representing the semantic information of multimodal data on graphs, and we describe the basic operations required for hybrid queries.
  • Query Language: We design CypherPlus, a user-friendly query language that provides the semantics needed to query multimodal data on graphs.
  • Prototype System: We built a prototype database system based on Neo4j to demonstrate the advantages of the proposed model and query language.

Structure of SciDG.

SciDG: Benchmarking Scientific Dynamic Graph Queries [SSDBM 2023]

This research presents SciDG, a benchmark framework designed to evaluate the performance of graph database systems managing dynamic graph data, which is critical for scientific applications. SciDG assesses how storage structures influence query latency, particularly for version-related queries, and helps developers choose the most effective storage solutions.


BIT: Using Bitmap Index to Speed Up NCBI Taxonomy Computing [SSDBM 2024]

This research introduces BIT, an innovative indexing method aimed at improving the performance of computational tasks in NCBI Taxonomy, widely used in biomedical and ecological research.


BIT encodes tree-like structures into bit-vectors using the Polychotomic encoding algorithm, storing them efficiently in bitmaps. By leveraging parallel bit operations, BIT significantly accelerates tasks such as finding the lowest common ancestor and listing descendants.


Experimental results show that BIT outperforms existing tools, offering a faster solution for managing large-scale taxonomy data.
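To make the bitmap idea concrete, here is a minimal sketch in which each node's bit-vector records its ancestors, so that a lowest common ancestor is found with a single bitwise AND. This is a simplified ancestor-set encoding chosen for illustration, not the Polychotomic encoding used by BIT; the function names and the toy tree are assumptions.

```python
from typing import Dict, List, Optional, Tuple


def build_bitmaps(
    parent: Dict[str, Optional[str]]
) -> Tuple[List[str], Dict[str, int], Dict[str, int]]:
    """Give each node a bit position (ordered by depth) and an ancestor bitmap."""
    depth: Dict[str, int] = {}

    def depth_of(node: str) -> int:
        if node not in depth:
            p = parent[node]
            depth[node] = 0 if p is None else depth_of(p) + 1
        return depth[node]

    # Sorting by depth guarantees a parent always gets a lower bit than its children.
    order = sorted(parent, key=lambda n: (depth_of(n), n))
    bit = {n: i for i, n in enumerate(order)}
    bitmap: Dict[str, int] = {}
    for n in order:
        p = parent[n]
        # A node's bitmap is its own bit plus every bit of its parent's bitmap.
        bitmap[n] = (1 << bit[n]) | (bitmap[p] if p is not None else 0)
    return order, bit, bitmap


def lowest_common_ancestor(a: str, b: str, order: List[str],
                           bitmap: Dict[str, int]) -> str:
    """AND the two bitmaps; the highest set bit is the deepest shared ancestor."""
    common = bitmap[a] & bitmap[b]
    if common == 0:
        raise ValueError("nodes do not share a root")
    return order[common.bit_length() - 1]


def descendants(node: str, order: List[str], bit: Dict[str, int],
                bitmap: Dict[str, int]) -> List[str]:
    """Every other node whose bitmap has this node's bit set lies below it."""
    mask = 1 << bit[node]
    return [n for n in order if n != node and bitmap[n] & mask]


# Toy tree for illustration only (child -> parent); not real NCBI Taxonomy data.
toy_parent = {
    "root": None,
    "Bacteria": "root",
    "Eukaryota": "root",
    "Fungi": "Eukaryota",
    "Metazoa": "Eukaryota",
}
```

With `order, bit, bitmap = build_bitmaps(toy_parent)`, calling `lowest_common_ancestor("Fungi", "Metazoa", order, bitmap)` returns `"Eukaryota"`. This encoding spends one bit per node, which would not scale to a full taxonomy; a compact encoding of the kind described above addresses exactly that cost.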


MMDBench: A Benchmark for Hybrid Query in Multimodal Database [Bench 2023]

MMDBench introduces a comprehensive benchmark program designed to assess the performance of hybrid queries in multimodal databases. The benchmark includes a data generator, a query workload, and a unified integration framework. Specifically targeting social network scenarios, the data generator produces multimodal data, while the query workload replicates typical operations that query both structured and unstructured data.


All of Chuan Hu's Publications

2024
  • A Model and Query Language for Multi-modal Hybrid Query
    Chuan Hu, Zihao Zhao, Along Mao, Zhihong Shen
  • BIT: Using Bitmap Index to Speed Up NCBI Taxonomy Computing
    Chuan Hu, Jiawei Cai, Zihao Zhao, Zhihong Shen
2023
  • Lynx: A Graph Query Framework for Multiple Heterogeneous Data Sources
    Zhihong Shen, Chuan Hu, Zihao Zhao
  • MMDBench: A Benchmark for Hybrid Query in Multimodal Database
    Along Mao, Chuan Hu, Chong Li, Huajin Wang, Junjian Rao, Kainan Wang, Zhihong Shen
  • A Key-Value Based Approach to Scalable Graph Database
    Zihao Zhao, Chuan Hu, Zhihong Shen, Along Mao, Hao Ren
  • S2CTrans: Building a Bridge from SPARQL to Cypher
    Zihao Zhao, Xiaodong Ge, Zhihong Shen, Chuan Hu, Huajin Wang
  • SciDG: Benchmarking Scientific Dynamic Graph Queries
    Chenglin Zeng, Chuan Hu, Huajin Wang, Zhihong Shen
  • PandaDB: An AI-Native Graph Database for Unified Managing Structured and Unstructured Data
    Zihao Zhao, Zhihong Shen, Along Mao, Huajin Wang, Chuan Hu
2022
  • gcCov: Linked open data for global coronavirus studies
    Wenyu Shi, Guomei Fan, Zhihong Shen, Chuan Hu, et al.

Hobbies

Hu Chuan photographs at the Summer Palace.

Photography


Cocktails


Sports