FLASc: a formal algebra for labeled property graph schema

作者:Chandan Sharma, Roopak Sinha

摘要

Contemporary labeled property graph databases are either schema-less or schema-optional to support frequent changes in the structure of data found in domains requiring high flexibility. However, the lack of structure impacts data transformation and loading operations from heterogeneous sources into graph databases. We present a formal algebra FLASc for specifying and generating graph schema for labeled property graph databases. We formally define FLASc and demonstrate the use of FLASc generated graph schemas to systematically transform and load data-sets related to domains of cyber-physical systems, big data analytics and tourism. Findings from three disparate case studies show that FLASc-generated schemas assist in enforcing integrity constraints that reduce the chance of data corruption, hence assuring data consistency and integrity.

论文关键词:Graph schema, Labeled property graph databases, ETL, Data transformation and loading, Neo4j, Cypher

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10515-022-00336-y