Apache GraphAr

An open source, standard data file format for graph data storage and retrieval

Efficient format design

GraphAr is designed for efficient storage and retrieval of large-scale graph data, using techniques such as chunking, columnar storage, and preservation of CSR/CSC (compressed sparse row/column) semantics.
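
As a rough illustration of what CSR semantics and chunking mean, the following minimal sketch builds a CSR offset array from an edge list and splits the neighbor column into fixed-size chunks. It is plain Python and does not use the GraphAr libraries; the function names (`build_csr`, `split_into_chunks`, `out_neighbors`) and the chunk size are hypothetical, chosen only for this example.

```python
from typing import List, Tuple


def build_csr(num_vertices: int, edges: List[Tuple[int, int]]):
    """Return (offsets, neighbors) with edges ordered by source vertex.

    offsets[v]..offsets[v + 1] delimits the out-neighbors of vertex v.
    """
    sorted_edges = sorted(edges)
    offsets = [0] * (num_vertices + 1)
    # Count the out-degree of each source vertex.
    for src, _ in sorted_edges:
        offsets[src + 1] += 1
    # Turn the counts into a running prefix sum.
    for v in range(num_vertices):
        offsets[v + 1] += offsets[v]
    neighbors = [dst for _, dst in sorted_edges]
    return offsets, neighbors


def split_into_chunks(values: List[int], chunk_size: int) -> List[List[int]]:
    """Split one column into fixed-size chunks (the last may be shorter)."""
    return [values[i:i + chunk_size] for i in range(0, len(values), chunk_size)]


def out_neighbors(offsets: List[int], neighbors: List[int], v: int) -> List[int]:
    """Look up the out-neighbors of v using only the offset array."""
    return neighbors[offsets[v]:offsets[v + 1]]


edges = [(0, 1), (0, 2), (1, 2), (3, 0)]
offsets, neighbors = build_csr(4, edges)
chunks = split_into_chunks(neighbors, chunk_size=2)
```

Sorting by destination instead of source would give the CSC counterpart, which serves incoming-edge lookups in the same way.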

Out-of-core queries

GraphAr is designed for out-of-core scenarios: large-scale graphs can be stored and queried without being loaded fully into memory, for example when the data resides in a data lake.
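
The idea behind out-of-core access can be sketched as follows: adjacency data is written as separate chunk files, and a query opens only the chunk that contains the requested vertex. This is a toy example under stated assumptions, not GraphAr's actual file layout or API; the JSON encoding, the `chunk<N>.json` naming, and the helper names are all hypothetical.

```python
import json
import os
import tempfile
from typing import List

CHUNK_SIZE = 2  # vertices per chunk file (hypothetical value)


def write_chunks(directory: str, adjacency: List[List[int]]) -> None:
    """Write adjacency lists to disk, CHUNK_SIZE vertices per file."""
    for start in range(0, len(adjacency), CHUNK_SIZE):
        path = os.path.join(directory, f"chunk{start // CHUNK_SIZE}.json")
        with open(path, "w") as f:
            json.dump(adjacency[start:start + CHUNK_SIZE], f)


def read_neighbors(directory: str, v: int) -> List[int]:
    """Load only the chunk file holding vertex v and return its neighbors."""
    path = os.path.join(directory, f"chunk{v // CHUNK_SIZE}.json")
    with open(path) as f:
        chunk = json.load(f)
    return chunk[v % CHUNK_SIZE]


def demo():
    # adjacency[v] is the list of out-neighbors of vertex v.
    adjacency = [[1, 2], [2], [], [0]]
    with tempfile.TemporaryDirectory() as directory:
        write_chunks(directory, adjacency)
        files = sorted(os.listdir(directory))
        return files, read_neighbors(directory, 3)
```

Because each lookup touches a single chunk file, memory use depends on the chunk size rather than on the size of the whole graph.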

Cross-language support

GraphAr provides libraries in C++, Java, Scala (with Spark), and Python (with PySpark) for generating, accessing, and transforming files in the GraphAr format.