DataFusion

DataFusion is an extensible framework for planning and executing SQL queries.

Query Interface

SQL

Data Model

Relational

System Architecture

Shared-Everything

Parallel Execution

Intra-Operator (Horizontal)

Storage Format

Apache Parquet Apache Arrow