Drill

Drill is a database system designed for Big Data exploration. It is an open-source, distributed SQL query system based on Google's Dremel query system, and its design features a columnar execution engine. Drill is the only distributed SQL engine in the world that does not require schemas. It was designed from the ground up, and it supports many NoSQL databases and file systems, with the ability for a single query to join data from multiple datastores.

History

In 2010, Google published a paper titled "Dremel: Interactive Analysis of Web-Scale Datasets" that described a scalable database system designed for "interactive analysis of nested data". The Dremel system is available today under Google's BigQuery system. Development of Apache Drill began in 2012, with the goal of replicating the capabilities of Dremel. Initial goals of the system included support for multiple storage systems, file formats, query languages, and data sources, as well as the ability to scale over 10,000 servers and process petabytes of data in seconds.

Query Interface

SQL

System Architecture

Shared-Disk

Drill Logo
Website

https://drill.apache.org

Source Code

https://github.com/apache/drill

Tech Docs

https://drill.apache.org/docs/

Developer

Apache Foundation

Country of Origin

US

Start Year

2012

Project Type

Open Source

Written in

Java

Supported languages

SQL

Inspired By

BigQuery

Compatible With

HBase, MongoDB

Operating Systems

Linux, OS X, Windows

Licenses

Apache v2

Wikipedia

https://en.wikipedia.org/wiki/Apache_Drill