Apache Drill OSX Quickstart

Apache Drill is a really easy to use dremel-based big-data analysis tool. So it's perfect if you have a lot of static data (read-only workloads) and want to use SQL. And the best of all: it does not require Hadoop/HDFS :)

First you need to install Java (in my case it was installed before) and set the JAVA_HOME env variable. I will use the embedded mode, because I don't want to install zookeeper and it's just a single node setup.

cd ~
curl -o apache-drill-1.8.0.tar.gz http://apache.mirrors.hoobly.com/drill/drill-1.8.0/apache-drill-1.8.0.tar.gz
tar -xvf apache-drill-1.8.0.tar.gz
mv apache-drill-1.8.0 apache_drill

Now put the following line into your ~/.bash_profile file and restart your terminal:

export PATH=$PATH:~/apache_drill/bin

To start the service you have to run the following command:

drill-embedded

In embedded mode it will automatically open a drill sql-shell (called sqlline), but you can also run queries by using the online UI (http://localhost:8047).

Drill Explorer

You can also use a nice GUI called Drill Explorer (You have to install the ODBC driver to get it).

Links