How to Install Druid on Ubuntu Server Latest
Introduction
Druid is an open-source, distributed data store designed for easy aggregation and exploration of large-scale data sets. It provides fast querying, real-time data ingestion, and supports advanced analytics including machine learning. In this tutorial, we will guide you through the process of installing Druid on Ubuntu Server latest edition.
Prerequisites
Before we proceed with the installation of Druid, we need to ensure that our Ubuntu Server is up-to-date with the latest software packages. Open the terminal and run the following command:
sudo apt-get update && sudo apt-get upgrade -y
Step 1: Install Java
Druid is written in Java, and therefore requires Java to be installed on our system. We will install OpenJDK 8, which is the latest stable version of Java.
Run the following command to install OpenJDK 8:
sudo apt-get install openjdk-8-jdk-headless
Step 2: Download and Extract Druid
To download the latest version of Druid, go to the official Druid website (http://druid.io/) and click on the "Download" button. Alternatively, you can use the following command to download it:
wget https://downloads.apache.org/druid/0.22.1/apache-druid-0.22.1-bin.tar.gz
Once the download is complete, extract the downloaded file by running the following command:
tar -xzf apache-druid-0.22.1-bin.tar.gz
This will create a new directory with the name "apache-druid-0.22.1".
Step 3: Configure Druid
Druid requires some configuration before it can be run. We will start by configuring the ZooKeeper cluster, which is used by Druid to coordinate communication between its different nodes.
Create a new directory for ZooKeeper:
mkdir -p ~/zookeeper/data ~/zookeeper/logs
Create the ZooKeeper configuration file:
nano ~/zookeeper/zoo.cfg
Add the following lines to the file:
tickTime=2000
dataDir=/home/<username>/zookeeper/data/
dataLogDir=/home/<username>/zookeeper/logs/
clientPort=2181
initLimit=5
syncLimit=2
server.1=localhost:2888:3888
Note: Replace <username> with your system username.
Next, we need to configure the Druid cluster by editing the "common.runtime.properties" file:
nano apache-druid-0.22.1/conf/druid/_common/common.runtime.properties
Add the following lines to the file:
druid.zk.service.host=localhost:2181
druid.zk.paths.base=/druid
Save and close the file.
Step 4: Start Druid
To start Druid, we need to run a set of scripts in the following order:
- Start ZooKeeper:
~/apache-druid-0.22.1/bin/start-micro-quickstart
- Start the Coordinator:
~/apache-druid-0.22.1/bin/start-micro-quickstart coordinator
- Start the Overlord:
~/apache-druid-0.22.1/bin/start-micro-quickstart overlord
- Start the Historical node:
~/apache-druid-0.22.1/bin/start-micro-quickstart historical
- Start the Broker:
~/apache-druid-0.22.1/bin/start-micro-quickstart broker
- Start the Realtime node:
~/apache-druid-0.22.1/bin/start-micro-quickstart realtime
Wait for a few seconds for Druid to start up. You can check the status of Druid by visiting the following URL in your web browser:
http://localhost:8081/
Conclusion
Congratulations! You have successfully installed Druid on your Ubuntu Server. Now you can start exploring the advanced data analytics and querying capabilities of this powerful software!