Apache Druid | Apache® Druid
Skip to main content

Apache® Druid

A high performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.

Download Join Slack GitHub

Overview

Sub-second queries at any scale

Execute OLAP queries in milliseconds on high-cardinality and high-dimensional data sets with billions to trillions of rows without pre-defining or caching queries in advance.

High concurrency at the lowest cost

Build real-time analytics applications that supports 100s to 100,000s queries per second at consistent performance with a highly efficient architecture that uses less infrastructure than other databases.

Real-time and historical insights

Unlock streaming data potential through Druid's native integration with Apache Kafka and Amazon Kinesis as it supports query-on-arrival at millions of events per second, low latency ingestion, and guaranteed consistency.

Key Druid Features

Interactive Query Engine

Druid utilizes scatter/gather for high speed queries with data preloaded into memory or local storage to avoid data movement and network latency.

Tiering & QoS

Configurable tiering with quality of service enables the ideal price-performance for mixed workloads, guarantees priority, and avoids resource contention.

Optimized Data Format

Ingested data is automatically columnarized, time-indexed, dictionary-encoded, bitmap-indexed, and type-aware compressed.

Elastic Architecture

Loosely coupled components for ingestion, queries, and orchestration combined with a deep storage layer enable easy & quick scale-up & scale-out.

True Stream Ingestion

A connector-free integration with streaming platforms enables query-on-arrival, high scalability, low latency, and guaranteed consistency.

Non-stop Reliability

Automatic data services including continuous backup, automated recovery, and multi-node replication ensure high availability and durability.

Schema Auto-Discovery

Druid can automatically detect, define, and update column names and data types upon ingestion, providing the ease of schemaless and the performance of strongly typed schemas.

Flexible Joins Support

Druid supports join operations during data ingestion and at query-time execution, with the fastest query performance when tables are pre-joined during ingestion.

SQL Support

Developers and analysts can easily use the familiar SQL API for end-to-end data operations across ingestion, transformation, and querying.


Learn more

Powered By

Druid is proven in production at the world's leading companies at massive scale.

FAQ

Learn about some of the most common questions about Druid.

Quickstart

Get started with Druid in minutes. Load your own data and query it.

Get Help

Get help from a wide network of community members about using Druid.

Podcast

Hear from the Druid community on Apple, Spotify, and YouTube Music.