We’ve been using Apache Druid for over 5 years, to provide customers with real-time analytics tools for various use-cases, including in-flight analytics, reporting and building target audiences.
The common challenge of these use-cases is counting distinct elements in real-time at scale, and we will show why Druid is a great tool for that.
In this talk, we will also share some of the best practices and tips we’ve gathered over the years.
We will cover the following topics:
- Data modeling
- Retention and deletion
- Query optimization