Apache Druid for Data Engineers (Hands-On)
Bigdata Engineer
2:23:29
Description
Learn everything about Apache Druid a modern real-time analytics database.
What You'll Learn?
- Understanding of basic architecture of Apache Druid
- Installing and Configuring Apache Druid
- Apache Druid Design, Ingestion, Data management, Querying
- Frequently asked Questions
Who is this for?
What You Need to Know?
More details
DescriptionDruid is a high-performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.
Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.
Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.
One of the most valuable technology skills is the ability to Real-time analytics databases handle analytics on large amounts of data by optimizing resources to enable compute-heavy workloads, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Duid! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Apache Druid!
Apache Druid Essentials: Unleashing Real-time Analytics and Scalable Data Exploration
Unlock the potential of real-time analytics and scalable data exploration with our comprehensive Apache Druid Essentials course. In this dynamic program, participants will delve into the world of Apache Druid, an open-source, high-performance analytics database designed for fast query response and seamless scalability.
Key Learning Objectives:
Introduction to Course
Real-time Analytics Databases
What is Apache Druid?
Key Features of Druid
Technology
Use cases
When to use Druid
When not to use Druid
List of Company using Apache Druid
Installation of Apache Druid
Start up Druid services
Open the web console
Load data
Query data
Overview of the Druid Web Console
Architecture of Druid
Druid Servers
External Dependencies
Storage Design
Datasources and Segments
Segment Identifiers
Segments
Introduction to Segments
Segment File Structure
Data Loading in Druid
Load Data from Local Files
Load Data from URI
Load Data from Kafka (Prerequisite Introduction to Kafka)
Installing Single Node Kafka Cluster
Change the following to avoid Zookeeper Issue conflict
Load Data from Kafka
Query Data Explain Plan
Aggregate data with rollup
Frequently Asked Questions
Who this course is for:
- Database Engineer, Big Data Engineer, Data Engineer, Data Analyst, Data Scientist, Machine Learning Engineer
Druid is a high-performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.
Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.
Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.
One of the most valuable technology skills is the ability to Real-time analytics databases handle analytics on large amounts of data by optimizing resources to enable compute-heavy workloads, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Duid! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Apache Druid!
Apache Druid Essentials: Unleashing Real-time Analytics and Scalable Data Exploration
Unlock the potential of real-time analytics and scalable data exploration with our comprehensive Apache Druid Essentials course. In this dynamic program, participants will delve into the world of Apache Druid, an open-source, high-performance analytics database designed for fast query response and seamless scalability.
Key Learning Objectives:
Introduction to Course
Real-time Analytics Databases
What is Apache Druid?
Key Features of Druid
Technology
Use cases
When to use Druid
When not to use Druid
List of Company using Apache Druid
Installation of Apache Druid
Start up Druid services
Open the web console
Load data
Query data
Overview of the Druid Web Console
Architecture of Druid
Druid Servers
External Dependencies
Storage Design
Datasources and Segments
Segment Identifiers
Segments
Introduction to Segments
Segment File Structure
Data Loading in Druid
Load Data from Local Files
Load Data from URI
Load Data from Kafka (Prerequisite Introduction to Kafka)
Installing Single Node Kafka Cluster
Change the following to avoid Zookeeper Issue conflict
Load Data from Kafka
Query Data Explain Plan
Aggregate data with rollup
Frequently Asked Questions
Who this course is for:
- Database Engineer, Big Data Engineer, Data Engineer, Data Analyst, Data Scientist, Machine Learning Engineer
User Reviews
Rating
Bigdata Engineer
Instructor's Courses
Udemy
View courses Udemy- language english
- Training sessions 32
- duration 2:23:29
- Release Date 2024/03/01