Companies Home Search Profile

Hands-On Introduction: Data Engineering

Focused View

Vinoo Ganesh

1:34:50

28 View
  • 01 - Hands-on data engineering.mp4
    00:44
  • 02 - Background knowledge.mp4
    01:52
  • 03 - The history of data in the enterprise.mp4
    03:08
  • 04 - Using GitHub Codespaces with this course.mp4
    03:42
  • 01 - Data engineering and data pipelines.mp4
    03:11
  • 02 - Orchestration In the modern sense.mp4
    04:05
  • 03 - Extract, transform, load (ETL).mp4
    03:33
  • 04 - Tasks, DAGs, and dependencies.mp4
    02:11
  • 01 - Introduction to Airflow.mp4
    03:44
  • 02 - Installing Airflow.mp4
    06:24
  • 03 - Running the Airflow webserver and scheduler.mp4
    04:48
  • 04 - Adjusting Airflow configuration settings.mp4
    06:26
  • 05 - Build a 1 Task DAG.mp4
    04:53
  • 06 - Build a 2 Task DAG.mp4
    04:06
  • 01 - ETL in Airflow.mp4
    02:32
  • 02 - A real-world data engineering task.mp4
    02:15
  • 03 - Extracting data manually.mp4
    02:55
  • 04 - Extracting data with Airflow.mp4
    02:33
  • 05 - Transforming data manually.mp4
    03:26
  • 06 - Transforming data with Airflow.mp4
    03:13
  • 07 - Loading data manually.mp4
    04:14
  • 08 - Loading data with Airflow.mp4
    04:06
  • 09 - Building an ETL DAG with Airflow.mp4
    05:47
  • 10 - Challenge Review ETL questions.mp4
    02:18
  • 11 - Solution Solutions to ETL questions.mp4
    06:04
  • 01 - The future of data engineering.mp4
    02:40
  • Description


    Suggested prerequisites

    • Know basic Python data types, control structures, functions, and classes.
    • Have a good enough understanding of SQL to write queries to extract, transform, and load data in Apache Airflow pipelines.
    • Have some knowledge of Bash script or Unix for basic Airflow installation and administration.
    • Be familiar with text editors.
    • Know some of the basic principles behind cloud computing.

    Projects

    • Author, import, and execute a basic one-task DAG in Airflow: one Python file with one DAG and one task.
    • Author, import, and execute a basic two-task DAG in Airflow, where one task depends on the completion of another task.
    • Build a DAG to analyze top-level domains.

    In this course, instructor Vinoo Ganesh gives you an overview of the fundamental skills you need to become a data engineer. Learn how to solve complex data problems in a scalable, concrete way. Explore the core principles of the data engineer toolkit—including ELT, OLTP/OLAP, orchestration, DAGs, and more—as well as how to set up a local Apache Airflow deployment and full-scale data engineering ETL pipeline. Along the way, Vinoo helps you boost your technical skill set using real-world, hands-on scenarios.

    This course is integrated with GitHub Codespaces, an instant cloud developer environment that offers all the functionality of your favorite IDE without the need for any local machine setup. With GitHub Codespaces, you can get hands-on practice from any machine, at any time—all while using a tool that you’ll likely encounter in the workplace. Check out the “Using GitHub Codespaces with this course” video to learn how to get started.

    More details


    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    Category
    Vinoo Ganesh
    Vinoo Ganesh
    Instructor's Courses
    LinkedIn Learning is an American online learning provider. It provides video courses taught by industry experts in software, creative, and business skills. It is a subsidiary of LinkedIn. All the courses on LinkedIn fall into four categories: Business, Creative, Technology and Certifications. It was founded in 1995 by Lynda Weinman as Lynda.com before being acquired by LinkedIn in 2015. Microsoft acquired LinkedIn in December 2016.
    • language english
    • Training sessions 26
    • duration 1:34:50
    • English subtitles has
    • Release Date 2024/08/12