Companies Home Search Profile

Databricks and PySpark for Big Data: From Zero to Expert

Focused View

Data Bootcamp

2:54:02

162 View
  • 1. How to get the most out of this course.html
  • 2. Spark Fundamentals.mp4
    01:37
  • 3. How Apache Spark works.mp4
    01:57
  • 4. Apache Spark ecosystem and official documentation.mp4
    04:59
  • 5. PySpark cluster management and architecture.mp4
    03:38
  • 1. Spark Optimization Techniques.mp4
    02:24
  • 2. Lazy Evaluation.mp4
    01:31
  • 3. Wide and Narrow Transformations.mp4
    01:17
  • 4. Parquet file in Spark.mp4
    01:32
  • 5. Parallelism and Partitions.mp4
    03:37
  • 6. Shuffling.mp4
    03:07
  • 7. Caching and Storage Levels.mp4
    03:18
  • 1. Introduction to Databricks.mp4
    02:37
  • 2. Databricks Terminology and Databricks Community.mp4
    04:03
  • 3. Create a free Databricks account.mp4
    01:57
  • 4. Introduction to the Databricks environment.mp4
    10:15
  • 5. First steps with Databricks.mp4
    07:30
  • 1. Importing notebooks, language configuration and markdown.mp4
    03:00
  • 2. Databricks File Dystem (DBFS).mp4
    01:59
  • 3. Create, manipulate and visualize tables.mp4
    02:00
  • 4. Databricks widgets.mp4
    01:57
  • 1. Creating and saving DataFrames in Databricks.mp4
    04:40
  • 2. Transformation and visualization of data in Databricks.mp4
    05:51
  • 3. Population Data Analytics Lab.mp4
    07:50
  • 1. Spark SQL and SQL Dataframe API.mp4
    02:09
  • 2. Temporary Views vs Global Temporary Views.mp4
    01:24
  • 3. Spark Dataframes.mp4
    02:21
  • 4. Spark SQL and SQL Dataframe API Lab.mp4
    07:46
  • 1. Introduction to Spark Column Expresions.mp4
    04:19
  • 2. Column Expressions, operators and methods.mp4
    03:12
  • 3. DataFrame Transformation Methods.mp4
    02:54
  • 4. Subset Rows in Dataframe.mp4
    01:57
  • 1. Spark Aggregation Methods.mp4
    03:00
  • 2. Grouped data methods.mp4
    02:22
  • 3. Aggregate Functions and Math Functions.mp4
    01:23
  • 4. Functions and built-in functions review.mp4
    02:38
  • 5. Dataframe NaN functions and dataframe join.mp4
    01:49
  • 1. Import and exploratory analysis of data.mp4
    04:33
  • 2. Variable preprocessing with PySpark and Databricks.mp4
    05:21
  • 3. Definition of the Machine Learning model and development of the Pipeline.mp4
    04:09
  • 4. Model evaluation with PySpark and Databricks.mp4
    04:28
  • 5. Hyperparameter tuning and registration in MLFlow.mp4
    03:19
  • 6. Predictions with new data and visualization of the results.mp4
    03:29
  • 1. Spark Koalas Fundamentals.mp4
    02:39
  • 2. Feature Engineering with Koalas.mp4
    03:51
  • 3. Creating DataFrames with Koalas.mp4
    04:10
  • 4. Data Manipulation and DataFrames with Koalas.mp4
    02:08
  • 5. Working with missing data in Koalas.mp4
    02:24
  • 6. Data visualization and graph generation with Koalas.mp4
    03:07
  • 7. Import and export data with Koalas.mp4
    01:51
  • 1. Example of Streaming word count with Spark Streaming.mp4
    04:18
  • 2. Spark Streaming Configurations Output Modes and Operation Types.mp4
    03:05
  • 3. Spark Streaming Capabilities.mp4
    01:20
  • Description


    Complete course to learn Databricks, including PySpark, Dataframes, Machine Learning, Advanced Analytics and Streaming

    What You'll Learn?


    • Processing Big Data with PySpark in Databricks
    • Databricks environment and Platform
    • ETL, Dataframes and data visualization in Databricks
    • PySpark in Databricks with RDDs, Spark Dataframes API or Spark SQL
    • Spark Column Expresions and Dataframe Agregations
    • Spark Data Sources and Format types
    • Spark Architecture Concepts and Query Optimization
    • Advanced analytics and data visualization with Databricks
    • Machine Learning with Spark at Databricks
    • Spark Streaming at Databricks

    Who is this for?


  • Anyone who wants to learn Databricks
  • Anyone who wants to learn advanced big data skills
  • Anyone wants to make a career as a data engineer, data analyst or data scientist
  • Anyone interested in learning Apache Spark and PySpark for Big Data analytics
  • Anyone wants to learn cutting-edge technology in data processing
  • More details


    Description

    If you are looking for a hands-on, complete and advanced course to learn Databricks and PySpark, you have come to the right place.

    Databricks is a data analytics platform powered by Apache Spark for data engineering, data science, and machine learning. Databricks has become one of the most important platforms to work with Spark, compatible with Azure, AWS and Google Cloud. This makes Databricks and Apache Spark some of the most in-demand skills for data engineers and data scientists, and some of the most valuable skills today. This course will teach you everything you need to know to position yourself in the Big Data job market.


    This course is designed to prepare you to learn everything related to Databricks and Apache Spark, from the Databricks environment, platform and functionalities, to Spark SQL API, Spark Dataframes, Spark Streaming, Machine Learning, advanced analytics and data visualization in Databricks.

    With a complete training, downloadable study guides, hands-on exercises, and real-world use cases, this is the only course you'll ever need to learn Databricks and Apache Spark. You will learn Databricks, starting from the basics to the most advanced functionalities. To do so, we will use visual  presentations, sharing clear explanations and useful professional advice.


    This course covers the following sections:


    • Introduction to Big Data and Apache Spark

    • Spark Fundamentals with Spark RDDs, Dataframes

    • Databricks environment

    • Advanced analytics and data visualization with Databricks

    • Machine Learning with Spark at Databricks

    • Spark Streaming at Databricks


    If you're ready to improve your skills, increase your career opportunities, and become a Big Data expert, join today and get immediate and lifetime access to:

    • Complete Guide to Databricks with Apache Spark (PDF e-book)

    • Downloadable project files

    • Practical exercises and questionnaires

    • Databricks resources such as: Cheatsheets and summaries

    • 1 to 1 expert support

    • Forum of questions and answers of the course



    See you there!

    Who this course is for:

    • Anyone who wants to learn Databricks
    • Anyone who wants to learn advanced big data skills
    • Anyone wants to make a career as a data engineer, data analyst or data scientist
    • Anyone interested in learning Apache Spark and PySpark for Big Data analytics
    • Anyone wants to learn cutting-edge technology in data processing

    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    Data Bootcamp
    Data Bootcamp
    Instructor's Courses
    Data Bootcamp transforma a los profesionales en expertos de datos al optimizar, simplificar y personalizar la experiencia de aprendizaje en línea.Desde hace años hemos ayudado a estudiantes y equipos en más de 150 países a desarrollar las habilidades de análisis e inteligencia empresarial más buscadas, a través de cursos, evaluaciones de habilidades, rutas de aprendizaje y capacitación empresarial.Aprender nuevas habilidades en el sector del dato es fácil. Data Bootcamp será tu equipo personal de instructores, expertos y mentores que te ayudarán en el proceso de aprendizaje y a desarrollar las habilidades profesionales más demandadas.Nuestro equipo se compone por expertos reconocidos en el campo del data anytics, MVPs, MCTs y expertos certificados de Microsoft.
    Students take courses primarily to improve job-related skills.Some courses generate credit toward technical certification. Udemy has made a special effort to attract corporate trainers seeking to create coursework for employees of their company.
    • language english
    • Training sessions 52
    • duration 2:54:02
    • Release Date 2023/02/12