
Understanding PySpark and SparkSQL


Adnan Waheed

2:47:01

  • 1. The world of PySpark.mp4
    00:40
  • 2. PySpark 101.mp4
    04:26
  • 3. Spark Components.mp4
    01:55
  • 4. Setting up PySpark on Google colabs.mp4
    14:57
  • Files.zip
  • 1. What is a dataframe.mp4
    07:02
  • 2. What is RDD.mp4
    06:01
  • 3. How to create RDDs.mp4
    11:52
  • 4. Creating python and lambda functions.mp4
    05:38
  • 5. RDD transformation - Map and filter methods.mp4
    12:25
  • 6. flatMap and Set transformations.mp4
    07:57
  • 7. How about doing multiple transformations.mp4
    09:35
  • Files.zip
  • 1. Understanding pySpark Dataframes.mp4
    09:35
  • 2. Create a dataframe from a schema.mp4
    05:48
  • 3. Create a dataframe from a CSV file.mp4
    02:53
  • 4. PySpark to Pandas dataframe.mp4
    02:32
  • Files.zip
  • 1. Creating Dataframes.mp4
    13:22
  • 2. Applying groupBy and aggregation data.mp4
    14:29
  • 3. multiple aggregation with filtering.mp4
    11:59
  • 4. filtering data with filters.mp4
    10:50
  • 5. Apply pure SQL queries.mp4
    13:05
    Description


    Learn Spark DataFrames, RDDs, transformations, SparkSQL, and more.

    What You'll Learn?


    • Perform complex data manipulations with PySpark
    • Execute SQL queries within PySpark for data analysis
    • Understand the importance and components of PySpark.
    • Create and transform RDDs and DataFrames

    Who is this for?


  • Anyone who wants to explore the world of Spark computing
  • Data engineers, database administrators, and data professionals curious about the emerging field of Spark-based computing
  • Software developers interested in integrating PySpark and SparkSQL into their applications.
    What You Need to Know?


  • Basic Python programming knowledge
  • A desire to learn and excel


    Description

    Do you know the transformative power of Spark computing?

    If you're ready to stand out in the competitive world of data science and big data analytics, this course is your gateway to mastering this essential skill.

    This single course will teach you the fundamentals and more about PySpark and SparkSQL.

    In Section 1, you'll embark on an exciting journey into the world of PySpark, starting with an engaging introduction that highlights its critical role in big data processing. You'll explore the fundamental components of Spark and learn how to set up PySpark on Google Colab, ensuring you're equipped for hands-on practice from day one.


    Section 2 delves deep into the core concepts of DataFrames and RDDs. You'll uncover what DataFrames are, their importance, and how to create and manipulate RDDs with Python and lambda functions. This section also covers advanced transformation techniques, enabling you to perform complex data manipulations with ease.
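
As a rough illustration of the RDD topics listed for Section 2 (Python and lambda functions, map, filter, flatMap, set transformations, and chained transformations), here is a minimal sketch. It is not taken from the course files: the sample data and the names `square`, `is_even`, `split_words`, and `run_rdd_demo` are hypothetical, and it assumes `pyspark` is installed (for example with `pip install pyspark` in a Colab notebook).

```python
# Plain Python function and lambdas; these are what gets passed to RDD methods.
def square(x):
    return x * x

is_even = lambda x: x % 2 == 0
split_words = lambda line: line.split()


def run_rdd_demo():
    """Apply the helpers above to small RDDs on a local Spark session."""
    # Imported lazily so the helpers above can be used without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[*]").appName("rdd-demo").getOrCreate()
    sc = spark.sparkContext

    numbers = sc.parallelize([1, 2, 3, 4, 5, 6])       # hypothetical sample data
    lines = sc.parallelize(["hello spark", "hello rdd"])

    results = {
        # map: one output element per input element
        "squares": numbers.map(square).collect(),
        # filter: keep only elements for which the predicate is True
        "evens": numbers.filter(is_even).collect(),
        # flatMap: each input can yield several outputs, flattened into one RDD
        "words": lines.flatMap(split_words).collect(),
        # chained transformations: filter first, then map
        "even_squares": numbers.filter(is_even).map(square).collect(),
        # set-style transformations: union followed by distinct
        "union_distinct": sorted(
            numbers.union(sc.parallelize([5, 6, 7])).distinct().collect()
        ),
    }
    spark.stop()
    return results
```

Calling `run_rdd_demo()` returns a dictionary of collected results. Transformations such as `map` and `filter` are lazy, so nothing is computed until an action such as `collect()` runs.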


    In Section 3, we focus on PySpark DataFrames, providing you with the expertise to create DataFrames from schemas and CSV files, and seamlessly convert PySpark DataFrames to Pandas DataFrames. These skills are crucial for versatile data manipulation and analysis, setting you apart as a data professional.
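
The three DataFrame tasks named for Section 3 (building a DataFrame from an explicit schema, reading a CSV file, and converting to Pandas) might look roughly like the sketch below. The rows, column names, and the `csv_path` argument are hypothetical placeholders, not the course's data. It assumes `pyspark` and `pandas` are installed.

```python
# Hypothetical sample rows: (name, age)
ROWS = [("Alice", 34), ("Bob", 45)]


def build_schema():
    """Explicit schema matching ROWS: a string column and an integer column."""
    from pyspark.sql.types import StructType, StructField, StringType, IntegerType

    return StructType([
        StructField("name", StringType(), nullable=True),
        StructField("age", IntegerType(), nullable=True),
    ])


def run_dataframe_demo(csv_path=None):
    """Create a DataFrame from a schema, optionally read a CSV, convert to Pandas."""
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[*]").appName("df-demo").getOrCreate()

    # DataFrame from in-memory rows plus an explicit schema
    df = spark.createDataFrame(ROWS, schema=build_schema())
    df.printSchema()

    # DataFrame from a CSV file; header=True uses the first line as column names,
    # inferSchema=True asks Spark to guess column types from the data
    if csv_path is not None:
        csv_df = spark.read.csv(csv_path, header=True, inferSchema=True)
        csv_df.show(5)

    # toPandas() collects all rows to the driver, so it suits small results only
    pandas_df = df.toPandas()
    spark.stop()
    return pandas_df
```

Because `toPandas()` pulls the whole DataFrame into the driver's memory, it is usually applied after filtering or aggregating a large dataset down to a manageable size.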


    Finally, Section 4 introduces you to SparkSQL, where you'll learn to create DataFrames, apply groupBy and aggregation techniques, and filter data with precision. You'll also gain the ability to execute pure SQL queries within PySpark, enhancing your data querying capabilities.
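
To give a feel for the Section 4 topics (groupBy, aggregation, filtering, and pure SQL queries), the following is a minimal sketch under stated assumptions. The `SALES` rows, the `sales` view name, and the query text are invented for illustration and do not come from the course. It assumes `pyspark` is installed.

```python
# Hypothetical sample rows: (region, product, amount)
SALES = [
    ("east", "widget", 10.0),
    ("east", "gadget", 25.0),
    ("west", "widget", 7.5),
]

QUERY = "SELECT region, SUM(amount) AS total FROM sales GROUP BY region ORDER BY region"


def run_sql_demo():
    """Aggregate with the DataFrame API, then run the same idea as a SQL query."""
    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.master("local[*]").appName("sql-demo").getOrCreate()
    df = spark.createDataFrame(SALES, ["region", "product", "amount"])

    # groupBy with several aggregations, then a filter on the aggregated column
    totals = (
        df.groupBy("region")
          .agg(F.sum("amount").alias("total"), F.count("*").alias("n_sales"))
          .filter(F.col("total") > 10)
    )
    totals.show()

    # Register the DataFrame as a temporary view so it can be queried with SQL
    df.createOrReplaceTempView("sales")
    rows = [(r["region"], r["total"]) for r in spark.sql(QUERY).collect()]

    spark.stop()
    return rows
```

The DataFrame API and `spark.sql` run on the same engine, so choosing between them is mostly a matter of which reads more clearly for a given query.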


    Join us now and elevate your data processing skills to new heights with our PySpark course.

    Equip yourself with the knowledge and expertise to excel in the fast-paced world of big data, and distinguish yourself from the crowd.

    Enroll today and become a PySpark PRO!


    Adnan Waheed
    Hello everyone, I am an entrepreneur, a deliverer, a dreamer, a believer, a darer, and a doer. I worked at Bloomberg for 17 years, where I built, managed, and led several projects and teams on a global basis. After Bloomberg, I built my own companies, such as KlickAnalytics, ClickAPIs, and ZoomMarkets, which provide global financial markets analytics with terabytes of data on cloud servers. I've worked extensively with PHP, Python, Angular, REST APIs, cloud systems, time series databases, financial data analytics, UNIX systems, MongoDB, PostgreSQL, advanced system architecture design, and more. My utmost passion is invention, innovation, changing paradigms, game-changing disruptions, people, personal development, and the true adventures of life. Master the art of getting the job done well. My motto: Change the Game!
    Students take courses primarily to improve job-related skills. Some courses generate credit toward technical certification. Udemy has made a special effort to attract corporate trainers seeking to create coursework for employees of their company.
    • Language: English
    • Training sessions: 20
    • Duration: 2:47:01
    • Release date: 2024/08/12