Companies Home Search Profile

Reducing Dimensions in Data with scikit-learn

Focused View

Janani Ravi

2:28:54

80 View
  • 0101.Course Overview.mp4
    02:13
  • 0201.Module Overview.mp4
    01:02
  • 0202.Prerequisites and Course Outline.mp4
    01:33
  • 0203.The Curse of Dimensionality.mp4
    05:59
  • 0204.Overfitted Models and Data Sparsity.mp4
    04:03
  • 0205.Exploring Techniques for Reducing Dimensions.mp4
    03:33
  • 0206.Demo Exploring the Classification Dataset.mp4
    06:23
  • 0207.Demo Performing Classification with All Features.mp4
    03:00
  • 0208.Demo Exploring the Regression Dataset.mp4
    05:17
  • 0209.Demo Performing Kitchen Sink Regression Using ML and Non-ML Techniques.mp4
    03:27
  • 0210.Feature Selection and Dictionary Learning.mp4
    05:02
  • 0211.Demo Using Univariate Linear Regression Tests to Select Features.mp4
    05:56
  • 0212.Demo Defining Helper Functions to Build and Train Multiple Models with Different Training Features.mp4
    03:56
  • 0213.Demo Finding the Best Value of K.mp4
    04:25
  • 0214.Demo Using Mutual Information to Select Features.mp4
    03:21
  • 0215.Demo Dictionary Learning to Find Sparse Representations of Data.mp4
    07:37
  • 0216.Summary.mp4
    01:10
  • 0301.Module Overview.mp4
    01:05
  • 0302.The Intuition Behind Principal Components Analysis.mp4
    07:18
  • 0303.Demo Implementing Principal Component Analysis.mp4
    06:42
  • 0304.Demo Building Regression Models with Principal Components.mp4
    03:05
  • 0305.Factor Analysis Using Singular Value Decomposition.mp4
    02:26
  • 0306.Demo Implementing Factor Analysis.mp4
    07:19
  • 0307.Linear Discriminant Analysis for Dimensionality Reduction.mp4
    02:59
  • 0308.Demo Observing Class Seperation Boundaries on the Iris Dataset.mp4
    05:34
  • 0309.Demo Linear Discriminant Analysis for Classification.mp4
    04:09
  • 0310.Summary.mp4
    01:34
  • 0401.Module Overview.mp4
    00:50
  • 0402.The Manifold Hypothesis and Manifold Learning.mp4
    06:09
  • 0403.Demo Generate S-curve Manifold and Setup Helper Functions.mp4
    05:49
  • 0404.Demo Metric and Non-metric Multi Dimensional Scaling.mp4
    02:34
  • 0405.Demo Manifold Learning Using Spectral Embedding TSNE and Isomap.mp4
    04:01
  • 0406.Demo Manifold Learning with Locally Linear Embedding.mp4
    02:20
  • 0407.Demo Preparing Images to Apply Manifold Learning for Dimensionality Reduction.mp4
    04:21
  • 0408.Demo Manifold Learning with Handwritten Digits.mp4
    04:08
  • 0409.Demo Preparing the Olivetti Faces Dataset for Manifold Learning.mp4
    03:47
  • 0410.Demo Manifold Learning on Olivetti Faces Dataset.mp4
    03:04
  • 0411.Summary and Further Study.mp4
    01:43
  • Description


    This course covers a wide range of the important techniques of dimensionality reduction and feature selection available in scikit-learn, allowing model builders to optimize model performance by reducing overfitting, save on model training time and cost, and better visualize the results of machine learning models.

    What You'll Learn?


      Dimensionality Reduction is a powerful and versatile machine learning technique that can be used to improve the performance of virtually every ML model. Using dimensionality reduction, you can significantly speed up model training and validation, saving both time and money, as well as greatly reduce the risk of overfitting.

      In this course, Reducing Dimensions in Data with scikit-learn, you will gain the ability to design and implement an exhaustive array of feature selection and dimensionality reduction techniques in scikit-learn.

      First, you will learn the importance of dimensionality reduction, and understand the pitfalls of working with data of excessively high-dimensionality, often referred to as the curse of dimensionality.

      Next, you will discover how to implement feature selection techniques to decide which subset of the existing features we might choose to use, while losing as little information from the original, full dataset as possible.

      You will then learn important techniques for reducing dimensionality in linear data. Such techniques, notably Principal Components Analysis and Linear Discriminant Analysis, seek to re-orient the original data using new, optimized axes. The choice of these axes is driven by numeric procedures such as Eigenvalue and Singular Value Decomposition.

      You will then move to dealing with manifold data, which is non-linear and often takes the form of swiss rolls and S-curves. Such data presents an illusion of complexity, but is actually easily simplified by unrolling the manifold. Finally, you will explore how to implement a wide variety of manifold learning techniques including multi-dimensional scaling (MDS), isomap, and t-distributed Stochastic Neighbor Embedding (t-SNE). You will round out the course by comparing the results of these manifold unrolling techniques with different datasets, including images of faces and handwritten data.

      When you’re finished with this course, you will have the skills and knowledge of Dimensionality Reduction needed to design and implement ways to mitigate the curse of dimensionality in scikit-learn.

    More details


    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing high-quality content for technical skill development. Loonycorn is working on developing an engine (patent filed) to automate animations for presentations and educational content.
    Pluralsight, LLC is an American privately held online education company that offers a variety of video training courses for software developers, IT administrators, and creative professionals through its website. Founded in 2004 by Aaron Skonnard, Keith Brown, Fritz Onion, and Bill Williams, the company has its headquarters in Farmington, Utah. As of July 2018, it uses more than 1,400 subject-matter experts as authors, and offers more than 7,000 courses in its catalog. Since first moving its courses online in 2007, the company has expanded, developing a full enterprise platform, and adding skills assessment modules.
    • language english
    • Training sessions 38
    • duration 2:28:54
    • level advanced
    • Release Date 2023/10/12