Companies Home Search Profile

Building Features from Numeric Data

Focused View

Janani Ravi

2:25:02

35 View
  • 01 - Course Overview.mp4
    01:58
  • 02 - Module Overview.mp4
    00:55
  • 03 - Prerequisites and Course Outline.mp4
    01:44
  • 04 - Scaling and Standardization.mp4
    04:09
  • 05 - Mean Variance and Standard Deviation.mp4
    03:50
  • 06 - Understanding Variance.mp4
    03:44
  • 07 - Demo - Calculating Mean Variance and Standard Deviation.mp4
    06:57
  • 08 - Demo - Box Plot Visualization and Data Standardization.mp4
    06:34
  • 09 - Standard Scaler.mp4
    04:13
  • 10 - Demo - Standardize Data Using the Scale Function.mp4
    05:29
  • 11 - Demo - Standardize Data Using the Standard Scalar Estimator and Apply Bessels Correction.mp4
    03:53
  • 12 - Robust Scaler.mp4
    03:54
  • 13 - Demo - Scaling Data Using the Robust Scaler.mp4
    07:23
  • 14 - Summary.mp4
    01:19
  • 15 - Module Overview.mp4
    00:49
  • 16 - What Is Normalization.mp4
    01:35
  • 17 - Normalization and Cosine Similarity.mp4
    08:03
  • 18 - Demo - Cosine Similarity and the L2 Norm.mp4
    06:54
  • 19 - Demo - Normalizing Data to Simplify Cosine Similarity Calculations.mp4
    04:29
  • 20 - Demo - K-means Clustering with Cosine Similarity.mp4
    03:52
  • 21 - L1 L2 and Max Norms.mp4
    02:39
  • 22 - Demo - Normalization Using L1 L2 and Max Norms.mp4
    05:14
  • 23 - Summary.mp4
    01:09
  • 24 - Module Overview.mp4
    01:11
  • 25 - Converting Continuous Data to Categorical.mp4
    02:47
  • 26 - Demo - Convert Numeric Data to Binary Categories Using a Binarizer.mp4
    05:25
  • 27 - Demo - Using the KBinsDiscretizer to Categorize Numeric Values.mp4
    05:44
  • 28 - Demo - Using Bin Values to Flag Outliers.mp4
    02:54
  • 29 - Scaling Data.mp4
    01:30
  • 30 - Demo - Scaling with the MaxAbsScaler.mp4
    02:14
  • 31 - Demo - Scaling with the MinMaxScaler.mp4
    03:05
  • 32 - Custom Transformations.mp4
    00:33
  • 33 - Demo - Performing Custom Transforms Using the FunctionTransformer.mp4
    03:03
  • 34 - Generating Polynomial Features.mp4
    02:23
  • 35 - Demo - Using Polynomial Features to Transform Data.mp4
    06:14
  • 36 - Transforming Features to Gaussian-like Distributions Using Power Transformers.mp4
    01:22
  • 37 - Demo - Working with Chi Squared Distributed Input Features.mp4
    04:49
  • 38 - Demo - Applying Power Transformers to Get Normal Distributions.mp4
    03:55
  • 39 - Transforming Data to Normal or Uniform Distributions Using Quantile Transformers.mp4
    01:01
  • 40 - Demo - Tranforming to a Normal Distribution Using the QuantileTransformer.mp4
    03:58
  • 41 - Summary and Further Study.mp4
    02:08
  • File.zip
  • Description


    This course exhaustively covers data preprocessing techniques and transforms available in scikit-learn, allowing the construction of highly optimized features that are scaled, normalized and transformed in mathematically sound ways to fully harness the power of machine learning techniques.

    What You'll Learn?


      The quality of preprocessing that numeric data is subjected to is an important determinant of the results of machine learning models built using that data. With smart, optimized data pre-processing, you can significantly speed up model training and validation, saving both time and money, as well as greatly improve model performance in prediction.

      In this course, Building Features from Numeric Data, you will gain the ability to design and implement effective, mathematically sound data pre-processing pipelines.

      First, you will learn the importance of normalization, standardization and scaling, and understand the intuition and mechanics of tweaking the central tendency as well as dispersion of a data feature.

      Next, you will discover how to identify and deal with outliers and possibly anomalous data. You will then learn important techniques for scaling and normalization. Such techniques, notably normalization using the L1-norm, L2-norm and Max norm, seek to transform feature vectors to have uniform magnitude. Such techniques find wide usage in ML model building - for instance in computing the cosine similarity of document vectors, and in transforming images before techniques such as convolutional neural networks are applied to them.

      You will then move from normalization and standardization to scaling and transforming data. Such transformations include quantization as well as the construction of custom transformers for bespoke use cases. Finally, you will explore how to implement log and power transformations. You will round out the course by comparing the results of three important transformations - the Yeo-Johnson transform, the Box-Cox transform and the quantile transformation - in converting data with non-normal characteristics, such as chi-squared or lognormal data into the familiar bell curve shape that many models work best with.

      When you’re finished with this course, you will have the skills and knowledge of data preprocessing and transformation needed to get the best out of your machine learning models.

    More details


    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing high-quality content for technical skill development. Loonycorn is working on developing an engine (patent filed) to automate animations for presentations and educational content.
    Pluralsight, LLC is an American privately held online education company that offers a variety of video training courses for software developers, IT administrators, and creative professionals through its website. Founded in 2004 by Aaron Skonnard, Keith Brown, Fritz Onion, and Bill Williams, the company has its headquarters in Farmington, Utah. As of July 2018, it uses more than 1,400 subject-matter experts as authors, and offers more than 7,000 courses in its catalog. Since first moving its courses online in 2007, the company has expanded, developing a full enterprise platform, and adding skills assessment modules.
    • language english
    • Training sessions 41
    • duration 2:25:02
    • level preliminary
    • Release Date 2023/10/10