Companies Home Search Profile

Preparing Data for Modeling with scikit-learn

Focused View

Janani Ravi

3:40:06

27 View
  • 00. Course Overview.mp4
    01:50
  • 00. Module Overview.mp4
    01:22
  • 01. Prerequisites and Course Outline.mp4
    01:42
  • 02. Scaling and Standardization.mp4
    04:58
  • 03. Normalization.mp4
    02:42
  • 04. Transforming Data to Gaussian Distributions.mp4
    01:38
  • 05. Calculating and Visualizing Summary Statistics.mp4
    05:13
  • 06. Using the Standard Scaler for Standardizing Numeric Features.mp4
    05:57
  • 07. Using the Robust Scaler to Scale Numeric Features.mp4
    03:52
  • 08. Normalization and Cosine Similarity.mp4
    06:12
  • 09. Transforming Bimodally Distributed Data to a Normal Distribution Using a Quantile Transformer.mp4
    04:47
  • 10. Reducing Dimensionality Using Factor Analysis.mp4
    06:06
  • 11. Module Summary.mp4
    01:19
  • 00. Module Overview.mp4
    01:14
  • 01. Outliers and Novelties.mp4
    03:27
  • 02. Detecting and Coping with Outlier Data.mp4
    04:26
  • 03. Local Outlier Factor.mp4
    03:18
  • 04. Elliptic Envelope.mp4
    03:08
  • 05. Isolation Forest.mp4
    04:00
  • 06. Outlier Detection Using Local Outlier Factor.mp4
    06:51
  • 07. Outlier Detection Using Isolation Forest.mp4
    05:18
  • 08. Outlier Detection Using Elliptic Envelope.mp4
    02:35
  • 09. Novelty Detection Using Local Outlier Factor.mp4
    05:27
  • 10. Using the Predict Score Samples and Decision Function.mp4
    02:40
  • 11. Outlier Detection Using the Head Brain Dataset.mp4
    03:54
  • 12. Module Summary.mp4
    01:18
  • 00. Module Overview.mp4
    01:03
  • 01. Representing Text Data in Numeric Form.mp4
    05:26
  • 02. Bag-of-words and Bag-of-n-grams Models.mp4
    02:45
  • 03. Vectorize Text Using the Bag-of-words Model.mp4
    05:17
  • 04. Vectorize Text Using the Bag-of-n-grams Model.mp4
    03:08
  • 05. Vectorize Text Using Tf-Idf Scores.mp4
    02:38
  • 06. Hashing for Dimensionality Reduction.mp4
    03:12
  • 07. Reducing Dimensions Using the Hashing Vectorizer.mp4
    03:07
  • 08. Performing Feature Extraction on a Python Dictionary .mp4
    02:14
  • 09. Module Summary.mp4
    01:18
  • 00. Module Overview.mp4
    01:04
  • 01. Representing Images as Matrices.mp4
    03:07
  • 02. Feature Extraction from Images.mp4
    05:43
  • 03. Extracting Patches from Image Data.mp4
    04:23
  • 04. Using Dictionary Learning to Denoise and Reconstruct Images.mp4
    06:41
  • 05. Clustering Image Data Using a Pixel Connectivity Graph.mp4
    06:43
  • 06. Clustering Images Using a Gradient Connectivity Graph.mp4
    05:47
  • 07. Module Summary.mp4
    01:17
  • 00. Module Overview.mp4
    01:23
  • 01. Internal, Artificial, and External Datasets in Scikit Learn.mp4
    02:45
  • 02. Exploring Internal Datasets.mp4
    06:42
  • 03. Creating Artificial Datasets for Regression, Classification, Clustering, and Dimensionality Reduction.mp4
    07:43
  • 04. Generating Manifold Data.mp4
    07:29
  • 05. Module Summary.mp4
    01:08
  • 00. Module Overview.mp4
    01:06
  • 01. Support Vector Classifiers and the Kernel Trick.mp4
    04:11
  • 02. Kernel Approximations.mp4
    06:46
  • 03. Preparing Image Data.mp4
    04:57
  • 04. Comparing Classifiers Trained Using Implicit and Explict Features.mp4
    07:22
  • 05. Comparing Accuracy and Runtime for Different Sample Sizes.mp4
    06:32
  • 06. Summary and Further Study.mp4
    01:55
  • Description


    This course covers important steps in the pre-processing of data, including standardization, normalization, novelty and outlier detection, pre-processing image and text data, as well as explicit kernel approximations such as the RBF and Nystroem methods.

    What You'll Learn?


      Even as the number of machine learning frameworks and libraries increases on a daily basis, scikit-learn is retaining its popularity with ease. Scikit-learn makes the common use-cases in machine learning - clustering, classification, dimensionality reduction and regression - incredibly easy. In this course, Preparing Data for Modeling with scikit-learn, you will gain the ability to appropriately pre-process data, identify outliers and apply kernel approximations. First, you will learn how pre-processing techniques such as standardization and scaling help improve the efficacy of ML algorithms. Next, you will discover how novelty and outlier detection is implemented in scikit-learn. Then, you will understand the typical set of steps needed to work with both text and image data in scikit-learn. Finally, you will round out your knowledge by applying implicit and explicit kernel transformations to transform data into higher dimensions. When you’re finished with this course, you will have the skills and knowledge to identify the correct data pre-processing technique for your use-case and detect outliers using theoretically robust techniques.

    More details


    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing high-quality content for technical skill development. Loonycorn is working on developing an engine (patent filed) to automate animations for presentations and educational content.
    Pluralsight, LLC is an American privately held online education company that offers a variety of video training courses for software developers, IT administrators, and creative professionals through its website. Founded in 2004 by Aaron Skonnard, Keith Brown, Fritz Onion, and Bill Williams, the company has its headquarters in Farmington, Utah. As of July 2018, it uses more than 1,400 subject-matter experts as authors, and offers more than 7,000 courses in its catalog. Since first moving its courses online in 2007, the company has expanded, developing a full enterprise platform, and adding skills assessment modules.
    • language english
    • Training sessions 57
    • duration 3:40:06
    • level advanced
    • Release Date 2023/10/11

    Courses related to Data Science

    Courses related to Artificial Intelligence