Statistical Thinking for Data Analysis

Focused View

Anthony Donoghue

5:08:42

131 View

1 - Introduction

1 - Section 1 Introduction.mp4

05:29

2 - Populations and Samples

2 - Data Types of Variables Datasets.mp4

07:15

3 - Examples The MMR Vaccine Autism Link COVID Testing in NYC Population.mp4

06:37

3 - Types of Studies

4 - Randomized Experiments Migraine Study Example.mp4

07:27

5 - Randomized Experiment Hydroxychloroquine Study Example.mp4

05:44

6 - Observational Studies Hydroxychloroquine Study Example.mp4

05:36

4 - Measuring Uncertainty with Probability

7 - What is Probability.mp4

09:47

8 - Conditional Probability Hospital Example.mp4

16:14

9 - Conditional Probability Blood Pressure Cholesterol Example.mp4

07:47

10 - Bayes Rule Testing for Disease Example.mp4

06:31

5 - Visualizing and Summarizing Sample Data

11 - Histograms Means and Standard Deviations Climate Change and College Football.mp4

12:29

12 - The Normal Distribution and ZScores.mp4

08:18

13 - The Normal Distribution and ZScores Blood Pressure Example.mp4

15:33

6 - Visualizing and Summarizing Sample Statistics

14 - Sample Proportion Election Polling and CoinToss Example.mp4

11:25

15 - Sample Means Height Example.mp4

09:52

16 - Sample Means Pregnancy Duration Example.mp4

09:22

7 - Confidence Intervals

17 - Population Mean Heart Disease Example.mp4

11:49

18 - Population Proportion National Youth Fitness Survey and Election Polling.mp4

09:27

19 - Comparing Means National Youth Fitness Survey Example.mp4

08:24

8 - Hypothesis Testing

20 - Population Proportion CoinToss Example.mp4

11:06

21 - Population Mean Height and Heart Disease Data Examples.mp4

11:57

22 - OneSided versus TwoSided Alternatives Height and Heart Disease Data Examples.mp4

06:59

23 - What Does the Pvalue Really Mean.mp4

07:57

24 - Comparing Means National Youth Fitness Survey Example.mp4

07:12

25 - Comparing Means College Football Study Example.mp4

03:46

26 - Comparing Proportions Analysis of 2x2 Tables National Youth Fitness Survey.mp4

09:21

27 - Comparing Proportions Analysis of 2x2 Tables The PowerPose Study Example.mp4

02:32

28 - Types of Error and Power of the Test.mp4

16:04

29 - Types of Error and Power of the Test Mask Study and Hydroxychloroquine Example.mp4

05:10

9 - Linear Regression

30 - Correlation National Youth Fitness Survey Data Example.mp4

06:57

31 - The Line of Best Fit COVID19 and National Youth Fitness Survey Data Example.mp4

07:24

32 - Understanding How the Line of Best Fit is Chosen.mp4

07:09

33 - Statistical Inference for the Line of Best Fit National Youth Fitness Survey.mp4

08:01

34 - Multiple Linear Regression National Youth Fitness Survey Data Example.mp4

07:04

10 - Integrity in Research

35 - The Extra Sensory Perception and PowerPose Studies.mp4

06:58

36 - The Vioxx Study Oxycontin and The Opioid Epidemic.mp4

07:59

Description

Truly understand how to interpret and communicate data analysis results by learning how to think statistically

What You'll Learn?

Learn correct and insightful explanations of statistical concepts from an experienced Columbia professor
Develop a solid foundation in statistical thinking necessary to become a good data analyst
Learn how to conduct descriptive and statistical analysis on real-world data using R and RStudio
Learn how to correctly interpret data visualizations and summary statistics
Gain a proper understanding of confidence intervals used for making decisions using data
Truly understand the reasoning of hypothesis testing used for making decisions using data
Understand the strengths and limitations of hypothesis testing and how it can be misused
Build and interpret linear regression models with confidence
Learn statistical concepts through media, research and real-world data examples

Who is this for?

Professionals, like journalists and medical practitioners, who need to be able to read and assess the quality of research findings presented in journal articles

Professionals working with data who need to conduct, interpret and communicate of the results of data analysis

Managers and executives who need to be able to read and understand data analysis reports

Students taking an introductory course in statistics who want to gain a proper understanding of the statistical concepts

Students who have taken more advanced statistics courses who want a more solid foundational understanding of hypothesis testing, confidence intervals and linear regression

Anyone who wants to develop their statistical thinking skills to better question statistical information presented in the media

What You Need to Know?

Basic Arithmetic. No other background knowledge required

More details

Description
Course Description
Most students go into the working world not really understanding statistics at its foundation. In this course, we will learn how to think like a statistician (as opposed to like a mathematician) about statistical calculations. There is a world of difference. In the workplace, you will "get it" while those around you won't, putting you ahead of the curve.
Learn How to Think Statistically About Data Analysis Results
Â· Build your knowledge and understanding of statistics on solid foundations
Â· Learn how to visualize, measure, minimize and reason with variation in data
Â· Learn how to reason with the variation in sample statistics in order to make good decisions
Â· Understand the reasoning of hypothesis testing and what a p-value really means
Â· Understand linear regression and gain the insight â€œAll models are wrong, but some are usefulâ€
Â· Engrain your understanding through media, research and real-world data examples
Stand Out By Understanding What Your Data Analysis Really Means
Knowing how to interpret and communicate the results of data analysis has become an absolute necessity for employees of numerous types of businesses and organizations working with data. However, many students leave introductory statistics courses (and college) without a solid understanding of what the statistical calculations mean.
This course is for anyone who needs to be able to conduct data analysis and/or interpret the results of data analysis with confidence. The focus is on understanding and reasoning with the statistical calculations and not the calculations themselves. The use of media, research and real-world data examples makes the statistical concepts more engaging, palatable and relevant.
Content Overview
The course contains 42 lectures with 6 hours of content and requires no previous experience in data analysis. You can also gain access to the first chapter of my textbookÂ the course is based upon titled "Statistical Thinking through Media Examples".
Statistical thinking is a necessary foundation for any science that involves the analysis of data. If you work with data in any capacity, you need to have a solid foundation in statistical thinking. Without it, you will always be on shaky ground, not exactly sure what the results of your data analysis actually mean.
Statistical thinking begins with the question about the world of populations. We pursue answers to those questions using sample data and statistics. A sample statistic is an estimate of an unknown truth in a population of interest known as a population parameter.
We will contrast the complexity of data collected from observational studies, where causal conclusions canâ€™t be made due to what are known as confounding factors, with the power of randomized experiments to enable us to make causal conclusions about how the world works. This understanding of the complexity of data, and the questioning of the quality of your data, is a important first step in becoming a good data analyst.
We will learn how to measure the level of uncertainty in our decision making with regard to how the world works using probability. A p-value, the end result of the reasoning of hypothesis testing used for decision making, is a conditional probability. Therefore we will focus primarily on understanding what we mean by conditional probability.
We will learn how to measure and reason with the variation in data through the interpretation of data visualizations and summary statistics. As random as the world of variation appears, order is often found in the variation of measurements, in the shape of the normal (probability) distribution or bell-shaped curve. We will become familiar with the characteristics of this curve and how to use it to calculate the probability of particular events occurring.
This order in randomness (in the shape of the normal distribution) is also to be found in the distribution of sample statistics (known as the sampling distribution), around the population parameter of interest. We will use this foundational framework, along with our sample statistic, to make decisions regarding possible values for the population parameter of interest. A solid foundational understanding of these concepts is an absolute necessity for understanding the reasoning of statistical analysis we cover over much of the remainder of the course.
We will reason with the sampling distribution to try and capture the population parameter of interest within a range of plausible values known as a confidence interval. We will reason with the meaning and interpretation of confidence intervals through media, research and real-world data examples.
We will reason with the sampling distribution to learn a very powerful, but often misunderstood process of reasoning for making decisions about the value of population parameters known as hypothesis testing. We will reason with what a p-value and statistical significance really mean, plus gain a solid understanding of very important concepts like the power of the test. We will engrain our understanding of hypothesis testing through media, research and real-world data examples.
We will learn a very powerful and versatile statistical technique known as linear regression, much used in both the field of statistics and data science, for building statistical models that approximate the complex relationships between variables in the observational world of data. Once you gain the insight that â€œAll models are wrong, but some are usefulâ€, you will have succeeded in this course and are truly a statistical thinker.
Finally, we will look at how the process of hypothesis testing can be misused (or simply misunderstood) by researchers due to a lack of a solid foundation in statistical thinking.
The course is based on the third edition of my textbook titled â€œStatistical Thinking through Media Examplesâ€ published by Cognella Inc. It works very well as an armchair study as you work through the course lectures. It was written as a clear and concise narrative with numerous media, research and real-world data examples. This approach makes the statistical concepts more engaging, palatable and relevant to the world around you.
Chapter 2 and chapter 3 of the textbook are not covered in this course. In these chapters we learn how to critically access the quality of research including polls and surveys. We go beyond the news headlines to the source of the research to question the quality of the study design and data collected. No matter how solid a foundation you have in statistical thinking, the bedrock of data analysis is data, so it is very important to have an understanding of the quality of the data you are working with.
It is not necessary to purchase the book in order to succeed in this course. However, it provides strong support as you work through the lectures and a more complete learning experience with many more media, research and real-world data examples.
Who this course is for:
Professionals, like journalists and medical practitioners, who need to be able to read and assess the quality of research findings presented in journal articles
Professionals working with data who need to conduct, interpret and communicate of the results of data analysis
Managers and executives who need to be able to read and understand data analysis reports
Students taking an introductory course in statistics who want to gain a proper understanding of the statistical concepts
Students who have taken more advanced statistics courses who want a more solid foundational understanding of hypothesis testing, confidence intervals and linear regression
Anyone who wants to develop their statistical thinking skills to better question statistical information presented in the media

User Reviews

Rating

average 0

Total votes0

Focused display

I have taught statistics and statistical computing at Columbia University for almost 14 years. I have also taught at Yale, Berkeley and NYU. I have focused on teaching the foundations of statistical thinking to a broad range of students at Columbia University. I observed early on that most introductory statistics courses were simply teaching students how to make statistical calculations. The focus needs to be on why we make statistical calculations and a proper understanding of methods of statistical inference like confidence intervals and hypothesis testing. For example, most students go out into the working world with the knowledge that if a p-value is less than 0.05, the test is statistically significant, but they could not tell you what that means. They have learned how to think mathematically about statistical calculations. They have not learned how to think statistically.My course titled “Statistical Thinking for Data Analysis” is a clear and concise version of the course I teach at Columbia titled “Statistical Thinking for Data Science”. Both courses are based on the 3rd edition of my book titled “Statistical Thinking through Media Examples”.Statistics, the discipline, is about the pursuit of truth using data and statistics. I am simply a teacher, writer, statistical thinker, fortunate enough to be standing on the shoulders of giants – the great statisticians who developed the reasoning and tools for data analysis. From my point of view, your future is bright if you learn how to think statistically. You will gain the strength of mind to see through all the noise to get to the truth. You will gain the ability to make decisions using data and statistics with confidence. You will be ahead of the curve, in a position to lead and make a real difference in your chosen field.If you want your understanding of the tools of data analysis to be built on solid foundations, then my course is for you!

Udemy

View courses Udemy

Students take courses primarily to improve job-related skills.Some courses generate credit toward technical certification. Udemy has made a special effort to attract corporate trainers seeking to create coursework for employees of their company.