Lab: Probability distributions
Load packages and data
Exercise 1 (Having 80 or less LSAS at screening)
Check the mean and SD of LSAS scores at screening. Assuming it is normally distributed, what is the probability that a person will have a score of 80 or less?
Use the mean()
and sd()
functions to find the values in the dataset df_clean
mean <- mean(df_clean$lsas_screen)
sd <- sd(df_clean$lsas_screen)
pnorm(80, mean, sd)
<- mean(df_clean$lsas_screen)
mean <- sd(df_clean$lsas_screen)
sd pnorm(80, mean, sd)
Exercise 2 (Probability of having a LSAS score of 100 or more)
What is the probability that a person will have a LSAS score at screening of 100 or more, assuming the scores are normally distributed?
Use the mean()
and sd()
functions to find the values in the dataset df_clean
.
Also, since we are looking at a value equal or greater to a certain point of the cumulative probability distribution, we can use 1-pnorm()
.
mean <- mean(df_clean$lsas_screen)
sd <- sd(df_clean$lsas_screen)
1-pnorm(100, mean, sd)
<- mean(df_clean$lsas_screen)
mean <- sd(df_clean$lsas_screen)
sd 1-pnorm(100, mean, sd)
Exercise 3 (Inclusion rate in the STEpS study)
Let’s say that the average inclusion rate in the STEpS study was 5 participants per week. What is the probability that they’ll include 3 or less people the coming week?
Since we are looking at a rate we can use the function for the cumulative probabililty of the Poisson distribution, using the ppois()
function
lambda = 5
ppois(3,5)
= 5
lambda ppois(3,5)
Exercise 4 (Probability of selecting only women in a sample of 5 participants)
In the STEpS study, 78% of the participants were women. What is the probability that 5 randomly chosen participants would all be women?
Since we are looking at a binary variable, we can get the probability density function for the binomial distribution, using the dbinom()
function
dbinom(x=5, size = 5, prob = 0.78)
# or since these are random events, we could get the joint probability by multiplying the probabilities 5 times
0.78*0.78*0.78*0.78*0.78
#which is the same as
0.78^5
dbinom(x=5, size = 5, prob = 0.78)
# or since these are random events, we could get the joint probability by multiplying the probabilities 5 times
0.78*0.78*0.78*0.78*0.78
#which is the same as
0.78^5
Summary
In this lab, we learned about the probability distributions and how to use them to calculate the probability of different events. We used helpful base R functions to calculate the probability of variables being over or under a certain value.