STA 210 - Fall 2022 - Introduction to multilevel models

id	diary	perform_type	na	gender	instrument
1	1	Solo	11	Female	voice
1	2	Large Ensemble	19	Female	voice
1	3	Large Ensemble	14	Female	voice
43	1	Solo	19	Female	voice
43	2	Solo	13	Female	voice
43	3	Small Ensemble	19	Female	voice

Questions we want to answer

The goal is to understand variability in performance anxiety (na) based on performance-level and musician-level characteristics.

Specifically:

What is the association between performance type (large ensemble or not) and performance anxiety? Does the association differ based on instrument type (orchestral or not)?

Linear regression model

What is the problem with using the following model to draw conclusions?

term	estimate	std.error	statistic	p.value
(Intercept)	15.721	0.359	43.778	0.000
orchestra	1.789	0.552	3.243	0.001
large_ensemble	-0.277	0.791	-0.350	0.727
orchestra:large_ensemble	-1.709	1.062	-1.609	0.108

Other modeling approaches

1️⃣ Condense each musician’s set of responses into a single outcome (e.g., mean, max, last observation, etc.) and fit a linear model on these condensed observations

Leaves few observations (37) to fit the model
Ignores much of the information in the multiple observations for each musician
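
As a rough illustration of approach 1, the sketch below collapses each musician's diary entries into a mean, a maximum, and a last observation. It uses only the rows printed in these slides (ids 1, 43, and 22), not the full data, and the names music_excerpt and condensed are made up for this example. It is written in base R so it runs without extra packages.

```r
# Rows copied from the data excerpts shown in these slides (ids 1, 43, 22).
# The full data set has 37 musicians; this is only an illustration.
music_excerpt <- data.frame(
  id    = c(1, 1, 1, 43, 43, 43, 22, 22, 22, 22, 22, 22),
  diary = c(1, 2, 3, 1, 2, 3, 1, 2, 3, 13, 14, 15),
  na    = c(11, 19, 14, 19, 13, 19, 24, 21, 14, 12, 19, 25)
)

# Sort so that "last observation" means the highest diary number per musician.
music_excerpt <- music_excerpt[order(music_excerpt$id, music_excerpt$diary), ]

# Collapse each musician's entries into a single row of summaries.
condense_one <- function(d) {
  data.frame(
    id      = d$id[1],
    mean_na = mean(d$na),
    max_na  = max(d$na),
    last_na = d$na[nrow(d)]
  )
}
condensed <- do.call(rbind, lapply(split(music_excerpt, music_excerpt$id), condense_one))

print(condensed)
```

Each musician now contributes a single row, which is why this approach leaves so few observations for the model and discards the within-musician variation.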

2️⃣ Fit a separate model for each musician to understand the association between performance type and performance anxiety (Level One models). Then fit a system of Level Two models to predict the fitted coefficients from each musician's Level One model based on instrument type.

Let’s look at approach #2

Level One model

We’ll start with the Level One model to understand the association between performance type and performance anxiety for each musician.

Why is it more meaningful to use performance type for the Level One model than instrument?

For now, estimate each musician's intercept and slope using least-squares regression.

Level One model for one musician

Below is partial data for musician #22

id	diary	perform_type	instrument	na
22	1	Solo	orchestral instrument	24
22	2	Large Ensemble	orchestral instrument	21
22	3	Large Ensemble	orchestral instrument	14
22	13	Large Ensemble	orchestral instrument	12
22	14	Large Ensemble	orchestral instrument	19
22	15	Solo	orchestral instrument	25

Level One model for musician 22

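library(tidymodels)  # linear_reg(), fit(), tidy(); also attaches dplyr for filter()
library(knitr)       # kable()

# Keep only the diary entries for musician 22
id_22 <- music |>
  filter(id == 22)

# Level One model: regress anxiety on the large-ensemble indicator
linear_reg() |>
  set_engine("lm") |>
  fit(na ~ large_ensemble, data = id_22) |>
  tidy() |>
  kable(digits = 3)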

term	estimate	std.error	statistic	p.value
(Intercept)	24.500	1.96	12.503	0.000
large_ensemble	-7.833	2.53	-3.097	0.009

Application exercise

📋 AE 15: Introduction to Multilevel models

See Part 3: Level One Models to fit the Level One model for all 37 musicians.
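
One possible way to repeat the Level One fit for every musician is sketched below. It is not the AE 15 solution: it uses base R lm() instead of the tidymodels pipeline, and it runs on only the rows printed in these slides (ids 1, 43, and 22). The names music_excerpt, fit_level_one, and level_one are assumptions made for this example.

```r
# Rows copied from the data excerpts shown in these slides (ids 1, 43, 22).
music_excerpt <- data.frame(
  id = c(1, 1, 1, 43, 43, 43, 22, 22, 22, 22, 22, 22),
  perform_type = c("Solo", "Large Ensemble", "Large Ensemble",
                   "Solo", "Solo", "Small Ensemble",
                   "Solo", "Large Ensemble", "Large Ensemble",
                   "Large Ensemble", "Large Ensemble", "Solo"),
  na = c(11, 19, 14, 19, 13, 19, 24, 21, 14, 12, 19, 25)
)

# Indicator: 1 for large ensemble performances, 0 for solo or small ensemble.
music_excerpt$large_ensemble <-
  as.integer(music_excerpt$perform_type == "Large Ensemble")

# Fit na ~ large_ensemble for one musician and return the two coefficients.
fit_level_one <- function(d) {
  est <- coef(lm(na ~ large_ensemble, data = d))
  data.frame(
    id        = d$id[1],
    intercept = est[["(Intercept)"]],
    slope     = est[["large_ensemble"]]
  )
}

# Apply the fit to each musician's rows and stack the results.
level_one <- do.call(
  rbind,
  lapply(split(music_excerpt, music_excerpt$id), fit_level_one)
)

print(level_one)
```

Musician 43 has no large ensemble performances in these rows, so the slope comes back as NA. For musician 22, the intercept from these six rows agrees with the 24.500 reported above, but the slope should not be expected to match -7.833, since that estimate uses all of the musician's diary entries.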

Level One model summaries

Recreated from BMLR Figure 8.9

Now let’s consider if there is an association between the estimated slopes, estimated intercepts, and the type of instrument.

Level Two Model

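The slope and intercept for each musician can be modeled as functions of instrument type (orchestral or not), with one Level Two model for the intercepts and one for the slopes.

Note that the response variables in the Level Two models are not observed outcomes but the fitted slope and intercept for each musician.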

Application exercise

📋 AE 15: Introduction to Multilevel models

See Part 4: Level Two Models.
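
The Level Two step can be sketched as two ordinary regressions on the musician-level estimates. The example below is only an illustration in base R on the three musicians whose rows are printed in these slides, so the coefficients will not match the full-data results. The 0/1 coding of orchestra (voice = 0, orchestral instrument = 1) and all object names are assumptions made for this example.

```r
# Rows from the slide excerpts: ids 1 and 43 are vocalists (orchestra = 0),
# id 22 plays an orchestral instrument (orchestra = 1).
music_excerpt <- data.frame(
  id             = rep(c(1, 43, 22), times = c(3, 3, 6)),
  orchestra      = rep(c(0, 0, 1), times = c(3, 3, 6)),
  large_ensemble = c(0, 1, 1,  0, 0, 0,  0, 1, 1, 1, 1, 0),
  na             = c(11, 19, 14,  19, 13, 19,  24, 21, 14, 12, 19, 25)
)

# Level One: one least-squares fit per musician.
level_one <- do.call(rbind, lapply(split(music_excerpt, music_excerpt$id), function(d) {
  est <- coef(lm(na ~ large_ensemble, data = d))
  data.frame(
    id        = d$id[1],
    orchestra = d$orchestra[1],
    intercept = est[["(Intercept)"]],
    slope     = est[["large_ensemble"]]
  )
}))

# Level Two: the fitted intercepts and slopes become the responses.
intercept_fit <- lm(intercept ~ orchestra, data = level_one)
slope_fit     <- lm(slope ~ orchestra, data = level_one)

print(coef(intercept_fit))
print(coef(slope_fit))

# lm() silently drops musicians whose slope is NA.
print(nobs(slope_fit))
```

With only three musicians the fits are trivial, but the mechanics are the same as with all 37: the slope model is fit on fewer musicians than the intercept model because anyone without a large ensemble performance has no estimated slope.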

Estimated coefficients by instrument

Level Two model

Model for intercepts

term	estimate	std.error	statistic	p.value
(Intercept)	16.283	0.671	24.249	0.000
orchestra	1.411	0.991	1.424	0.163

Model for slopes

term	estimate	std.error	statistic	p.value
(Intercept)	-0.771	0.851	-0.906	0.373
orchestra	-1.406	1.203	-1.168	0.253

Writing out the models

Level One

for each musician.

Level Two

Composite model

(Note that we also have the error terms that we will discuss next class.)
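
As a sketch, the fitted models can be written out using the Level Two estimates from the tables above. The notation is an assumption in the style of BMLR and is not taken from these slides: Y_ij is the anxiety of musician i before performance j, LargeEns_ij and Orch_i are the 0/1 indicators, and a_i and b_i are the musician-specific intercept and slope. Error terms are omitted.

```latex
\begin{align*}
% Level One (fitted), for each musician i
\hat{Y}_{ij} &= \hat{a}_i + \hat{b}_i \,\text{LargeEns}_{ij} \\[4pt]
% Level Two (fitted), from the intercept and slope tables
\hat{a}_i &= 16.283 + 1.411 \,\text{Orch}_i \\
\hat{b}_i &= -0.771 - 1.406 \,\text{Orch}_i \\[4pt]
% Composite: substitute the Level Two fits into Level One
\hat{Y}_{ij} &= 16.283 + 1.411 \,\text{Orch}_i - 0.771 \,\text{LargeEns}_{ij}
               - 1.406 \,\text{Orch}_i \,\text{LargeEns}_{ij}
\end{align*}
```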

What is the predicted average performance anxiety before solos and small ensemble performances for vocalists and keyboardists? For those who play orchestral instruments?
What is the predicted average performance anxiety before large ensemble performances for those who play orchestral instruments?
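
These questions can be worked through by plugging the reported Level Two estimates into the composite model. The sketch below does that arithmetic in base R. The function predict_na and the vectors alpha and beta are names invented for this example, and vocalists and keyboardists are assumed to be coded as orchestra = 0.

```r
# Level Two estimates as reported in the tables in these slides.
alpha <- c(intercept = 16.283, orchestra = 1.411)   # model for intercepts
beta  <- c(intercept = -0.771, orchestra = -1.406)  # model for slopes

# Predicted anxiety for a given instrument type and performance type (0/1 each).
predict_na <- function(orchestra, large_ensemble) {
  a_i <- alpha[["intercept"]] + alpha[["orchestra"]] * orchestra
  b_i <- beta[["intercept"]] + beta[["orchestra"]] * orchestra
  a_i + b_i * large_ensemble
}

# Solo or small ensemble, vocalists and keyboardists
voice_small <- predict_na(orchestra = 0, large_ensemble = 0)

# Solo or small ensemble, orchestral instruments
orch_small <- predict_na(orchestra = 1, large_ensemble = 0)

# Large ensemble, orchestral instruments
orch_large <- predict_na(orchestra = 1, large_ensemble = 1)

print(c(voice_small = voice_small, orch_small = orch_small, orch_large = orch_large))
```

Under these estimates the predictions are 16.283, 17.694, and 15.517, respectively.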

Disadvantages to this approach

⚠️ Weighs each musician the same regardless of number of diary entries

⚠️ Drops subjects who have missing values for slope (7 individuals who had no large ensemble performances)

⚠️ Does not share strength effectively across individuals.

Application exercise

📋 AE 15: Introduction to Multilevel models

See Part 5: Distribution of values.

Next time

We will use a unified approach that utilizes likelihood-based methods to address some of these drawbacks.

Acknowledgements

The content in the slides is from
- BMLR: Chapter 7 - Correlated data
- BMLR: Chapter 8 - Introduction to Multilevel Models
- Sadler, Michael E., and Christopher J. Miller. 2010. “Performance Anxiety: A Longitudinal Study of the Roles of Personality and Experience in Musicians.” Social Psychological and Personality Science 1 (3): 280–87. http://dx.doi.org/10.1177/1948550610370492.
