Random effect models with lmer function in R
Published on
19 August 2021
Nik Samoylov image
Nik Samoylov
Founder

Random effects are everywhere in survey data. Let's try to do appropriate modelling for them in R!


This note was prepared for Conjointly researchers, but it would be a shame to hide it inside internal documentation. It is very technical. You do not need to know any of this to use the tools. 😊

Understanding random effects models starts with Jared Knowles’ article Getting Started with Mixed Effect Models in R. Now, let’s get our hands dirty and generate some random data:

library(lme4)
library(data.table)

N=300

# Assign real coefficients
real_coefficients=list(
    blue=10,
    yellow=-10,
    green=5,
    speed=5,
    intercept=50
)

# Colour is a property of the object
data=data.table(
    colour=sample(c("blue","yellow","green"),N,replace=T),
    speed=runif(N,1,50)
)

# And assigning innate preference for each colour
for(i in names(real_coefficients)){
  data[colour==i,  innate_preference:= real_coefficients[[i]]]
}

# Actual preference is sum of innate preference for colour, innate preference for speed and error
data[,preference:=
  real_coefficients[["intercept"]]+
  innate_preference+
  (real_coefficients[["speed"]]*speed)+
  rnorm(N)
]

# Review what we have
data

Let’s estimate a simple model

summary(lm(preference~colour+speed,data=data))

Compare its coefficients to real_coefficients.

Let’s add random effect: measurement instrument (an intercept)

data$instrument=sample(c(-10,0,10),N,replace=T)

data[, new_preference:=preference+instrument]

# Explore results of model
model=lmer(new_preference~colour+speed + (1 | instrument), data = data)
summary(model)
coef(model)$instrument

Let’s add random effect: speed of measurement instrument (a slope)

data$instrument_speed=sample(c(-10,0,10),N,replace=T)

data[, new_preference:=preference+speed*instrument_speed+rnorm(N)]

# Explore results of model
model=lmer(new_preference~colour+speed + (speed | instrument_speed), data = data)
summary(model)
coef(model)$instrument_speed

That’s the end! Now we understand mixed effect models!


Read these articles next:

Synthetic respondents

Synthetic respondents are the homoeopathy of market research

This article covers how synthetic responses are generated, their unreliability and invalidity, the allure of synthetic respondents, and the effects of this trend.

View article
Guidelines for conducting surveys with children

Best practice guidelines for conducting surveys with children

Learn how to design effective surveys for children by understanding cognitive development stages, optimising question formats, and minimising response bias to ensure valid and reliable data collection.

View article
How to Select Statistics for a Single Variable

How to Select Statistics for a Single Variable

What are the different statistics available for a single variable? Selecting statistics for a single variable depends of the type of variable: nominal, ordinal, or interval.

View article