Hi,
I’m modelling some reaction time data, and stumbled upon the shifted log-normal family argument. It’s described as well-suited for modelling reaction time, so I was intrigued.
The residual plot and qqplot of my model show equal variance and generally a normal distribution. However, my summary statistics (Est. error, credible intervals, and R-hat) are way better, when I use a shifted log-normal distribution, than a Gaussian distribution.
I’ve searched the web for general knowledge on the shifted log-normal distribution, its assumptions or when to apply this instead of Gaussian or just log-normal distributions, but without luck.
So my question is:
WHEN do you use the shifted log-normal distribution? What kind of data is it suitable for? And why/when do you choose it over the Gaussian or log-normal distributions?
Hi @IdaC,
The big thing for the shifted log-normal is that you assume that no valid reaction time can be smaller than the shift. This could be modelling something like an experiment where subjects should press a button after a light flashed. You know from the experiment design, that no reaction time smaller than the time it takes for the light to flash can be valid.
The brmsfamily: Special Family Functions for 'brms' Models in brms: Bayesian Regression Models using 'Stan' page also proposes exgaussian for reaction times so you could just try that out as well.
One reason the shifted log-normal is a better fit than the normal could be the symmetric nature of the normal distribution. It is trying to match the (probably existing) right tail of your data and through symmetry is then putting too much emphasis on the left tail. A nice way to visualize this is through the pp_check function. Just beware that the smoothing in the plot can make hard bounds look like smooth tails.
On why you might want to add a shift:
Are unshifted distributional models appropriate for response time? https://link.springer.com/article/10.1007%2Fs11336-005-1297-7 [paywalled, let me know if you have trouble finding access elsewere]
However, the shifted-lognormal model can be very sensitive to your lowest data points, which may lead to problems using loo (see e.g. this post: Error using Loo with moment matching)
But of course it always depends on your specific data and questions.