Understand the run time in Gaussian finite mixture models

jaslenelin · August 2, 2018, 12:00am

Dear Stan users: I am conducting a simulation study to compare the same models fitted with 2 different likelihood functions and data matrices of varying size.
The first model is a single Gaussian model, which I fit to 3 data matrices of size 12 \times 336, 101 \times 336 and 201 \times 336. The rows (12, 101, 201) are the number of observations per block, and there are 336 blocks altogether. It is assumed that y_{it} \sim N(\mu_t, \sigma), where i is the observation index and t is the block index.
I then fit this model at each data size (12, 101 and 201) 50 times, each time drawing 1000 posterior samples (4 chains with 500 iterations per chain), and record the run time.
Here are my results:
N     Mean time   (SE)
12    261.1882    (3.8304)
101   774.0944    (33.4542)
201   778.2316    (29.5691)

From what I understand, to estimate the parameters of a Gaussian distribution Stan would only need the sufficient statistics, which in this case would be the sample mean (the column mean of the data matrix), rather than calculating the log-likelihood at each observation. So I would expect the mean run time to stay roughly the same as the number of observations increases. However, increasing N from 12 to 101 gives a large increase in run time, while the further increase to 201 adds relatively little. If my understanding is correct, can someone please suggest why there is such a large increase from N=12 to N=101, when all that is needed is a column mean?

Thank you so much for your suggestions and advice.

bbbales2 · August 2, 2018, 9:44am

Stan doesn’t do sufficient statistics automatically like that unfortunately. There’s a pull request coming down the line that’ll let folks with certain models take advantage of them (https://github.com/stan-dev/stan/pull/2441), but for now if you want to take advantage of sufficient statistics you’ll need to do it yourself.
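To make "do it yourself" a bit more concrete, here is a minimal Python sketch (not Stan code, and not what the pull request implements) of the identity you would exploit. The function names are made up for illustration. Note that when \sigma is unknown, the sample mean alone is not enough: you also need the number of observations and the sum of squared deviations per block.

```python
import math
import random


def normal_loglik_pointwise(y, mu, sigma):
    """Sum the Gaussian log density over every observation (O(n) per evaluation)."""
    return sum(
        -0.5 * math.log(2 * math.pi) - math.log(sigma) - 0.5 * ((v - mu) / sigma) ** 2
        for v in y
    )


def normal_suff_stats(y):
    """Compute (n, sample mean, sum of squared deviations) once, before sampling."""
    n = len(y)
    ybar = sum(y) / n
    ss = sum((v - ybar) ** 2 for v in y)
    return n, ybar, ss


def normal_loglik_suff(n, ybar, ss, mu, sigma):
    """Same log-likelihood from the three summaries (O(1) per evaluation).

    Uses sum_i (y_i - mu)^2 = ss + n * (ybar - mu)^2.
    """
    return (
        -0.5 * n * math.log(2 * math.pi)
        - n * math.log(sigma)
        - (ss + n * (ybar - mu) ** 2) / (2 * sigma ** 2)
    )


# Simulated block of 201 observations (values are arbitrary, for illustration only).
random.seed(1)
y = [random.gauss(2.0, 1.5) for _ in range(201)]
n, ybar, ss = normal_suff_stats(y)

full = normal_loglik_pointwise(y, mu=1.7, sigma=1.2)
reduced = normal_loglik_suff(n, ybar, ss, mu=1.7, sigma=1.2)
```

In a Stan program the same idea would mean computing the per-block summaries in transformed data and writing the log density in terms of them, so the cost of each gradient evaluation no longer grows with the number of rows.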

The performance of this model will also be heavily influenced by how many parameters you are fitting. So it’s not just the cost of the extra calculations; the exploration of the posterior itself will be different. It’s hard to really nail down performance because there are a lot of things interacting.

Bob_Carpenter · August 12, 2018, 11:24am

The chapter on efficiency in the manual (the user’s guide from 2.18 on) explains how to do this in some cases.

As @bbbales2 points out, we will be releasing some compound functions that do this internally. It’s a particularly big win for some GLMs.

Bob_Carpenter · August 12, 2018, 11:26am

Having said that, there’s no good way to compute the sufficient statistics for a mixture. For instance, you can run the forward algorithm for HMMs, but just by doing so, you compute all the necessary derivatives, so there’s no point in running backward to collect sufficient statistics. To do that efficiently, we need to get into the guts of the C++ implementation and pull the double values out of the autodiff types and build analytic partials.
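As a rough illustration of why the Gaussian shortcut does not carry over, here is a small Python sketch (hypothetical function names, a two-component mixture with a shared \sigma chosen only for the example). It builds two data sets with the same n, mean and sum of squares. A single Gaussian cannot tell them apart, but the mixture log-likelihood differs, because each observation enters through its own log-sum-exp term. This only shows that those particular summaries are not enough; it is not a proof that no summary exists.

```python
import math


def normal_lpdf(y, mu, sigma):
    """Log density of a single observation under N(mu, sigma)."""
    return -0.5 * math.log(2 * math.pi) - math.log(sigma) - 0.5 * ((y - mu) / sigma) ** 2


def log_sum_exp(a, b):
    """Numerically stable log(exp(a) + exp(b))."""
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def normal_loglik(y, mu, sigma):
    """Single-Gaussian log-likelihood, summed over observations."""
    return sum(normal_lpdf(v, mu, sigma) for v in y)


def mixture_loglik(y, theta, mu1, mu2, sigma):
    """Two-component mixture log-likelihood; every observation needs its own term."""
    return sum(
        log_sum_exp(
            math.log(theta) + normal_lpdf(v, mu1, sigma),
            math.log1p(-theta) + normal_lpdf(v, mu2, sigma),
        )
        for v in y
    )


# Two data sets with identical n = 3, mean = 0 and sum of squares = 2.
x = 1.0 / math.sqrt(3.0)
data_a = [-1.0, 0.0, 1.0]
data_b = [x, x, -2.0 * x]

single_a = normal_loglik(data_a, mu=0.4, sigma=0.8)
single_b = normal_loglik(data_b, mu=0.4, sigma=0.8)

mix_a = mixture_loglik(data_a, theta=0.3, mu1=-1.0, mu2=1.0, sigma=0.5)
mix_b = mixture_loglik(data_b, theta=0.3, mu1=-1.0, mu2=1.0, sigma=0.5)
```

Here `single_a` and `single_b` agree up to floating-point error, while `mix_a` and `mix_b` do not, so the mixture has to be evaluated observation by observation.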

jaslenelin · August 14, 2018, 12:35am

Thank you for your reply. I will take a look at the relevant sections in the manual to understand it better.

jaslenelin · August 14, 2018, 1:49am

Sorry for replying again, but I cannot seem to find manual 2.18 on the website; it is still 2.17. Has it not been released yet? Could you please let me know when it will be released? Thank you.

bbbales2 · August 14, 2018, 12:00pm

I think he meant the 2.17 manual has this information (https://github.com/stan-dev/stan/releases/download/v2.17.0/stan-reference-2.17.0.pdf), but when the 2.18 docs come out, it’ll be in a different place.

So depending on when you get around to looking at this you might end up looking in different places.
