Quantization in estimation/storage/posterior manipulation

Angelo_D_Ambrosio · November 18, 2023, 3:56pm

Hello,

I’m throwing this, but it may not make sense at all for the core devs.

Would quantization (i.e. decreased floating point precision) make sense in Stan at estimation, storage, or posterior generation time? Would it improve speed and storage requirements?

I’m afraid that lower precision may be problematic for the HWN steps, with incremental error, but usually the sampler doesn’t do many steps, no?

For posterior manipulation (e.g. summarisation, diagnosis, visualization, etc), especially when one has many parameters, it may bring speed improvements at virtually no cost.

Where I’m wrong?

ahartikainen · November 19, 2023, 11:39am

I think this was more or less tested when GPU calculations were introduced and I think the result was that double accuracy is needed.

Not sure how many decimals CmdStan saves by default, but probably going to binary output will have impact for IO.

Angelo_D_Ambrosio · November 21, 2023, 9:12am

What you mean with with “binary output”?

Btw, I also suspect that there could be an impact during sampling, but I cannot see how it could greatly affect model storage and summarization. Maybe also things like the loo computation could be sped up?

Topic		Replies	Views
Proposal: including a "canary" variable to illustrate poor exploration of the posterior General techniques	11	807	June 15, 2020
Implementing and evaluating a new inference algorithm Developers	5	910	February 16, 2022
Generated quantites as a mechanism to define model-specific proposals Developers	3	519	June 5, 2020
Bayesian Benchmarking 1.0 General	6	675	July 20, 2021
Stan backend for NumPyro + performance comparison Publicity	10	3247	January 17, 2021

Quantization in estimation/storage/posterior manipulation

Related topics