How to fit a Multivariate Dirichlet Distribution?

Aminsn · February 12, 2022, 12:22pm

I need to fit a probability distribution to a Markovian transition matrix, the rows of which are probability vectors. However, I don’t want to assume independence among the rows, so I need a joint pdf over these vectors that accounts for their possible dependence. The only distribution that comes to mind is the Multivariate Dirichlet Distribution (MDD), on which I couldn’t find many resources. Does anyone know whether the MDD is the proper joint pdf to use with a Markovian transition matrix and, if so, how it can be fit in Stan? Are there any examples of fitting such a pdf?

jsocolar · February 12, 2022, 1:26pm

I don’t know if there’s a more direct way to do this, but one approach would be to fit the matrix elements as a multivariate gamma and then normalize the rows. Note that this requires additional constraints for identifiability.

Aminsn · February 12, 2022, 3:03pm

The goal is to fit a pdf to those stochastic matrices so that given a new stochastic matrix you should be able to calculate its density. Your suggested approach sounds like a hack and I am not sure if it properly constructs the pdf. Please correct me if I am wrong.

jsocolar · February 12, 2022, 3:39pm

My original suggestion was overly roundabout. What I am suggesting is that you insert some hierarchical structure across rows that sits atop the parameters of the Dirichlet distribution. This will yield a multivariate Dirichlet in the sense that each row will be Dirichlet but the rows will not be independent. If this isn’t what you need, then sorry for the noise!
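As a rough illustration of the hierarchical idea, here is a minimal Python/NumPy simulation (not Stan, and not necessarily the exact structure meant above). It assumes one particular hierarchy: a shared base vector `mu` drawn from a Dirichlet, with each row drawn as Dirichlet(`kappa * mu`). The names `mu`, `kappa`, `base_alpha`, and `sample_transition_matrix` are all hypothetical choices made for this sketch. Under this assumption each row is Dirichlet conditional on `mu`, while the rows become dependent once `mu` is integrated out.

```python
import numpy as np

def sample_transition_matrix(rng, base_alpha, kappa):
    """Draw one K x K stochastic matrix whose rows share a common base vector.

    Assumed hierarchy (illustrative only):
        mu      ~ Dirichlet(base_alpha)   # shared across rows
        P[i, :] ~ Dirichlet(kappa * mu)   # conditionally independent rows
    """
    mu = rng.dirichlet(base_alpha)
    rows = [rng.dirichlet(kappa * mu) for _ in range(len(base_alpha))]
    return np.vstack(rows)

rng = np.random.default_rng(1)
base_alpha = np.ones(3)   # assumed flat hyperprior on the base vector
kappa = 10.0              # assumed concentration around the base vector

draws = np.stack(
    [sample_transition_matrix(rng, base_alpha, kappa) for _ in range(5000)]
)

# Correlation between the first entries of row 0 and row 1 across draws.
# It would be near zero if the rows were independent.
row_corr = np.corrcoef(draws[:, 0, 0], draws[:, 1, 0])[0, 1]
print(f"correlation between P[0,0] and P[1,0]: {row_corr:.2f}")
```

The positive correlation comes entirely from the shared `mu`. In a fitted model, `mu` and `kappa` would be parameters with their own priors rather than fixed simulation inputs.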

Aminsn · February 12, 2022, 4:54pm

Actually, I like the idea of using a hierarchical structure. Thank you for suggesting it. I was so preoccupied with the idea of using a multivariate Dirichlet that I didn’t really think about modelling the dependency through a hierarchical structure.

P.S. Surprisingly, I couldn’t find much on the multivariate Dirichlet distribution except for one or two old papers, so I guess implementing it would be too fiddly.

LucC · February 13, 2022, 10:20am

The proposed use of gamma distributions by @jsocolar is far from a hack. If X, Y, and Z are independently gamma-distributed with the same rate but different shapes \alpha_{1:3}, then the normalised vector (X, Y, Z) / (X + Y + Z) is actually Dirichlet-distributed with concentration vector \alpha_{1:3}.
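The gamma-to-Dirichlet relationship can be checked numerically. The sketch below uses Python/NumPy with illustrative values for the shapes and the common rate (both are assumptions made for the example). It compares the first two moments of normalised gamma draws against the closed-form Dirichlet moments.

```python
import numpy as np

rng = np.random.default_rng(0)
alpha = np.array([2.0, 3.0, 5.0])  # illustrative shape parameters
rate = 1.7                         # common rate; any positive value works
n = 200_000

# Independent gamma draws with shape alpha[k] and a shared rate
# (NumPy parameterises the gamma by scale = 1 / rate).
g = rng.gamma(shape=alpha, scale=1.0 / rate, size=(n, alpha.size))

# Normalise each draw so that it sums to one.
p = g / g.sum(axis=1, keepdims=True)

# Closed-form Dirichlet(alpha) mean and variance for comparison.
a0 = alpha.sum()
exact_mean = alpha / a0
exact_var = alpha * (a0 - alpha) / (a0**2 * (a0 + 1.0))

gamma_mean = p.mean(axis=0)
gamma_var = p.var(axis=0)

print("normalised-gamma mean:", gamma_mean, "exact:", exact_mean)
print("normalised-gamma var: ", gamma_var, "exact:", exact_var)
```

Changing `rate` leaves the distribution of `p` unchanged, because a common rate only rescales every component and cancels in the normalisation.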

I’m not sure there is an identifiability issue here. If you specify a hierarchical model, the hyperparameters should shrink all the \alpha such that they sufficiently capture the variability across the different types of outcomes, correct?
