I am building a model $Y = X\beta + \epsilon$, where both $X$ and $Y$ are matrices, and I want to model the correlation between the columns of $Y$.
As you can see, I have some pretty big parameter matrices, so I want to declare them in the model block so that they are not saved in the output. But I get an error saying "std_normal_lpdf: Random variable[1] is nan, but must not be nan!"
However, if I move the declarations to the transformed parameters block, it runs fine, although very slowly.
Can you advise where it went wrong?
Also, besides the error message, I feel there are many places where the code could be improved; could you advise on that as well?
Thanks a lot!
data {
  int<lower=0> nSamples; // number of samples
  int<lower=0> nBio;     // number of y
  int<lower=0> nX;       // number of x
  matrix[nSamples, nBio] y; // the multivariate outcome matrix
  matrix[nSamples, nX] X;   // predictor matrix
}
parameters {
  matrix[nX, nBio] beta;           // betas from N(0,1)
  vector<lower=0>[nBio] sigma_eps; // sd of the residual
  vector<lower=0>[nBio] tau;       // prior scale
  cholesky_factor_corr[nBio] L;
}
transformed parameters {
}
model {
  matrix[nSamples, nBio] z;
  matrix[nSamples, nBio] mu;
  mu = X * beta + z * (diag_pre_multiply(tau, L))';
  sigma_eps ~ exponential(1);
  tau ~ cauchy(0, 2.5);
  L ~ lkj_corr_cholesky(1);
  for (i in 1:nX) {
    to_vector(beta[i]) ~ student_t(10, 0, 10);
  }
  for (i in 1:nSamples) {
    to_vector(z[i]) ~ std_normal();
  }
  for (i in 1:nSamples) {
    for (j in 1:nBio) {
      y[i][j] ~ normal(mu[i][j], sigma_eps[j]);
    }
  }
}
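For reference, here is a minimal sketch of a version that runs, assuming z is meant to be a matrix of standard-normal latent effects (a non-centered parameterization of the correlated row effects): z has to live in the parameters block so it actually receives values, while mu stays local to the model block so it is not written to the output. The data block and the priors are carried over from the code above.
parameters {
  matrix[nX, nBio] beta;
  vector<lower=0>[nBio] sigma_eps;
  vector<lower=0>[nBio] tau;
  cholesky_factor_corr[nBio] L;
  matrix[nSamples, nBio] z; // must be a parameter to be initialized and sampled
}
model {
  // mu is local to the model block, so it is not stored in the fitted object
  matrix[nSamples, nBio] mu = X * beta + z * diag_pre_multiply(tau, L)';
  sigma_eps ~ exponential(1);
  tau ~ cauchy(0, 2.5);
  L ~ lkj_corr_cholesky(1);
  to_vector(beta) ~ student_t(10, 0, 10);
  to_vector(z) ~ std_normal();
  for (j in 1:nBio) {
    col(y, j) ~ normal(col(mu, j), sigma_eps[j]); // vectorized over samples
  }
}
Note that z itself is still saved in the draws because it is a parameter; only deterministic intermediates such as mu can be kept out of the output by declaring them locally in the model block.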
I see. I didn't specifically test whether z increased my memory use, but keeping mu definitely increased the size of my fitted object, and both significantly slowed down the program.
I worked on another project with many Ys. In that case, keeping all the intermediate values gave me hundreds of GB of fitted object; getting rid of them slimmed it down to a couple of MB.
I hope it's OK to revive a thread like this, but I have a very similar query.
In my case my model block looks like this:
model {
  // latent gp
  vector[n_obs] h;
  // Priors
  sigma ~ normal(0, 1);
  //sigma ~ inv_gamma(1, 1);
  tau ~ cauchy(0, 0.1);
  beta ~ std_normal();
  r ~ cauchy(0, 1);
  // Likelihood
  h ~ multi_normal_cholesky(rep_vector(0, n_obs),
                            cholesky_decompose(cov_Kzr(Z, r, tau, 1e-6)));
  y ~ normal(h + X * beta, sigma);
}
If I run this I get the error: Exception: multi_normal_cholesky_lpdf: Random variable[1] is nan, but must not be nan!
However, if I simply cut and paste the line vector[n_obs] h; to the parameters block, it runs without a problem, but I do not understand why. Scoping doesn't seem to explain it, so what else is at play here? (h is not referred to anywhere else in the code.)
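For concreteness, this is the working variant just described, with only the declaration moved and everything else unchanged (cov_Kzr is the poster's own covariance function):
parameters {
  // ... sigma, tau, beta, r declared as before ...
  vector[n_obs] h; // h is now a parameter, so it gets initialized and sampled
}
model {
  sigma ~ normal(0, 1);
  tau ~ cauchy(0, 0.1);
  beta ~ std_normal();
  r ~ cauchy(0, 1);
  h ~ multi_normal_cholesky(rep_vector(0, n_obs),
                            cholesky_decompose(cov_Kzr(Z, r, tau, 1e-6)));
  y ~ normal(h + X * beta, sigma);
}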
Forgive me, but what is a "true parameter proper", and what's the alternative to a true parameter proper?
If you declare h in the model block, you haven’t assigned any values to h. Until you do, h consists of nothing but nan. If you declare h as a parameter, then a whole bunch of things happen. Conceptually, the most important but also the most abstract is that you parameterize the posterior distribution in terms of h. This affects the posterior geometry, the gradients… everything that is important to Stan.
In practical terms, the sampler initializes the parameters somewhere, so h, declared as a parameter, is populated with a set of candidate values from which the HMC algorithm proceeds, simulating the evolution of h along the Hamiltonian trajectory and then updating its value via multinomial sampling along the trajectory.
He meant that you are writing down a model that you intend to be parameterized by h and not a model wherein you can recover h deterministically from the values of the data and the “true parameters”.
Note that ~ in Stan does not mean “draw a random number from this distribution”. It means “increment the target by a log probability density function corresponding to this distribution evaluated at the current values of the parameters and data”.
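A two-line illustration using the likelihood line from the model above; these are interchangeable ways of writing the same contribution to the target, and neither one draws y from anything:
  // sampling-statement form (drops constant terms of the density)
  y ~ normal(h + X * beta, sigma);
  // explicit form (keeps the normalizing constants)
  target += normal_lpdf(y | h + X * beta, sigma);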
Ah, OK, I think I get it. I was comparing to the latent GP example in the Stan manual (10.3 Fitting a Gaussian process | Stan User's Guide), but I see the difference: in that model, even though f is declared in the model block, it is immediately assigned from data and parameters declared in the parameters block, so the situation is different. Thanks.
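For comparison, the pattern in that manual example is roughly the following (a sketch from memory; gp_exp_quad_cov stands in for whichever covariance function the guide uses): f is declared locally but immediately assigned from the parameter eta, so it never stays nan.
data {
  int<lower=1> N;
  array[N] real x;
  vector[N] y;
}
parameters {
  real<lower=0> rho;
  real<lower=0> alpha;
  real<lower=0> sigma;
  vector[N] eta;
}
model {
  vector[N] f;
  {
    matrix[N, N] K = gp_exp_quad_cov(x, alpha, rho)
                     + diag_matrix(rep_vector(1e-9, N)); // jitter for numerical stability
    f = cholesky_decompose(K) * eta; // f is assigned here, so it is never nan
  }
  rho ~ inv_gamma(5, 5);
  alpha ~ std_normal();
  sigma ~ std_normal();
  eta ~ std_normal();
  y ~ normal(f, sigma);
}
The model is parameterized in terms of eta; f is a deterministic function of eta, the data, and the hyperparameters, which is why the local declaration is unproblematic there.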