Troubleshooting Stan code that estimates two models simultaneously

richardCrandall · November 27, 2024, 10:06pm

Hi, I’m learning Stan using the McElreath Statistical Rethinking textbook and R package and I’ve run into some difficulty with which I’d be grateful to receive some help. I initially asked my question on the Rethinking message board and was directed here, as no one there could replicate the issue I was having.

I’m trying to run the following model from one of the lectures (2023 lecture 12, available on YouTube), and all the chains fail. Best I can tell, it has something to do with how the model is trying to estimate X & Y simultaneously.

Y model:
Y \sim \text{Bernoulli}(p)
\text{logit}(p) = \alpha_g + \beta_{XY}X + \beta_{ZY}Z_g + \beta_{UY} U_g
\alpha = \bar{\alpha} + Z * \tau

X model:
X \sim \text{Normal}(\mu, \sigma)
\mu = \alpha_X + \beta_{UX}U_g
U \sim \text{Normal}(0,1)

Priors:
Z_g \sim \text{Normal}(0, 1)
\alpha_X \sim \text{Normal}(0, 1)
\beta_{XY} \sim \text{Normal}(0, 1)
\beta_{UY} \sim \text{Normal}(0, 1)
\beta_{ZY} \sim \text{Normal}(0, 1)
\beta_{UX} \sim \text{Exponential}(1)
\bar{\alpha} \sim \text{Normal}(0, 1)
\tau \sim \text{Exponential}(1)
\sigma \sim \text{Exponential}(1)

The model uses the following simulated data. I think all the simulations can be run outside of the Rethinking package except rbern(), so I rewrote that.

library(rethinking)
N_groups <- 30
N_id <- 200
a0 <- (-2)
bZY <- (-0.5)
g <- sample(1:N_groups,size=N_id,replace=TRUE) # sample into groups
Ug <- rnorm(N_groups,1.5) # group confounds
X <- rnorm(N_id, Ug[g] ) # individual varying trait
Z <- rnorm(N_groups) # group varying trait (observed)
Y <- rbinom(N_id, size=1, p=plogis( a0 + X + Ug[g] + bZY*Z[g]))
#Y <- rbern(N_id, p=inv_logit( a0 + X + Ug[g] + bZY*Z[g] ) ) # Rethinking version
dat <- list(Y=Y,X=X,g=g,Ng=N_groups,Z=Z)

Here is the Stan code for the model:

functions{
    real reducer( 
            array[] int Y,
            int start , int end , 
            vector Xbar,
            int Ng,
            vector Z,
            vector a,
            vector X,
            array[] int g,
            vector u,
            vector z,
            real bzy,
            real buy,
            real bxy,
            real aX,
            real bux,
            real abar,
            real tau,
            real sigma ) { 
         vector[size(Y)] p;
         vector[size(Y)] mu;
        for ( i in 1:size(Y) ) {
            mu[i] = aX + bux * u[g[start+i-1]];
        }
        return normal_lpdf( X | mu , sigma );
        for ( i in 1:size(Y) ) {
            p[i] = a[g[start+i-1]] + bxy * X[start+i-1] + bzy * Z[g[start+i-1]] + buy * u[g[start+i-1]];
            p[i] = inv_logit(p[i]);
        }
        return bernoulli_lpmf( Y | p );
    } 
}
data{
     vector[30] Xbar;
     int Ng;
    array[200] int Y;
     vector[30] Z;
     vector[200] X;
    array[200] int g;
}
parameters{
     vector[Ng] u;
     vector[30] z;
     real bzy;
     real buy;
     real bxy;
     real aX;
     real<lower=0> bux;
     real abar;
     real<lower=0> tau;
     real<lower=0> sigma;
}
transformed parameters{
     vector[Ng] a;
    a = abar + z * tau;
}
model{
    sigma ~ exponential( 1 );
    tau ~ exponential( 1 );
    abar ~ normal( 0 , 1 );
    bux ~ exponential( 1 );
    aX ~ normal( 0 , 1 );
    bxy ~ normal( 0 , 1 );
    buy ~ normal( 0 , 1 );
    bzy ~ normal( 0 , 1 );
    z ~ normal( 0 , 1 );
    u ~ normal( 0 , 1 );
    target += reduce_sum( reducer , Y , 1 , 
            Xbar,
            Ng,
            Z,
            a,
            X,
            g,
            u,
            z,
            bzy,
            buy,
            bxy,
            aX,
            bux,
            abar,
            tau,
            sigma );
}

For reference, the Rethinking code looks like this:

mru <- ulam(
  alist(
    # Y model
    Y ~ bernoulli(p),
    logit(p) <- a[g] + bxy*X + bzy*Z[g] + buy*u[g],
    transpars> vector[Ng]:a <<- abar + z*tau,
    # X model
    X ~ normal(mu,sigma),
    mu <- aX + bux*u[g],
    vector[Ng]:u ~ normal(0,1),
    # priors
    z[g] ~ dnorm(0,1),
    c(aX,bxy,buy,bzy) ~ dnorm(0,1),
    bux ~ dexp(1),
    abar ~ dnorm(0,1),
    tau ~ dexp(1),
    sigma ~ dexp(1)
  ) , data=dat , chains=4 , cores=14 , threads=4, sample=TRUE )

I’ve tried uninstalling & reinstalling cmdstan, cmdstanr, rethinking, and even as a last resort R, RStudio, and Rtools. Nothing helped. One last note: When I reinstalled cmdstanr, old Stan models that used to work stopped working. I had to ensure that my computer downgraded from C++20 to C++17.

Any help would be greatly appreciated!

jonah · December 4, 2024, 8:57pm

At first glance, the reducer function looks wrong to me because I don’t see how it can ever reach the second return statement. I think if you remove the threads argument it will generate different Stan code (avoiding reduce_sum and creating the reducer function) and it should run OK. Does that at least get it to run for you?

richardCrandall · December 4, 2024, 9:48pm

Thank you Jonah! Yes, when I reran the code without the threads, I was able to get ulam() to run a successful Stan model. You were right that removing the threads argument changes the Stan code that ulam() produces. Greatly appreciate your help!

jonah · December 4, 2024, 9:54pm

I just posted a follow up comment to your GitHub issue where I successfully got the version using reducer to run (by passing the generated Stan code directly to CmdStanR, not running it via Ulam) and I think it’s not estimating the same model as the version without reducer, probably because the binomial isn’t evaluated.

github.com/rmcelreath/rethinking

Can't work "Full luxury Bayes" from 2023 lecture 12

opened 02:12AM - 25 Nov 24 UTC

richardCrandall

I'm working my way through _Statistical Rethinking_ and have run into some troub…le running the models in the bonus material of the 2023 lecture 12. I initially had a lot of trouble replicating the posteriors shown on the slides, but that may be due to sampling differences since we're not using a ```set.seed()```. However, when I attempted to run the "full luxury Bayes" model at the end of the bonus material, Stan couldn't run the model at all. Before attempting to run the chains, I got an error message saying, "line 27, column 8: Unreachable statement (following a reject, fatal_error, break, continue, or return) found, is this intended?" The code at line 27, column 8 of the Stan text file is the next line after a return line, which from what I gather from ChatGPT is part of the problem: Stan can't run two different returns as it terminates after the first return. Here is the Stan code I'm seeing: ``` return normal_lpdf( X | mu , sigma ); // line 26 for ( i in 1:size(Y) ) { p[i] = a[g[start+i-1]] + bxy * X[start+i-1] + bzy * Z[g[start+i-1]] + buy * u[g[start+i-1]]; p[i] = inv_logit(p[i]); } return bernoulli_lpmf( Y | p ); ``` Once the code tried to run, it produced these errors: "Chain 1 Unrecoverable error evaluating the log probability at the initial value. Chain 1 Exception: Exception: normal_lpdf: Random variable has size = 200, but Location parameter has size 1; and they must be the same size." It also referred to line 27, and then referred to this section of the Stan code as well: ``` target += reduce_sum( reducer , Y , 1 , Xbar, Ng, Z, a, X, g, u, z, bzy, buy, bxy, aX, bux, abar, tau, sigma ) ``` For reference, here is the full R code that I am using, from the 2023 lecture 12 slides & video: ```r N_groups <- 30 N_id <- 200 a0 <- (-2) bZY <- (-0.5) g <- sample(1:N_groups,size=N_id,replace=TRUE) # sample into groups Ug <- rnorm(N_groups,1.5) # group confounds X <- rnorm(N_id, Ug[g] ) # individual varying trait Z <- rnorm(N_groups) # group varying trait (observed) Y <- rbern(N_id, p=inv_logit( a0 + X + Ug[g] + bZY*Z[g] ) ) dat <- list(Y=Y,X=X,g=g,Ng=N_groups,Z=Z) xbar <- sapply( 1:N_groups , function(j) mean(X[g==j]) ) dat$Xbar <- xbar ``` ```r mru <- ulam( alist( # Y model Y ~ bernoulli(p), logit(p) <- a[g] + bxy*X + bzy*Z[g] + buy*u[g], transpars> vector[Ng]:a <<- abar + z*tau, # X model X ~ normal(mu,sigma), mu <- aX + bux*u[g], vector[Ng]:u ~ normal(0,1), # priors z[g] ~ dnorm(0,1), c(aX,bxy,buy,bzy) ~ dnorm(0,1), bux ~ dexp(1), abar ~ dnorm(0,1), tau ~ dexp(1), sigma ~ dexp(1) ) , data=dat , chains=4 , cores=4 , sample=TRUE ) ``` I'm not sure if there was a code update in Stan/R/Rethinking/etc. that makes the code no longer work, if it's something unique to my computer, if it's an error on my part, etc. Any help would be greatly appreciated.

Topic		Replies	Views
Error running the same model twice: cmdstan with ulam function (Rethinking package) Modeling	6	1916	January 18, 2022
Nested longitudinal data and estimating joint model Modeling rstan	6	848	November 10, 2020
Stuck at Warmup iteration with no error : CmdStanR CmdStan techniques , fitting-issues	48	3154	April 21, 2020
Four chains vs four jobs General cmdstan	28	215	June 19, 2024
A modified Stan model estimates only after a laptop reboot Modeling specification	24	1046	August 14, 2021

Troubleshooting Stan code that estimates two models simultaneously

Related topics