Question about variable types

hahawalk · January 10, 2023, 3:18pm

Hi everyone, I am running a lengthy Stan program and got stuck on indexing a tensor. The function mapping takes an integer index and transforms it into a 3-element vector T, which is later used as an index in the transformed parameters block. The problem is that T[1], T[2], and T[3] are of type real, not int, so I get the error “Index must be of type int or int[] or must be a range. Instead found type real.” in this block:

transformed parameters {
  real<lower=0> r_n[K, D0, S, U0]; // tensor topic
  vector[3] T;
  for (k in 1:K)
    for (i in 1:I) {
      T = mapping(i);
      r_n[k, T[1], T[2], T[3]] = r[k, i];
    }
}

How can I modify the function mapping to fix this problem? Thanks! The full program is below.

functions {
	vector mapping(int I){
		int L;
		int remainder;
		vector[3] T;
		L = I - 1;
		remainder = L%16;
		T[2] = L/16; //label for S
		T[1] = remainder/4; //label for D
		T[3] = remainder%4;	//label for U
		return T; 
	}
}
data {
	int<lower = 1> S;	//number of single-base substitution
        int<lower = 1> J;	//number of samples
        int<lower = 1> K;   //number of sigs
        int<lower = 1> D0;	//number of downstream bases at #0
        int<lower = 1> U0;   //number of upstream bases at #0
        int<lower = 1> D1;	//number of downstream bases at #1
        int<lower = 1> U1;   //number of upstream bases at #1
        int<lower = 1> I;	//number of tri-nucleotide substitutions
        
        int<lower = 0> X[D1, D0, S, U0, U1, J];	// the data mat motifys by samples
        
        real<lower=0> alpha;   // hyperparam for signature rates
        real<lower=0> beta;    //hyperparam for up/downstream bases rates
        
        real<lower=0> gamma0;  // hyperparam for shape parameters
        real<lower=0> gamma1;  // hyperparam for shape parameters
        real<lower=0> delta0;  // hyperparam for mean loadings
        real<lower=0> delta1;  // hyperparam for mean loadings
    }

    transformed data {
            vector<lower = 0>[I] alpha_array = rep_vector(alpha, I); //Dirichlet params for signatures
            vector<lower = 0>[D0] beta_array = rep_vector(beta, D0);	
    }


    parameters {
            vector<lower=0>[K] nu; //inferred shaped parameters for loadings
            vector<lower=0>[K] mu; // inferred mean loadings
            matrix<lower=0>[K, J] theta;	//inferred loadings
            simplex[U1] u[K, U0, S];	//inferred upstream bases
            simplex[D1] d[K, D0, S];	//inferred downstream bases
            simplex[I] r[K];	//inferred sigs K by I
    }

    transformed parameters {
		real<lower=0> r_n[K, D0, S, U0]; 	// tensor topic 
		vector[3] T;
		for (k in 1:K)
			for (i in 1:I) {
				T = mapping(i);
				r_n[k, T[1], T[2], T[3]] = r[k, i];
			}
    }
	
    model {
	real mutation_rate;
	vector[3] T;
	int d0;
	int s;
	int u0;
	
	for (k in 1:K) {
		nu[k] ~ inv_gamma(gamma0, gamma1);
		mu[k] ~ inv_gamma(delta0, delta1);
		r[k] ~ dirichlet(alpha_array);
		
		for (u0 in 1:U0)
			for (s in 1:S){
				u[u0, s, k] ~ dirichlet(beta_array);
				d[u0, s, k] ~ dirichlet(beta_array);
		}
	}
	
	for (j in 1:J)
		for (k in 1:K) {
			theta[k, j] ~ gamma(nu[k], nu[k]/mu[k]);
		}
		
	for (j in 1:J)
		for (d1 in 1:D1)
			for (u1 in 1:U1)
				for (i in 1:I){
					mutation_rate = 0;
					T = mapping(i);
					d0 = T[1];
					s = T[2];
					u0 = T[3];
					for (k in 1:K){
						mutation_rate += u[k, s, u0, u1]*r_n[k, d0, s, u0]*d[k, d0, s, d1]*theta[k, j];
					}
					X[d1, d0, s, u0, u1, j] ~ poisson(mutation_rate);
			
			}
		
	}


WardBrian · January 10, 2023, 4:01pm

Rather than vector, you can use array[] int as the return type of mapping(). Similarly, where you declare vector[3] T, you can use array[3] int T.
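A minimal sketch of what this change could look like, cut down to the pieces needed for the indexing. It makes two assumptions beyond the suggestion above. First, T is declared inside the loop body, because integer variables in transformed parameters can only be local to a nested block. Second, each component has 1 added, because Stan indexes from 1 and the function as posted returns zero-based values (i = 1 gives 0, 0, 0). The argument name idx is illustrative, and %/% is Stan's integer-division operator.

```stan
functions {
  // Map a linear index idx in 1..I to 1-based (D, S, U) indices.
  // The block sizes 16 and 4 are taken from the original post.
  array[] int mapping(int idx) {
    array[3] int T;
    int L = idx - 1;               // zero-based linear index
    int remainder = L % 16;
    T[2] = L %/% 16 + 1;           // label for S (the +1 is an assumption)
    T[1] = remainder %/% 4 + 1;    // label for D
    T[3] = remainder % 4 + 1;      // label for U
    return T;
  }
}
data {
  int<lower=1> K;
  int<lower=1> D0;
  int<lower=1> S;
  int<lower=1> U0;
  int<lower=1> I;   // assumed to equal D0 * S * U0, with D0 = U0 = 4
}
parameters {
  array[K] simplex[I] r;
}
transformed parameters {
  array[K, D0, S, U0] real<lower=0> r_n;
  for (k in 1:K) {
    for (i in 1:I) {
      array[3] int T = mapping(i);   // local integer array, allowed here
      r_n[k, T[1], T[2], T[3]] = r[k, i];
    }
  }
}
```

If the zero-based values were intentional, the + 1 terms would need to move to wherever T is used as an index.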

Bob_Carpenter · January 11, 2023, 5:55pm

The advantage of using arrays is that if you have

array[M] vector[N] x;

then the operation x[m] can operate by reference (i.e., it’s constant time that does not depend on the size of the vector N). Alternatively, if you do this

matrix[M, N] x;

and access x[m], then there are two problems. First, we have to allocate an N-dimensional vector and copy. Second, the copy is not memory local because matrices are stored in column-major rather than row-major order.
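The two layouts can be put side by side in a small sketch. The names x_arr and x_mat and the standard normal prior are placeholders chosen for illustration, not part of the original model.

```stan
data {
  int<lower=1> M;
  int<lower=1> N;
}
parameters {
  array[M] vector[N] x_arr;   // M separate vectors of length N
  matrix[M, N] x_mat;         // one M x N matrix, stored column-major
}
model {
  for (m in 1:M) {
    // x_arr[m] is a vector[N], the m-th element of the array.
    x_arr[m] ~ std_normal();
    // x_mat[m] is a row_vector[N], gathered from row m of the matrix.
    x_mat[m] ~ std_normal();
  }
}
```

Both statements are valid Stan. The difference described above concerns how x[m] is obtained, not what the model means.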

hahawalk · January 12, 2023, 4:28am

Got it, thanks! I will switch from matrix to arrays.

hahawalk · January 12, 2023, 4:41am

Hi, may I double check, is array[3] int T the same as int T[3]? I am a bit confused about the variable type in Stan.

WardBrian · January 12, 2023, 1:10pm

Yes, array is the newer keyword that replaces that older syntax.
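As a rough illustration of the correspondence, the sketch below pairs declarations written with the array keyword with their older equivalents in comments. The variable a and the uniform Dirichlet prior are made up for the example. As far as I know, recent Stan releases accept only the array form, so it is worth checking against the version you have installed.

```stan
data {
  int<lower=1> K;
  int<lower=1> I;
  array[3] int T;                  // older form: int T[3];
  array[K, 4] real<lower=0> a;     // older form: real<lower=0> a[K, 4];
}
parameters {
  array[K] simplex[I] r;           // older form: simplex[I] r[K];
}
model {
  for (k in 1:K) {
    r[k] ~ dirichlet(rep_vector(1.0, I));
  }
}
```

In both forms the array dimensions come first when indexing, so r[k] is a simplex of size I and r[k, i] is a single real.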
