Concatenating N vectors

Tommaso11397 · July 13, 2021, 3:33pm

Hi, I have N vectors v_n of dimension K x 1, I would like to concatenate all these vectors into a a unique vector of dimension N*K.
Obviously I cannot use append_row, I tried with for loop but I could not do it. Any idea? Thanks in advance.

andrjohns · July 13, 2021, 3:55pm

You would need to define your own function that creates an N*K vector and copies each vector into it, try something like (untested):

functions {
  vector append_vector(vector[] vector_arr) {
    int dims[2] = dims(vector_arr);
    vector[dims[1]*dims[2]] rtn;

    for(i in 1:dims[1]) {
      rtn[(1+(i-1)*dims[2]):(i*dims[2])] = vector_arr[i];
    }

    return rtn;
  }
}

Tommaso11397 · July 13, 2021, 4:25pm

Thanks a lot for the answer, but I am not sure I understand (I am sort of new to Stan).

The function takes as second input the array of N vectors v_n correct? What is the first input?

Also, I tried to put it on Stan and it says “variable identifier (name) may not be reserved word” at the second line.

andrjohns · July 13, 2021, 4:31pm

The function takes as second input the array of N vectors v_n correct? What is the first input?

The function only takes one input, the array of vectors. When there’s a function declaration like:

vector append_vector(vector[] vector_arr)

The first vector is the return type of the function and the section vector[] vector_arr indicates that the function takes a single argument (vector_arr) that is an array of vectors (vector[]).

Also, I tried to put it on Stan and it says “variable identifier (name) may not be reserved word” at the second line.

Ah right, that’s because I was using dims as a variable, which is also the name of a function. Try this instead:

functions {
  vector append_vector(vector[] vector_arr) {
    int v_dims[2] = dims(vector_arr);
    vector[v_dims[1]*v_dims[2]] rtn;

    rtn[1:v_dims[2]] = vector_arr[1];

    for(i in 2:v_dims[1]) {
      rtn[(1+(i-1)*v_dims[2]):(i*v_dims[2])] = vector_arr[i];
    }

    return rtn;
  }
}

Tommaso11397 · July 14, 2021, 9:20am

Thanks again, now it is all very clear. My last question is that, I defined the vectors:

parameters {
  vector[K] v[N];
}

How do I generate an array with all the ordered vectors v_n ? If I use simply v, does Stan treat it as an array?

Tommaso11397 · July 20, 2021, 1:06pm

The function does not work for whomever reads this post.

Tommaso11397 · July 20, 2021, 10:37am

Hi, a user on this forum created a function to concatenate J column vectors of dimension K, but I cannot understand why it is not working. I am probably doing something wrong, but I am fairly new to Stan.

This is the function:

 vector append_vector(vector[] vector_arr) {
    int v_dims[2] = dims(vector_arr);
    vector[v_dims[1]*v_dims[2]] rtn;

    rtn[1:v_dims[2]] = vector_arr[1];

    for(i in 2:v_dims[1]) {
      rtn[(1+(i-1)*v_dims[2]):(i*v_dims[2])] = vector_arr[i];
    }

    return rtn;
  }

And this is how I use it:

vector[K] beta[J];
vector[J*K] Beta

Beta = append_vector(beta[])

The input of the function is an array of vectors, what am I doing wrong? How do I transform the betas in an array of vectors?

Funko_Unko · July 20, 2021, 11:01am

I think this has to be changed to

Beta = append_vector(beta)

and it should work (ignoring the missing semicolon).

Tommaso11397 · July 20, 2021, 12:41pm

This was the first thing I tried, it tells me the dimensions do not match at line

Beta = append_vector(beta)

But it also tells me this for the line

“int v_dims[2] = dims(vector_arr)”, so maybe also the function is wrong?

Is there a way to test the function and understand what it is creating, without running the entire model?

Funko_Unko · July 20, 2021, 1:01pm

Huh, how very awkward.

I think the best way to do this is to build an as-simple-as-possible model which uses the function and let it sample with few warmup/sampling iterations. Case in point:

functions{
  vector append_vector(vector[] vector_arr) {
   int v_dims[1] = dims(vector_arr);
   print("HELLO?", v_dims, dims(vector_arr));
   vector[v_dims[1]*v_dims[2]] rtn;

   rtn[1:v_dims[2]] = vector_arr[1];

   for(i in 2:v_dims[1]) {
     rtn[(1+(i-1)*v_dims[2]):(i*v_dims[2])] = vector_arr[i];
   }

   return rtn;
 }
}
transformed data{
  int K = 2;
  int J = 3;
}
parameters {
  vector[K] beta[J];
}
transformed parameters{
  vector[J*K] Beta = append_vector(beta);
}
model {
  for(j in 1:J){
    beta[j] ~ normal(0,1);
  }
}

yields, for me

HELLO?[3,2,1][3,2,1]

which I find very strange. My stan version as reported in the header of the csv file is:

# stan_version_major = 2
# stan_version_minor = 26
# stan_version_patch = 1

Tommaso11397 · July 20, 2021, 1:06pm

So the function does not work basically. Do you have any idea how to concatenate J column vectors?

Btw thank you, you have been very helpful

Funko_Unko · July 20, 2021, 1:16pm

Well it should work, but there appears to be a big somewhere (on Stan’s side).

Try setting the length of v_dims to 3.

Tommaso11397 · July 20, 2021, 1:27pm

I noticed you changed v_dims[2] with v_dims[1] in the second line of your code. This also confused me, since one of the two is not defined in the code. Maybe this causes the issue?

I think the issue is with the dims() function, maybe with num_elements() it works.

martinmodrak · July 20, 2021, 1:40pm

Rstan has Expose user-defined Stan functions to R for testing and simulation — expose_stan_functions • rstan (which still requires compilation)

Funko_Unko · July 20, 2021, 1:55pm

Yes this is the weirdest to me thing, that this works. The compiler should be able to infer the appropriate length at compile time…

Tommaso11397 · July 20, 2021, 1:57pm

So there does not seem to be a solution yet :/

It seems such an easy problem but nobody manages to figure it out…

andrjohns · July 20, 2021, 2:16pm

Please avoid opening multiple threads to continue an existing topic.

By selecting only the first two dims and only using the loop, the following function works:

functions {
  vector append_vector(vector[] vector_arr) {
    int v_dims[2] = dims(vector_arr)[{1,2}];
    vector[v_dims[1]*v_dims[2]] rtn;

    for(i in 1:v_dims[1]) {
      rtn[(1+(i-1)*v_dims[2]):(i*v_dims[2])] = vector_arr[i];
    }

    return rtn;
  }
}

Tested:

library(rstan)
#> Loading required package: StanHeaders
#> Loading required package: ggplot2
#> rstan (Version 2.21.2, GitRev: 2e1f913d3ca3)
#> For execution on a local, multicore CPU with excess RAM we recommend calling
#> options(mc.cores = parallel::detectCores()).
#> To avoid recompilation of unchanged Stan programs, we recommend calling
#> rstan_options(auto_write = TRUE)

modcode = "functions {
  vector append_vector(vector[] vector_arr) {
    int v_dims[2] = dims(vector_arr)[{1,2}];
    vector[v_dims[1]*v_dims[2]] rtn;

    for(i in 1:v_dims[1]) {
      rtn[(1+(i-1)*v_dims[2]):(i*v_dims[2])] = vector_arr[i];
    }

    return rtn;
  }
}"

expose_stan_functions(stanc(model_code=modcode))

append_vector(list(c(1,2,3),c(4,5,6)))
#> [1] 1 2 3 4 5 6

^{Created on 2021-07-20 by the reprex package (v2.0.0)}

Tommaso11397 · July 20, 2021, 2:25pm

It works now, I apologize for the additional thread, I was in a bit of a hurry. Thanks a lot to both of you. So that small mistake in dims(vector_arr)[{1,2}] caused the issue?

andrjohns · July 20, 2021, 2:27pm

Yep, I believe so

martinmodrak · July 20, 2021, 2:52pm

Note: I merged the two threads

Topic		Replies	Views
Declaring a Vector and adding elements in it General	13	2528	April 23, 2020
Append multiple vectors with append_row in generated quantities Modeling	3	2271	August 9, 2019
Concatenating matrix columns to get a vector General	2	854	November 26, 2017
Function to reshape a vector to array of vectors General	5	894	September 23, 2019
hey,"vector[N] c = rep_vector(0,N) " (N=3). What what will return to me? General techniques	4	476	November 20, 2020

Concatenating N vectors

Related topics