Pkl generates wrong declaration dimension

Bhomass1 · December 11, 2017, 2:48am

This appears it may be a bug. I have the following declarations

data {
	int N; 
	int T; 
	int D; 
	int S; 
	
	row_vector[D+1] z[N, T]; 
	row_vector[D] X[N, T]; 
	int y[N, 2, T]; // outcome
	
	int transD; 		
	int dphi; 		
	vector[dphi] mu_phi;		 
    matrix[dphi,dphi] Omega_phi;
}

When I run the code, it complains that I declared z to have the wrong dimension
initialization; variable name=z; position=0; dims declared=(34,80,6); dims found=(100,80,6)

N is definitely 100. If I re-pkl the .stan file, sometime it will complain the declaration is (33,80,6)
finally, I forced the declaration to be

	row_vector[D+1] z[100, T];

then it runs without the complaint.

Is that indeed a bug?

syclik · December 11, 2017, 3:14am

What is pkl?

Bhomass1 · December 11, 2017, 3:14am

Pickle

syclik · December 11, 2017, 4:09am

Thanks. Does this only apply to PyStan?

Bhomass1 · December 11, 2017, 4:11am

Yes it’s part of running ep Stan. All in python

syclik · December 11, 2017, 4:26am

Mind moving the thread to Interfaces -> PyStan? I don’t think this has to do with stanc based on what info you’ve provided.

Bhomass1 · December 11, 2017, 4:31am

Really I would have thought if it misinterpreted the dimension N it rests w
the compiler

syclik · December 11, 2017, 4:37am

AFAIK, the pickling / unpickling process doesn’t run the stanc compiler a second time.

stanc is a compiler and it translates the .stan file to C++. Once the compiler is compiled, it will always create the same C++ file given the same .stan input. There really isn’t a chance for it to interpret differently based on multiple runs. (Unless there’s something really funky going on in Python, which is possible, but I think if you’re talking about behavior just when pickling / unpickling, then stanc isn’t the culprit.)

Bhomass1 · December 11, 2017, 5:39am

sorry I believe I made a confusing presentation. What I am saying is I repickled the stan file and ended up with a different result, meaning there is something random about this. However, the entire process is that the pkl file is passed to the client code and model.sampling called on it. downstream there will actually be a cython compilation process which takes the pkl as the input. this is as far I understand the process. nevertheless, it seems obvious that the compilation process mis-read N dimension in the the z[N, T] decalration.

ahartikainen · December 11, 2017, 8:18am

Could it be that N is changing depending on the subprocess?

Add prints to your code and see what they say.

Bhomass1 · December 17, 2017, 9:07pm

I am quite sure this is a bug.

All I did is send in a second parameter Nz with identical value, now the declaration works correctly.

syclik · December 17, 2017, 9:51pm

Sorry, I still don’t know what you’re trying to do / have done. Can you put down the full Stan program and a reproducible example in the form of a script or even just Python code?

I didn’t see an Nz in the original code, so it’s really hard to tell how this is relevant. (I’m not saying that it isn’t; I’m just saying that there isn’t enough information to help.)

Bhomass1 · December 23, 2017, 7:40am

after playing with code, I can tell the offending code is where I used [D + 1] as the declaring dimension. The compiler (or the process of instantiating StanModel, pickle, then cython compile) gets confused about the both N and D + 1 dimensions for that variable.

feel free to test that out with any code you might have. Just declare a dimension with [something + 1].

syclik · December 23, 2017, 11:32am

Mind posting a minimal example that shows the bug? I still don’t understand what you do to trigger the bug.

I really don’t understand this. The Stan language only allows for one declaration per variable.

ahartikainen · December 23, 2017, 11:37am

Sorry, but please give us a minimal working example. With each step, you are doing. Otherwise, this is not going anywhere.

Is this a normal PyStan issue or something else? I’m confused why you are doing pickle-repickle step and for what? For .stan file?

The 33/34 vs 100 would indicate that you split your data in three parts.

Bhomass1 · December 23, 2017, 7:19pm

Hey guys, I apologize for creating this confusing issue, and thanks for following the thread painstakingly. I realize now that this bug only occurs in the combination of stan + ep-stan + the modifications I personally made to ep-stan. I am the only person that can really dig into it to the root, and I can only do that if I learn a lot more about cython, since the bug appears in the generated cython. For now, I am happy to find the work around, which is eliminating the use of [D + 1].

The 33/34 vs 100 would indicate that you split your data in three parts.

This is good insight, I didn’t notice that. It will be a good hint if I dig into the root cause.

I really don’t understand this. The Stan language only allows for one declaration per variable

This is referring to the declaration

row_vector[D+1] z[N, T];

You can see that even though it is a single declaration, it has 3 dimension variables.

Topic		Replies	Views
Unexpected dimension mismatch for integer vector in data initialization Modeling	4	654	June 29, 2019
Mismatch in number dimensions General	22	2747	March 10, 2020
Exception: mismatch in number dimensions declared RStan rstan	4	4419	May 22, 2018
Declaring integer of length 1 fails? as in: `int X[1]` General	2	627	August 2, 2019
Possible bug in command stan init file CmdStan bug	7	1195	June 2, 2017

Pkl generates wrong declaration dimension

Related topics