Computing autocorrelation/autocovariance in Stan

mike-lawrence · November 10, 2020, 5:19pm

Has anyone tackled coding the computation of the sample autocorrelation (or autocovariance) in Stan yet? I know there are a variety of methods for modelling structure in the autocorrelation, but I’m not finding anything for simply computing what the autocorrelation is. (I’ll be looking at the autocorrelation on a latent parameter, so I can’t just compute it from the observed data.)

I see some neat fft-based code in the posterior package’s autocovariance() function, which presumably is a computationally efficient way to do it. Thought I’d check if anyone had coded it for Stan yet.

bgoodri · November 10, 2020, 10:08pm

You mean in generated quantities?

mike-lawrence · November 10, 2020, 10:11pm

Sure, though I’m also thinking of including the distribution of autocorrelations as part of the model likelihood to help constrain the latent parameter.

bgoodri · November 10, 2020, 10:13pm

You mean autocorrelation not necessarily in an AR1 structure?

mike-lawrence · November 10, 2020, 10:20pm

Yes. The context is periodic models where there are two classes of model configurations that I think are competing: High estimated signal-to-noise-ratio with accurate estimates of the frequency and phase of the periodic signal, versus low estimated signal-to-noise-ratio with arbitrary estimates of the frequency and phase. It occurred to me today that the residuals in the latter should retain high autocorrelation, so explicitly modelling the autocorrelation of the residuals as samples from a population with a true correlation of zero should change the geometry of the parameter space to eliminate this spurious mode.

andrjohns · November 11, 2020, 2:37am

The Math library has autocorrelation and autocovariance functions, but these look to be for internal use since they write the results into a provided output. It might be worth opening an issue in the Math library to provide versions of these that can be exposed to the Stan language.

mike-lawrence · November 12, 2020, 7:10am

Below is code for computing them by hand. I haven’t had a chance to benchmark but I suspect it’s very slow. I also opened an issue over on stan-math as you suggested.

functions{
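	// correlation
	real cor(vector x, vector y){
		int n = num_elements(x);
		real ex = mean(x);
		real ey = mean(y);
		real exy = mean(x .* y);
		// rescale by n/(n-1) so the covariance uses the same divisor as sd()
		real covxy = (exy - (ex*ey)) * n / (n - 1.0) ;
		real sdx = sd(x);
		real sdy = sd(y);
		return( covxy / (sdx*sdy) ) ;
	}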
	// normalized autocorrelations (for lags with >=4 observations)
	vector autocorz(vector x){
		int nx = num_elements(x);
		vector[nx-4] z ;
		for(i in 4:(nx-1)){
			z[i-3] = atanh(cor( x[i:] , x[:(nx-i+1)] ));
		}
		return(z);
	}
}

Edit: benchmarked and yikes it slows things down by a couple orders of magnitude (and I left off the bit implementing my idea to constrain the likelihood, so that’s just the computation, not computation+change-of-posterior-geometry). Hopefully the stan-math implementation performs better.
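For comparison, here is a rough sketch of the FFT route mentioned at the top of the thread, written directly in Stan. It assumes a Stan version with complex vectors and the built-in fft() and inv_fft() functions (2.30 or later, so newer than this thread), the name autocov_fft is made up for illustration, and it returns the biased estimator (divides by n at every lag). Not benchmarked, so treat the speed-up as a hope rather than a result.

```stan
functions {
	// Sample autocovariance at lags 0..(n-1) via FFT.
	// Hypothetical helper; assumes Stan >= 2.30 (complex_vector, fft, inv_fft).
	vector autocov_fft(vector x){
		int n = num_elements(x);
		int m = 2 * n; // zero-pad to 2n so circular wrap-around cannot mix lags
		real mu = mean(x);
		complex_vector[m] cx;
		complex_vector[m] f;
		complex_vector[m] spec;
		complex_vector[m] ac;
		vector[n] acov;
		// centered series followed by n zeros
		for(i in 1:m){
			if(i <= n){
				cx[i] = to_complex(x[i] - mu, 0.0);
			}else{
				cx[i] = to_complex(0.0, 0.0);
			}
		}
		f = fft(cx);
		// power spectrum: squared modulus of each Fourier coefficient
		for(i in 1:m){
			spec[i] = to_complex(square(get_real(f[i])) + square(get_imag(f[i])), 0.0);
		}
		// the inverse transform of the power spectrum gives the lagged sums of products
		ac = inv_fft(spec);
		for(k in 1:n){
			acov[k] = get_real(ac[k]) / n; // acov[1] is lag 0
		}
		return(acov);
	}
}
```

With this indexing, acov[1] is the lag-0 autocovariance (the biased variance), and the autocorrelation at lag k is acov[k+1] / acov[1].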

mike-lawrence · November 17, 2020, 9:56pm

Oops, didn’t code the autocorrelation loop properly. Here’s the fixed version:

	// normalized autocorrelations (for lags with >=4 observations)
	vector autocorz(vector x){
		int nx = num_elements(x);
		vector[nx-4] z ;
		for(i in 1:(nx-4)){
			z[i] = atanh( //atanh "normalizes" correlations (Fisher's r-to-z transform)
				cor( x[1:(nx-i)] , x[(1+i):nx] )
			);
		}
		return(z);
	}
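Since the constraint idea itself never made it into the thread, here is one way it might be sketched. Under a true correlation of zero, the Fisher z of a correlation computed from m pairs is approximately normal with mean 0 and standard deviation 1/sqrt(m-3), which is also why lags need at least 4 pairs. Everything about the surrounding model (the single sinusoid, the parameter names, the placeholder priors) is an assumption for illustration, and autocorz is restated in compact form so the program stands alone. Note that the z values are a nonlinear function of the parameters and the lags are not independent, so this acts as a penalty term rather than a proper likelihood, and Stan will likely warn about a missing Jacobian.

```stan
functions {
	// Fisher-z autocorrelations for lags 1..(nx-4); compact restatement of the version above
	vector autocorz(vector x){
		int nx = num_elements(x);
		vector[nx-4] z;
		for(i in 1:(nx-4)){
			vector[nx-i] a = x[1:(nx-i)] - mean(x[1:(nx-i)]);
			vector[nx-i] b = x[(1+i):nx] - mean(x[(1+i):nx]);
			z[i] = atanh( dot_product(a, b) / sqrt(dot_self(a) * dot_self(b)) );
		}
		return(z);
	}
}
data {
	int<lower=8> n;
	vector[n] t; // observation times
	vector[n] y; // observed signal
}
parameters {
	// hypothetical single-sinusoid model
	real<lower=0> amp;
	real<lower=0> freq;
	real<lower=-pi(), upper=pi()> phase;
	real<lower=0> noise;
}
model {
	vector[n] f = amp * sin(2 * pi() * freq * t + phase);
	vector[n-4] z = autocorz(y - f);
	// placeholder priors, not recommendations
	amp ~ normal(0, 1);
	freq ~ normal(0, 1);
	noise ~ normal(0, 1);
	y ~ normal(f, noise);
	// penalty: residual autocorrelations should look like draws around zero
	for(i in 1:(n-4)){
		z[i] ~ normal(0, inv_sqrt(n - i - 3)); // lag i has n-i pairs
	}
}
```

The highest lags use very few pairs and carry almost no information, so restricting the loop to shorter lags may be a reasonable variation.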
