When modeling random intercepts in a regression, for example, I follow the standard approach of declaring a parameter vector whose length is the number of groups in the grouping variable:
data {
  int<lower=0> N;
  int<lower=0> N_grps;
  int<lower=1, upper=N_grps> grp[N];
}
parameters {
  vector[N_grps] mu_grp;
}
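For context, the corresponding model block might look something like the sketch below, where a Gaussian outcome y and scale sigma are my illustrative assumptions rather than part of the original question. The point is that the prior applies to every element of mu_grp, but the likelihood only touches the elements indexed by grp:

model {
  mu_grp ~ normal(0, 1);           // prior: applies to all N_grps elements
  y ~ normal(mu_grp[grp], sigma);  // likelihood: only touches groups present in grp
}

Any group that never appears in grp contributes nothing to the likelihood, so its element of mu_grp is informed only by the prior.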
Sometimes I test my code on subsets of the data that lack rows from some groups. To keep things simple I avoid re-indexing the grouping variable, so instead of declaring a smaller N_grps, I keep it the same and end up with elements of mu_grp that are never referenced in the likelihood and are therefore (I think) never actually "fit" beyond their prior.
This doesn't seem to make much of a computational difference for modest N and N_grps, but I'm wondering whether I should expect worse performance for bigger datasets or, more broadly, whether I should avoid including unused parameters for other reasons.