Frobenius inner product

Dalton · December 8, 2017, 10:14pm

I’m working on a model where it is convenient to use a triangular matrix for calculation of different log-likelihoods for different observed data. I can then increment these log-likelihoods by the number of observations that correspond to each log-likelihood in the modelling step and sum them to obtain the total log-likelihood. It turns out this operation has a name: the Frobenius inner product.

For example, in my transformed parameters block I wind up with a T x T matrix with entries [LL_11, LL_12, LL_13, …, LL_1T] in the first row, [0, LL_22, LL_23, … , LL_2T] in the second row on down to the final row which is T - 1 zeroes with LL_TT in the final position. In my transformed data block, I gather my data such that I have a matrix of counts: [n_11, n_12, …, n_1T], [0, n_22, … , n_2T], etc.

In pseudocode these are:

matrix[T, T] log_likeli;
matrix[T, T] the_Ns;

So my question is: what’s the most efficient way to calculate the Frobenius inner product, given that a little over half of my matrix consists of zeroes? I can think of three ways.

Sum the matrix resulting from element-wise multiplication:

target += sum(log_likeli .* the_Ns);
Vectorise to a row_vector and vector and multiply:

target += to_row_vector(log_likeli) * to_vector(the_Ns);
Transpose and loop over rows/columns omitting zero terms through vector slicing:

for (i in 1:T)
target += log_likeli[i,i:t] * the_Ns[i, i:t]’;

I’ll be doing some testing, but I’d curious to hear if this has already been considered and what the best way might be for those who have already walked down this path.

Dalton · December 15, 2017, 7:33pm

I guess I’ll answer my own question. For this particular use case, the third option transposing and omitting zero terms is the only algorithm which will work. This is because log(0) * 0 results in nan.

Bob_Carpenter · December 22, 2017, 1:59am

Thanks for following up with the solution. We have a specialized

lmultiply(a, b) == a * log(b)

that returns 0 if a == b == 0.

Omitting zero terms will be more efficient if it can be done without tests at runtime (anywhere other than in the transformed data block).

Topic		Replies	Views
Log_sum_exp(matrix) by column or by row Modeling hmm	8	1347	May 2, 2025
Efficient way to fill up matrix of log-likelihoods Modeling performance	5	692	August 31, 2021
Elementwise multiplications with row sums appear to be slow Modeling fitting-issues , specification	32	1456	August 23, 2021
Log_sum_exp of real[N,N,N] array Modeling	1	393	January 24, 2019
Rows_dot_product is intended as a shorthand rather than compute optimization, right? Modeling	10	1179	February 15, 2021

Frobenius inner product

Related topics