I am using a train/test split for a three-level binomial model in rstanarm (observations nested within babies, nested within pregnancies). To compare two models on the held-out data, am I correct that the process in rstanarm would be to use:
log_lik(train, newdata = test)
then average the resulting matrix by column (that is, over posterior draws), sum those column averages across the test observations, and compare the two models' totals to each other?
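
To make the question concrete, here is a minimal sketch of what I have in mind. It uses a simulated matrix in place of the real `log_lik()` output, so it runs without a fitted model. The names `ll`, `log_mean_exp`, `elpd_test`, and `mean_ll_sum` are my own placeholders, and I am assuming the matrix has one row per posterior draw and one column per test observation. I am not sure whether the column average should be taken on the log scale or on the likelihood scale, so the sketch computes both.

```r
# Stand-in for log_lik(train, newdata = test): an S x N matrix
# (S posterior draws in rows, N held-out observations in columns).
# The values are simulated purely for illustration.
set.seed(1)
S <- 4000
N <- 50
ll <- matrix(rnorm(S * N, mean = -1, sd = 0.3), nrow = S, ncol = N)

# Numerically stable log(mean(exp(x))) for one column of draws.
log_mean_exp <- function(x) {
  m <- max(x)
  m + log(mean(exp(x - m)))
}

# Option A: average the likelihood (not the log-likelihood) over draws,
# take the log, then sum over held-out observations.
lpd_pointwise <- apply(ll, 2, log_mean_exp)
elpd_test <- sum(lpd_pointwise)

# Option B: plain column means of the log-likelihood, then summed.
# By Jensen's inequality this is never larger than Option A.
mean_ll_sum <- sum(colMeans(ll))

c(elpd_test = elpd_test, mean_ll_sum = mean_ll_sum)
```

With two real fits, I would compute the same total for each model on the same test set and compare the two numbers, higher being better.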