Baysian Data Analysis of PGA Golf Scores with Py-Stan

jamiebernardin · September 4, 2017, 2:05pm

Hi Everyone,

The link below is a survey of some analysis and prediction of golf scores using py-stan. I use three different models to understand and predict scores. It’s not ground breaking research, but could be good for a Stan conference presentation. The readme contains and embedded slide presentation with overview, analysis, and results.

Best,
Jamie

bgoodri · September 4, 2017, 4:08pm

Cool. You might be interested in presenting this at StanCon in January. You can go faster if you replace Stan constructs like

for (n in 1:N) {
  y[n] ~ normal(alpha[t[n]] + tau[p[n]], sigma[p[n]]); 
}

with the single line

y ~ normal(alpha[t] + tau[p], sigma[p]);

Since t is an integer array with size N, alpha[t] copies the elements of alpha the appropriate number of times so that the total size of alpha[t] is also N.

bbbales2 · September 4, 2017, 4:20pm

I’m totally into people posting sports examples! They usually come with cool plots.

The intervals in this plot: https://github.com/jamiebernardin/bayesian_golf#-4 , are these like mins and maxes of round scores for each player? Or 50% intervals?

And do the orange dots here (https://github.com/jamiebernardin/bayesian_golf#-11) come from generated quantities and the green line the original data?

I was looking at “Different tournaments/different courses have different coefficients that fit SG to score”. Does that factor into the regression here: https://github.com/jamiebernardin/bayesian_golf#-19 ? I think adding comments to the model there would be good (I wasn’t exactly sure what N_T is… Is it number of tournaments?)

Is it possible to plot the AR coefficients along with the score predictions for a few players?

Fun stuff!

jamiebernardin · September 4, 2017, 6:36pm

makes sense… wasn’t sure I could do that with the mapping array. thank you!

jamiebernardin · September 4, 2017, 6:37pm

thanks for the feedback, will definitely do a second pass at legends and more explanation. great idea for AR coef.

jamiebernardin · September 10, 2017, 6:38pm

doesn’t seem to work, fyi.

No matches for:

real[] + real[]

bbbales2 · September 10, 2017, 6:44pm

Oh, if you can define alpha and tau to be vectors and then you can add them.

Either that or you can use to_vector(alpha) + to_vector(tau). Arrays and vector/matrix things are kept distinct in Stan (arrays are std::vectors, and vector/matrix things are Eigen types).

jamiebernardin · September 10, 2017, 7:52pm

Thanks for pointing that out. Yes, indeed… took like 40% the time of
the non-vectorized version.

Topic		Replies	Views
Bradley-Terry model: Stan vs. iterative conjugate fit Modeling	1	499	October 11, 2022
New online Stan coding course: 80 videos + hosted live coding environment General stan , cmdstanr , education	7	2635	November 28, 2024
Help with Bayesian Modelling Modeling rstan , prior-choice , priors , initialization	6	274	July 4, 2024
GSoC 2021 - Q/A thread Google Summer of Code	30	2858	April 13, 2021
Looking for speaker to talk about BridgeStan for Bayesian Data Analysis Meetup Meetings bridgestan	7	454	November 12, 2023

Baysian Data Analysis of PGA Golf Scores with Py-Stan

Related topics