Thank you both! I tested both approaches and they work, complying with my purpose -to have an element greater than or equal to 0.05 in the simplex-.
However, this has lead to my second question. The aim of such constraint is because I’m fitting a mixture model of bivariate Gaussians and I want to force one of such components to be responsible of (at least) 5% of the data points. This may seem strange, but it is my intention to separate the tail of the green component -shown in the Figure- into two (green and yellow) during the fitting process instead of performing it as a post-processing step. The red data points belong to a third component. In the figure, the ellipses correspond to the geometrical representation of the Gaussian distributions with a 95% confidence.
That is why I wanted to impose a constraint on the mixing proportions to force the yellow component to be on top of some datapoints -5% at least-. The thing is that the resulting mixing proportion for the yellow component is close to the lower bound (0.05), but it can be seen that the distribution is not on top of any data point (left image). The desired output would be somethin similar to the right image.
Does it make sense to impose a constraint over the mixing proportions in a mixture model in this way?
If your purpose is scientific and your 5% constraint reflects prior knowledge about the problem then the right way to do this may be to reflect that knowledge in your priors. Rather than using the simplex it may make sense for you to implement the stick breaking process yourself as @bgoodri suggested above (it’s pretty easy to do). Then unlike with the simplex you’ll be able to decide what your prior is for each mixing component separately, and if you like you can make one of the priors strongly informative and focused around the 5% level.
If you share your code here, we can probably help refactor it.
Unfortunately, I don’t believe constraining the proportion of data points is a good way to achieve the goal you have - as witnessed by the fact that the component is pulled towards the lower bound anyway. If you know a priori that you want to separately fit the tail of the green component, I would advise to include that structure in your model. One simple way would be to make the clustering hierarchical: first you find two clusters (red and green) and then you split the green cluster into two clusters. This might be a bit tricky to code but should IMHO better reflect what you are trying to do.
Thank you all for your suggestions :-) All of them have contributed to being able to address the problem. Finally, I have managed it using the ‘generated quantities’ block. The way I see it, I think it makes sense.
Can I use x on the left side of a probability statement (since is a linear transformation)?
x ~ dirichlet(softmax(a + b*x) * phi);
The reason that I can explain better is that I have this into a hierarchical model where dirichlet prior with components < 1 tend to push proportions to 0 and if there is not enough evidence the model never converges and proportion and prior push each-other lower and lower. If I have sensible boundary on simplex this may not happen.
Yes, it’s linear, the Jacobian is, uhh, either (K-1)*log(0.95) or K*log(0.95) but in any case constant so it can be omitted. (I assume x on the right side is an unrelated predictor and not the same simplex.)