Martin discusses this problem in his blog post on identifiability and divergences.
In the same thread, I suggest one (unprincipled and untested) way to constrain the beta parameter.