We analyze here the optimal premium rates for an insurance company for the case where the claims process is a stochastic process with unknown parameters. The beliefs of the insurance company about these parameters are represented by some prior distribution function, and the premium rates are determined on the basis of this distribution. The insurer learns about the unknown parameters from the past behavior of the claims process, and revises the prior distribution whenever new information is obtained (i.e., a claims stream is observed).
The learning process introduces multi-period dynamic considerations to the problem of the rate determination, since the rates determined at any period affect the claims process, and thus the information available for future decisions. We show how the optimal rates can be obtained by using some dynamic programming techniques. We also prove that the optimal rates obtained in our case are lower than those determined by an insurance company that does not revise its distribution.