Reward Model and Linear Dynamical System