We obtain estimates on the uniform convergence rate of the Birkhoff average of a continuous observable over torus translations and affine skew product toral transformations. The convergence rate depends explicitly on the modulus of continuity of the observable and on the arithmetic properties of the frequency defining the transformation. Furthermore, we show that for the one-dimensional torus translation, these estimates are nearly optimal.