Abstract
We review methods for forecast evaluations and how they can be used in political sciences. We examine how forecast densities are more useful summaries of forecasted variables than point metrics. We also cover how continuous rank probability scores, probability integral transforms, and verification rank histograms can be used to calibrate and evaluate forecast performance. Finally, we present two illustrations, one a simulation and the other a comparison of forecasting models for the China-Taiwan (cross-straits) conflict.