Economic and technological development has increased the amount, density and complexity of maritime traffic, which has resulted in new challenges. One challenge is conforming to the distinct evasion manoeuvres required by vessels entering into near-collision situations (NCSs). Existing rules are vague and do not precisely dictate which, when and how collision avoidance manoeuvres (CAMs) should be executed. The automatic identification system (AIS) is widely used for vessel monitoring and traffic control. This paper presents an efficient, scalable method for processing large-scale raw AIS data using the closest point of approach (CPA) framework. NCSs are identified to create a database of historical traffic data. Important features describing CAMs are defined, estimated and analysed. Applications on a high-quality real-world data set show promising results for a subset of the identified situations. Future applications may play a significant role in the maritime regulatory framework, navigation protocol compliance evaluation, risk assessment, automatic collision avoidance, and algorithm design and testing for autonomous vessels.