Atmospheric rivers (ARs) are increasingly recognized globally as an important weather phenomenon associated with extreme precipitation. There is a substantial body of literature indicating that ARs are responsible for a large fraction of wet-season precipitation on western coasts (Rutz et al. 2019) and that they can cause large changes in snowpack (both positive and negative; Guan et al. 2010; Chen et al. 2019). Individual ARs and collections of ARs can bring large amounts of precipitation that drive floods and other storm-related hazards (Ralph et al. 2006, 2019a). ARs are a significant factor for water and associated water systems in the vicinity of western coasts (Gao et al. 2016; Ralph et al. 2019b). It is increasingly evident that they have major impacts on the energy and water budgets of the cryosphere, including mountains (Chen et al. 2019) and high-latitude regions (Gorodetskaya et al. 2014). These research advances hinge on technical advances in tracking ARs in observations, reanalyses, and climate model simulations and on understanding uncertainties associated with different tracking methods. In parallel with the recent increase in research activity around ARs, an increasing number of research groups have developed unique methods for tracking ARs (Shields et al. 2019).
The Atmospheric River Tracking Intercomparison Project (ARTMIP) was created to design a set of experiments that could quantify the uncertainty associated with AR tracking (Shields et al. 2018; Rutz et al. 2019). The concept of a multitiered experimental approach, based on tracking ARs across common datasets, resulted from the First ARTMIP Workshop in 2017. The tier 1 experiment is focused on tracking ARs in a modern reanalysis [Modern-Era Retrospective Analysis for Research and Applications, version 2 (MERRA2)]. The Second ARTMIP Workshop (Shields et al. 2019) was oriented around discussion of tier 1 results and around designing and planning the first set of tier 2 experiments: the tier 2 C20C+ experiment and the tier 2 CMIP5/6 experiment. Both initial tier 2 experiments are focused on understanding the effects of climate change on AR characteristics, with the C20C+ experiment focusing on a set of high-resolution atmosphere-only simulations, and the CMIP5/6 experiment focusing on a multimodel collection of fully coupled simulations from the Coupled Model Intercomparison Project.
Following the Second ARTMIP Workshop, two separate developments motivated the creation of a large dataset of hand-labeled ARs. First, discussions after the workshop suggested that differences among AR tracking algorithms might reflect differences in expert opinion about what constitutes the boundary of an AR; resolving this question would require experts to hand-label ARs. Second, unrelated but concurrent advances in computational climate science have demonstrated the utility of modern machine-learning methods for tracking weather phenomena (Mudigonda et al. 2017; Muszynski et al. 2019; Kurth et al. 2018). These advances also highlight the need for high-quality, expert-labeled datasets to train machine-learning methods.