Retrieval of Summer Sea Ice Concentration in the Pacific Arctic Ocean from AMSR2 Observations and Numerical Weather Data Using Random Forest Regression
The Arctic sea ice concentration (SIC) in summer is a key indicator of global climate change and important information for the development of a more economically valuable Northern Sea Route. Passive microwave (PM) sensors have provided information on the SIC since the 1970s by observing the brightness temperature (TB) of sea ice and open water. However, the SIC in the Arctic estimated by operational algorithms for PM observations is very inaccurate in summer because the TB values of sea ice and open water become similar due to atmospheric effects. In this study, we developed a summer SIC retrieval model for the Pacific Arctic Ocean using Advanced Microwave Scanning Radiometer 2 (AMSR2) observations and European Reanalysis Agency-5 (ERA-5) reanalysis fields based on Random Forest (RF) regression. SIC values computed from the ice/water maps generated from the Korean Multi-purpose Satellite-5 synthetic aperture radar images from July to September in 2015–2017 were used as a reference dataset. A total of 24 features including the TB values of AMSR2 channels, the ratios of TB values (the polarization ratio and the spectral gradient ratio (GR)), total columnar water vapor (TCWV), wind speed, air temperature at 2 m and 925 hPa, and the 30-day average of the air temperatures from the ERA-5 were used as the input variables for the RF model. The RF model showed greatly superior performance in retrieving summer SIC values in the Pacific Arctic Ocean to the Bootstrap (BT) and Arctic Radiation and Turbulence Interaction STudy (ARTIST) Sea Ice (ASI) algorithms under various atmospheric conditions. The root mean square error (RMSE) of the RF SIC values was 7.89% compared to the reference SIC values. The BT and ASI SIC values had three times greater values of RMSE (20.19% and 21.39%, respectively) than the RF SIC values. The air temperatures at 2 m and 925 hPa and their 30-day averages, which indicate the ice surface melting conditions, as well as the GR using the vertically polarized channels at 23 GHz and 18 GHz (GR(23V18V)), TCWV, and GR(36V18V), which accounts for atmospheric water content, were identified as the variables that contributed greatly to the RF model. These important variables allowed the RF model to retrieve unbiased and accurate SIC values by taking into account the changes in TB values of sea ice and open water caused by atmospheric effects.