Quantitative Analysis of Cadmium Content in Tomato Leaves Based on Hyperspectral Image and Feature Selection
Abstract. To help ensure that tomatoes supplied to consumers are safe and healthy, this study proposes a method for the quantitative determination of cadmium content in tomato leaves based on hyperspectral imaging. Tomato leaves from seven cadmium stress gradients were studied. Hyperspectral images of all samples were first acquired with a hyperspectral imaging system, and spectral data were then extracted from the images. To simplify the model, three algorithms, competitive adaptive reweighted sampling (CARS), variable combination population analysis (VCPA), and bootstrapping soft shrinkage (BOSS), were used to select feature wavelengths within the 431 to 962 nm range. The results showed that, compared with the other two selection methods, BOSS improved prediction performance and greatly reduced the number of features. The BOSS model achieved the best accuracy in both calibration and prediction, with R2c of 0.9907 and RMSEC of 0.4257 mg/kg, and R2p of 0.9821 and RMSEP of 0.6461 mg/kg. Hence, hyperspectral imaging combined with BOSS feature selection is feasible for detecting the cadmium content of tomato leaves and could potentially provide a new approach for detecting cadmium content in other crops.
Keywords: Feature selection, Hyperspectral image technology, Non-destructive analysis, Regression model, Tomato leaves.
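The abstract reports model quality as a coefficient of determination (R2c, R2p) and a root mean square error (RMSEC, RMSEP) for the calibration and prediction sets. The sketch below illustrates how such metrics are commonly computed. It assumes the usual definitions, R2 = 1 - SS_res / SS_tot and RMSE = sqrt(mean squared residual), since the abstract does not state the exact formulas used. The function names and the sample values are hypothetical and are not data from the study.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error, in the same unit as the inputs (e.g. mg/kg)."""
    n = len(y_true)
    squared_residuals = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared_residuals / n)


def r_squared(y_true, y_pred):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical reference and predicted cadmium contents (mg/kg),
# made up purely to exercise the two functions.
reference = [0.0, 2.0, 4.0, 6.0]
predicted = [0.5, 1.5, 4.5, 5.5]

print("RMSE:", rmse(reference, predicted))
print("R2:", r_squared(reference, predicted))
```

Applying the same two functions to the calibration set and to the held-out prediction set would give the calibration pair (R2c, RMSEC) and the prediction pair (R2p, RMSEP), respectively.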