As one of the important elements of plant, nitrogen is the main component composing amino acid, protein, alkaloid, nucleic acid, chlorophyll and is of great importance to the physiological activities of plant  . Hyperspectral remote sensing is featured with high spectral resolution, continuous waveband and quick measurement. It can be a new method for real-time, fast and lossless apple tree nitrogen estimation  .
The hyperspectral remote sensing technology was applied to plant field in 1990s. Recently, this technology is being used for the inversion study of nutrient content of wheat   , corn  , rice  and cotton  by scholars from home and abroad. However, Multiple Scattering Correction (MSC) is rarely used in spectral data pre- processing. Multiple Scattering Correction is a kind of spectral reflectivity pre-pro- cessing method with scattering elimination function  -  . MSC was firstly proposed by Martens and others to eliminate the scattering interference due to different sample granularity   . Zhao et al.  studied the influence of MSC on near infrared spectrum analysis and thought the sample spectral reflectivity was not only related to component content but also subjected to sample particle size, distribution situation and tightness degree. These could be receded through MSC. Huang et al.  used MSC to process the hyperspectral reflectivity data of wheat leaves to measure chlorophyll content in wheat leaves. But there are few comparative and analytic researches on whether MSC could improve the accuracy of prediction model.
Therefore, the prediction models were established by taking apple tree as research object in this study. Due to uneven distribution of leaves and branches and various density of canopy in different apple trees, the scattering will therefore disturb spectral information. The original reflectivity was conducted the First Derivative (FD) and Multiple Scattering Correction plus First Derivative (MSC + FD), respectively. The correlation difference was analyzed between the spectral data after the two pre-processings and apple nitrogen content. The prediction models were established of apple tree canopy nitrogen content, respectively. The prediction accuracy was compared to study the influence of MSC. This research aimed at exploring a pre-processing method of hyperspectrum in order to improve the accuracy of prediction model.
2. Materials and Methods
2.1. The Research Area
The research area is located in Qixia City, Yantai, China, which is known as the “Apple City of China” with the east longitude of 120˚33' - 121˚15' and northern latitude of 37˚05' - 37˚32'. The orchard area reaches 4.3 × 104 ha and the main planting variety is Red Fuji apple tree. The temperate zone monsoon subhumid climate brings four distinctive seasons, abundant sunlight, 11.3˚C average annual temperature, 754 mm annual precipitation, 2690 h total sunshine hours and 207 d annual frost-free period. The terrain is mainly low mountains and hills and agrotype is mainly brown soil  .
2.2. Sample Collection and Spectrum Determination
The test was carried out on September 22, 2014 in Qixia City and the tested apple cultivar is Red Fuji apple. Depending upon the distribution characteristics of orchards, 105 apple trees of 32 orchards in 14 towns were selected as research objects. The determining instrument is Field Spec 4 portable field spectrometer produced by ASD Inc. Wave band scope of this instrument is 350 - 2500 nm, of which, the sampling interval of 350 - 1000 nm is 1.4 nm and spectral resolution is 3 nm; the sampling interval of 1000 - 2500 nm is 2 nm and the spectrum resolution is 8 nm. When the weather was sunny, no-cloud, no-wind or of breeze, the reflectivity of canopy was measured at 10:00 - 14:00. A 5 m external optical fiber was adopted in the measurement. The optical fiber field angle was 25˚ and the probe faced downward vertically. The selection of appropriate determination height was based on the size of the canopy so as to guarantee the whole canopy was in the field of view. Each sample was measured for 10 times and their average value was taken as the value of hyperspectral reflectivity. Standard white plate correction was required every 15 minutes to ensure the measurement accuracy  .
2.3. Apple Tree Crown Canopy Nitrogen Determination
Twelve leaves were collected at each canopy and putted into freshness protection package when measuring. These leaves could be kept in cold and dark place and brought back to laboratory. De-enzyme was conducted for 20 minutes at the dryer with the temperature of 80˚C, then the drying process with 60˚C was conducted until constant weight was achieved. The dried sample was grinded into powder with mortar and selected by a screen of 0.25 mm. The nitrogen content was determined using the Kjeldahl method.
2.4. Spectral Data Processing
The spectrum showed obvious fluctuation at 1350 nm, 1850 nm and 2500 nm due to the influence of the instrument itself and external environment, especially the moisture in atmosphere. Previous research has found that wave bands highly correlated with nitrogen content are centralized at visible light area. Therefore, the apple tree canopy spectrum of 350 - 1000 nm was selected as the research spectral scope based on the wave band characteristic of Field Spec 4 spectrometer to eliminate the interference of external factors such as moisture, etc.
To verify that MSC has effect on the improvement of nitrogen spectral quantitative inversion accuracy, the original spectral reflectivity went through first derivative and multiple scattering correction plus first derivative respectively. The MSC algorithm is as following:
Calculate the average spectrum:
Conduct linear regression on spectrum and average spectrum of each sample:
Calculate the corrected spectrum:
Of which, i = 1, 2, …, n and n is the number of samples; j means the No. j of wavelength.
The correlation coefficients were calculated between each wavelength and nitrogen content. The sensitive wavelengths were selected via testing significance to establish the multiple nonlinear regression models through SVM. The effect of MSC was evaluated through analyzing the correlation coefficient (r) between spectral data after different pre-processing and nitrogen content and studying the determination coefficient (R2) and root mean square error (RMSE) of the model. The model with the highest R2 and lowest RMSE was considered the best.
3. Results and Analysis
3.1. The Contrastive Analysis of the Original Spectrum and That after MSC Pre-Processing
The original reflectance spectral curve was obtained at 350 - 1000 nm (Figure 1). It can be found that the absolute intensity greatly differed from one another and obvious baseline translation existed at the spectral interval. This was mainly influenced by different canopy structure, light condition, etc. The curve of spectral reflectance characteristics after MSC processing was as shown in Figure 2. The absolute intensity difference reduced significantly, the spectrum was more centralized and the spectral reflection characteristic was more obvious at the same time.
3.2. Correlation Analysis between Spectral Reflectivity and Nitrogen Content
The correlation coefficient at each wavelength between the nitrogen content and the raw spectral value and MSC spectral reflectivity, was calculated, respectively (Figure 3). It can be seen from Figure 3 that the correlation coefficients were improved significantly after MSC. At 660 nm - 683 nm, MSC spectra showed significant correlation
Figure 1. Original spectral reflectance characteristics with different nitrogen concentrations.
Figure 2. Spectral reflectance characteristics after MSC.
Figure 3. Correlation analysis between spectral reflectivity and the nitrogen content.
with apple tree canopy nitrogen content (P < 0.01).
To further improve the correlation and easier select the sensitive waveband, first derivative processing was applied to the original spectrum and MSC spectrum respectively. As was shown in Figure 4 and Figure 5, the correlation coefficient between FD spectrum and MSC + FD spectrum and nitrogen content showed significant improvement. Meanwhile, the FD correlation coefficients were higher than the MSC + FD correlation coefficients. Of which, 7 wavelengths in FD correlation coefficients reached significant correlation (Figure 4); 12 wavelengths in correlation coefficients in MSC + FD correlation coefficients reached significant correlation (Figure 5).
Table 1 showed the wavelengths with significant correlation between FD spectrum and MSC + FD spectrum and apple tree canopy nitrogen content. It can be seen from
Figure 4. Correlation analysis between FD spectral reflectivity and the nitrogen content.
Figure 5. Correlation analysis between MSC + FD spectral reflectivity and the nitrogen content.
Table 1 showed that the FD 7 wavelengths of significant correlation all showed the same in MSC + FD. Moreover, the correlation coefficients of the FD 7 wavelengths were higher than MSC + FD.
3.3. Establishment and Validation of the Prediction Model
To compare the prediction accuracy, the same 7 wavelengths were taken as independent variables and apple tree canopy nitrogen contents were taken as dependent variables to the prediction model through SVM method. In the 105 samples, 70 were randomly selected as calibration set and 35 were as prediction set. Samples in calibration set conformed to normal distribution.
SVM parameters were determined after repeated training: penalty coefficient was 1
Table 1. The wavelengths with significant correlation.
**P < 0.01.
and kernel function parameter was 0.6. The modeling and prediction results were as shown in Table 2. The with MSC + FD processing compared with FD has risen to 0.746 from 0.624, and has risen from 0.535 to 0.720. The RMSEv has declined from 0.936 g・kg−1 to 0.756 g・kg−1. Therefore, the model based on MSC + FD processing can predict apple tree canopy nitrogen content in a better manner. The MSC processing can improve the accuracy of prediction model.
Studying the effect of MSC on the accuracy of apple tree canopy nitrogen content prediction model through comparing correlation between reflectivity under the two conversions and nitrogen content, conclusions were as following: 1) the correlation between apple tree canopy spectral reflectivity and nitrogen content was improved significantly after MSC pre-processing; 2) the SVM model with MSC + FD pre-processing served as the optimal method to predict the nitrogen content (= 0.746, =
Table 2. The accuracy of the evaluation models.
: determination coefficient of calibration;: determination coefficient of validation; RMSEv: root-mean-square error of validation.
0.720, RMSEv = 0.452 g・kg−1). The model achieved a notably high accuracy and provided the scientific management of the nitrogen content; 3) MSC can commendably eliminate scattering error to improve the prediction accuracy of prediction model.
 Huang, H., Wang, W., Peng, Y.K., Wu, J.H., Gao, X.D., Wang, X. and Zhang, J. (2010) Measurement of Chlorophyll Content in Wheat Leaves Using Hyperspectral Scanning. Spectroscopy and Spectral Analysis, 30, 1811-1814.
 Liang, L., Yang, M.H. and Zang, Z. (2010) Determination of Wheat Canopy Nitrogen Content Ratio by Hyperspectral Technology Based on Wavelet Denoising and Support Vector Regression. Transaction of the CSAE, 26, 248-253.
 Daughtry, C.S.T., Walthall, C.L., Kim, M.S., de Colstount, E.B. and McMurtrey Ⅲ, J.E. (2000) Estimating Corn Leaf Chlorophyll Concentration from Leaf and Canopy Reflectance. Remote Sensing of Environment, 74, 229-239.
 Huang, C.Y., Wang, D.W., Yan, H., Zhang, Y.X., Cao, L.P. and Cheng, C. (2007) Monitoring of Cotton Canopy Chlorophyll Density and Leaf Nitrogen Accumulation Status by Using Hyperspectral Data. Acta Agronomica Sinica, 33, 931-936.
 Centner, V., Verdu-Andres, J., Walczak, B., Jouan-Rimbaud, D., Despagne, F., Pasti, L., Massart, D.-L. and De Noord, O.E. (2000) Comparison of Multivariate Calibration Techniques Applied to Experimental NIR Data Sets. Applied Spectroscopy, 54, 608-623.
 Wang, K., Chi, G.Y., Lau, R. and Chen, T. (2011) Multivariate Calibration of Near Infrared Spectroscopy in the Presence of Light Scattering Effect: A Comparative Study. Analytical Letters, 44, 824-836.
 Xiong, Z.X., Wu, Z.C., Chen, Z.X. and Hu, M.Y. (2007) Application of MSC to the Improvement of Analyzing Accuracy of Fatty Acid in Rapeseed by Near-Infrared Spectroscopy. Chinese Journal of Spectroscopy Laboratory, 24, 953-958.
 Geladi, P., Macdougall, D. and Martens, H. (1985) Linearization and Scatter Correction for Near Infrared Reflectance Spectra of Meat. Applied Spectroscopy, 39, 491-500.
 Estienne, F., Despagne, F., Walczak, B., de Noord, O.E. and Massart, D.L. (2004) A Comparison of Multivariate Calibration Techniques Applied to Experimental NIR Data Sets: Part III: Robustness against Instrumental Perturbation Conditions. Chemometrics and Intelligent Laboratory Systems, 73, 207-218.
 Wang, L., Zhao, G.X., Zhu, X.C., Lei, T. and Dong, F. (2010) Quantitative Models between Canopy Hyperspectrum and Its Component Features at Apple Tree Prosperous Fruit Stage. Spectroscopy and Spectral Analysis, 30, 2719-2723.