Abstract Gaussian process regression (GPR) has been shown to be a powerful and effective nonparametric method for regression, classification and interpolation, due to many of its desirable properties. However, most GPR models consider univariate or multivariate covariates only. In this paper we extend the GPR models to cases where the covariates include both functional and multivariate variables and the response is multidimensional. The model naturally incorporates two different types of covariates: multivariate and functional, and the principal component analysis is used to de-correlate the multivariate response which avoids the widely recognised difficulty in the multi-output GPR models of formulating covariance functions which have to describe the correlations not only between data points but also between responses. The usefulness of the proposed method is demonstrated through a simulated example and two real data sets in chemometrics.
Variable selection is a highly effective approach to improve model performance when analyzing and modeling data from near-infrared spectroscopy. Moving window partial least squares (MWPLS), a well-known and classical wavelength interval selection algorithm, was widely used in the field of near-infrared spectroscopy. However, it has gradually faded from the spotlight due to its limitations. In this study, a modified MWPLS method called interval interaction moving window partial least squares (iMWPLS), was proposed to improve some limitations of MWPLS, including the lack of consideration for the combination effects of variables, the tendency to select redundant wavelength intervals, and the need for artificial intervention. By utilizing the framework of MWPLS, it could iteratively search, test, and add the intervals with the local minimum errors to the set of basic intervals. The method has high sensitivity to the combination effects of variables and enables accurate selection of intervals. Three NIR datasets, including corn starch, pharmaceutical tablets, and soil dataset were tested to evaluate the performance of iMWPLS. The results show that iMWPLS has better performance than other seven wavelength selection methods, including MWPLS, SEPA-MWPLS, fiPLS, biPLS, siPLS, iVISSA, and iRF.