Support vector machine (SVM) is a popular classifier known for accuracy,...
The Internet of Things (IoT) system generates massive high-speed tempora...
Subsampling methods aim to select a subsample as a surrogate for the obs...
Dataset shift is common in credit scoring scenarios, and the inconsisten...
Smoothing splines have been used pervasively in nonparametric regression...
With the rapid development of quantum computers, quantum algorithms have...
This paper studies the estimation of large-scale optimal transport maps
...
We consider a measurement constrained supervised learning problem, that ...
Sufficient dimension reduction is used pervasively as a supervised dimen...
Optimal transport has been one of the most exciting subjects in mathemat...
Large samples have been generated routinely from various sources. Classi...
We consider the problem of approximating smoothing spline estimators in ...
The statistical analysis of Randomized Numerical Linear Algebra (RandNLA...
We consider the problem of comparing probability densities between two
g...
Testing the hypothesis of parallelism is a fundamental statistical probl...
Many scientific studies collect data where the response and predictor
va...
For massive data, the family of subsampling algorithms is popular to dow...
A significant hurdle for analyzing large sample data is the lack of effe...
One popular method for dealing with large-scale data sets is sampling. F...