A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selectionby: Ron Kohavi
(1995), pp. 1137-1145.
|
Reviews
[Write a review of this article]
There are no reviews of this article
Notes for this articleProposition 1: Given a dataset and an inducer, if the inducer is stable under the perturbations caused by deleting the instances for the folds in k-fold cross-validation, the cross-validation estimation will be unbiased and the variance of the estimated accuracy will be approximately acc_cv (1-acc_cv)/n, where n is the number of instances in the dataset.
Corollary 2: Given a dataset and an inducer, if the inducer is stable under the perturbations caused by deleting the test instances for the folds in k-fold cross-validation for various values of k, then the variance of the estimates will be the same.
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
AbstractWe review accuracy estimation methods and compare the two most common methods: crossvalidation and bootstrap. Recent experimental results on artificial data and theoretical results in restricted settings have shown that for selecting a good classifier from a set of classifiers (model selection), ten-fold cross-validation may be better than the more expensive leaveone -out cross-validation. We report on a largescale experiment---over half a million runs of C4.5 and a Naive-Bayes algorithm---to...
BibTeX record
RIS record