<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF
   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
   xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
   xmlns="http://purl.org/rss/1.0/"
   xmlns:dc="http://purl.org/dc/elements/1.1/"
   xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/"
   xmlns:dcterms="http://purl.org/dc/terms/"

>
<channel rdf:about="http://www.citeulike.org/about">
<pubDate>Mon, 07 Jul 2008 16:42:29 BST</pubDate>


	<title>CiteULike: neteler boosting</title>
	<description>CiteULike: neteler boosting</description>


	<link>http://www.citeulike.org/user/neteler/tag/boosting</link>
	<dc:publisher>CiteULike.org</dc:publisher>
	<dc:language>en-gb</dc:language>
	<dc:rights>Copyright &#169; 2004-2008 citeulike.org</dc:rights>
	<items>
    <rdf:Seq>
        <rdf:li rdf:resource="http://www.citeulike.org/user/neteler/article/167644"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/neteler/article/784728"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/neteler/article/165116"/>

	</rdf:Seq>
	</items>
	</channel>


<item rdf:about="http://www.citeulike.org/user/neteler/article/167644">
    <title>Boosting the margin: a new explanation for the effectiveness of voting methods</title>
    <link>http://www.citeulike.org/user/neteler/article/167644</link>
    <description>&lt;i&gt;(1997), pp. 322-330.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;One of the surprising recurring phenomena observed in experiments with boosting is that the test error of the generated hypothesis usually does not increase as its size becomes very large, and often is observed to decrease even after the training error reaches zero. In this paper, we show that this phenomenon is related to the distribution of margins of the training examples with respect to the generated voting classification rule, where the margin of an example is simply the difference between the number of correct votes and the maximum number of votes received by any incorrect label. We show that techniques used in the analysis of Vapnik’s support vector classifiers and of neural networks with small weights can be applied to voting methods to relate the margin distribution to the test error. We also show theoretically and experimentally that boosting is especially effective at increasing the margins of the training examples. We argue that this explanation is better than explanations based on the bias-variance decomposition and provides new insight to the behaviour of boosting as well as other voting schemes, such as bagging and ECOC.</description>
    <dc:title>Boosting the margin: a new explanation for the effectiveness of voting methods</dc:title>

    <dc:creator>Robert Schapire</dc:creator>
    <dc:creator>Yoav Freund</dc:creator>
    <dc:creator>Peter Bartlett</dc:creator>
    <dc:creator>Wee Lee</dc:creator>
    <dc:source>(1997), pp. 322-330.</dc:source>
    <dc:date>2005-04-22T18:54:02-00:00</dc:date>
    <prism:publicationYear>1997</prism:publicationYear>
    <prism:startingPage>322</prism:startingPage>
    <prism:endingPage>330</prism:endingPage>
    <prism:publisher>Morgan Kaufmann</prism:publisher>
    <prism:category>boosting</prism:category>
    <prism:category>ensemble</prism:category>
    <prism:category>machine-learning</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/neteler/article/784728">
    <title>Boosting of Tree-based Classifiers for Predictive Risk Modeling in GIS</title>
    <link>http://www.citeulike.org/user/neteler/article/784728</link>
    <description>&lt;i&gt;(2000), pp. 220-229.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Boosting of tree-based classifiers has been interfaced to the Geographical Information System (GIS) GRASS to create predictive classification models from digital maps. On a risk management problem in landscape ecology, the performance of the boosted tree model is better than either with a single classifier or with bagging. This results in an improved digital map of the risk of human exposure to tick-borne diseases in Trentino (Italian Alps) given sampling on 388 sites and the use of several overlaying georeferenced data bases. Margin distributions are compared for bagging and boosting. Boosting is confirmed to give the most accurate model on two additional and independent test sets of reported cases of bites on humans and of infestation measured on roe deer. An interesting feature of combining classification models within a GIS is the visualization through maps of the single elements of the combination: each boosting step map focuses on different details of data distribution. In this problem, the best performance is obtained without controlling tree sizes, which indicates that there is a strong interaction between input variables.</description>
    <dc:title>Boosting of Tree-based Classifiers for Predictive Risk Modeling in GIS</dc:title>

    <dc:creator>C Furlanello</dc:creator>
    <dc:creator>S Merler</dc:creator>
    <dc:source>(2000), pp. 220-229.</dc:source>
    <dc:date>2006-08-03T16:11:22-00:00</dc:date>
    <prism:publicationYear>2000</prism:publicationYear>
    <prism:startingPage>220</prism:startingPage>
    <prism:endingPage>229</prism:endingPage>
    <prism:publisher>Springer</prism:publisher>
    <prism:category>boosting</prism:category>
    <prism:category>disease</prism:category>
    <prism:category>ixodes</prism:category>
    <prism:category>machine-learning</prism:category>
    <prism:category>risk</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/neteler/article/165116">
    <title>Random Forests</title>
    <link>http://www.citeulike.org/user/neteler/article/165116</link>
    <description>&lt;i&gt;Machine Learning, Vol. 45, No. 1. (2001), pp. 5-32.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of ...</description>
    <dc:title>Random Forests</dc:title>

    <dc:creator>Leo Breiman</dc:creator>
    <dc:identifier>doi:10.1023/A:1010933404324</dc:identifier>
    <dc:source>Machine Learning, Vol. 45, No. 1. (2001), pp. 5-32.</dc:source>
    <dc:date>2005-04-19T18:57:17-00:00</dc:date>
    <prism:publicationYear>2001</prism:publicationYear>
    <prism:publicationName>Machine Learning</prism:publicationName>
    <prism:volume>45</prism:volume>
    <prism:number>1</prism:number>
    <prism:startingPage>5</prism:startingPage>
    <prism:endingPage>32</prism:endingPage>
    <prism:publisher>Kluwer Academic Publishers, Boston</prism:publisher>
    <prism:category>boosting</prism:category>
    <prism:category>cart</prism:category>
    <prism:category>classification</prism:category>
    <prism:category>ensemble</prism:category>
    <prism:category>machine-learning</prism:category>
</item>



</rdf:RDF>

