<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF
   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
   xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
   xmlns="http://purl.org/rss/1.0/"
   xmlns:dc="http://purl.org/dc/elements/1.1/"
   xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/"
   xmlns:dcterms="http://purl.org/dc/terms/"

>
<channel rdf:about="http://www.citeulike.org/about">
<pubDate>Sat, 05 Jul 2008 00:22:36 BST</pubDate>


	<title>CiteULike: AbnerCYH sys_performance</title>
	<description>CiteULike: AbnerCYH sys_performance</description>


	<link>http://www.citeulike.org/user/AbnerCYH/tag/sys_performance</link>
	<dc:publisher>CiteULike.org</dc:publisher>
	<dc:language>en-gb</dc:language>
	<dc:rights>Copyright &#169; 2004-2008 citeulike.org</dc:rights>
	<items>
    <rdf:Seq>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2945162"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2944739"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2929443"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2810208"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2624181"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2609109"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2548178"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2269937"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/AbnerCYH/article/2242896"/>

	</rdf:Seq>
	</items>
	</channel>


<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2945162">
    <title>Analysis of Linear Time Sorting Algorithms</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2945162</link>
    <description>&lt;i&gt;The Computer Journal, Vol. 51, No. 4. (1 July 2008), pp. 451-469.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;We derive CPU time formulae for the two simplest linear time-sorting algorithms, linear probing sort and bucket sort, as a function of the load factor, and show agreement with experimentally measured CPU times. This allows us to compute optimal load factors for each algorithm, whose values have previously been identified only approximately in the literature. We also present a simple model of cache latency and apply it not only to linear probing sort and bucket sort, where the bulk of the latency is due to random access, but also to the log-linear algorithm quicksort, where the access is primarily sequential, and again show agreement with experimental CPU times. With minor modifications, our model also fits CPU times previously reported by LaMarca and Ladner for radix sort, and by Rahman and Raman for most significant digit radix sort, Flashsort1, and memory tuned quicksort. 10.1093/comjnl/bxm097</description>
    <dc:title>Analysis of Linear Time Sorting Algorithms</dc:title>

    <dc:creator>Paul Shutler</dc:creator>
    <dc:creator>Seok Sim</dc:creator>
    <dc:creator>Wei Lim</dc:creator>
    <dc:identifier>doi:10.1093/comjnl/bxm097</dc:identifier>
    <dc:source>The Computer Journal, Vol. 51, No. 4. (1 July 2008), pp. 451-469.</dc:source>
    <dc:date>2008-06-30T15:03:52-00:00</dc:date>
    <prism:publicationYear>2008</prism:publicationYear>
    <prism:publicationName>The Computer Journal</prism:publicationName>
    <prism:volume>51</prism:volume>
    <prism:number>4</prism:number>
    <prism:startingPage>451</prism:startingPage>
    <prism:endingPage>469</prism:endingPage>
    <prism:category>algorithms</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2944739">
    <title>An Empirical Comparison of Three Boosting Algorithms on Real Data Sets with Artificial Class Noise</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2944739</link>
    <description>&lt;i&gt;Multiple Classifier Systems (2003), pp. 161-161.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Boosting algorithms are a means of building a strong ensemble classifier by aggregating a sequence of weak hypotheses. In this paper we consider three of the best-known boosting algorithms: Adaboost [9], Logitboost [11] and Brownboost [8]. These algorithms are adaptive, and work by maintaining a set of example and class weights which focus the attention of a base learner on the examples that are hardest to classify. We conduct an empirical study to compare the performance of these algorithms, measured in terms of overall test error rate, on five real data sets. The tests consist of a series of cross-validatory samples. At each validation, we set aside one third of the data chosen at random as a test set, and fit the boosting algorithm to the remaining two thirds, using binary stumps as a base learner. At each stage we record the final training and test error rates, and report the average errors within a 95% confidence interval. We then add artificial class noise to our data sets by randomly reassigning 20% of class labels, and repeat our experiment. We find that Brownboost and Logitboost prove less likely than Adaboost to overfit in this circumstance.</description>
    <dc:title>An Empirical Comparison of Three Boosting Algorithms on Real Data Sets with Artificial Class Noise</dc:title>

    <dc:creator>Ross Mcdonald</dc:creator>
    <dc:creator>David Hand</dc:creator>
    <dc:creator>Idris Eckley</dc:creator>
    <dc:identifier>doi:10.1007/3-540-44938-8_4</dc:identifier>
    <dc:source>Multiple Classifier Systems (2003), pp. 161-161.</dc:source>
    <dc:date>2008-06-30T14:15:47-00:00</dc:date>
    <prism:publicationYear>2003</prism:publicationYear>
    <prism:publicationName>Multiple Classifier Systems</prism:publicationName>
    <prism:startingPage>161</prism:startingPage>
    <prism:endingPage>161</prism:endingPage>
    <prism:category>algorithms</prism:category>
    <prism:category>kdd</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2929443">
    <title>An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2929443</link>
    <description>&lt;i&gt;Machine Learning, Vol. 40, No. 2. (2000), pp. 139-157.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Bagging and boosting are methods that generate a diverse ensemble of classifiers by manipulating the training data given to a “base” learning algorithm. Breiman has pointed out that they rely for their effectiveness on the instability of the base learning algorithm. An alternative approach to generating an ensemble is to randomize the internal decisions made by the base algorithm. This general approach has been studied previously by Ali and Pazzani and by Dietterich and Kong. This paper compares the effectiveness of randomization, bagging, and boosting for improving the performance of the decision-tree algorithm C4.5. The experiments show that in situations with little or no classification noise, randomization is competitive with (and perhaps slightly superior to) bagging but not as accurate as boosting. In situations with substantial classification noise, bagging is much better than boosting, and sometimes better than randomization.</description>
    <dc:title>An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization</dc:title>

    <dc:creator>Thomas Dietterich</dc:creator>
    <dc:identifier>doi:10.1023/A:1007607513941</dc:identifier>
    <dc:source>Machine Learning, Vol. 40, No. 2. (2000), pp. 139-157.</dc:source>
    <dc:date>2008-06-26T09:40:27-00:00</dc:date>
    <prism:publicationYear>2000</prism:publicationYear>
    <prism:publicationName>Machine Learning</prism:publicationName>
    <prism:volume>40</prism:volume>
    <prism:number>2</prism:number>
    <prism:startingPage>139</prism:startingPage>
    <prism:endingPage>157</prism:endingPage>
    <prism:category>algorithms</prism:category>
    <prism:category>kdd</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2810208">
    <title>Path Diversity over Packet Switched Networks: Performance Analysis and Rate Allocation</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2810208</link>
    <description>&lt;i&gt;(14 May 2008)&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Path diversity works by setting up multiple parallel connections between the end points using the topological path redundancy of the network. In this paper, <i>Forward Error Correction</i> (FEC) is applied across multiple independent paths to enhance the end-to-end reliability. Network paths are modeled as erasure Gilbert-Elliot channels. It is known that over any erasure channel, <i>Maximum Distance Separable</i> (MDS) codes achieve the minimum probability of irrecoverable loss among all block codes of the same size. Based on the adopted model for the error behavior, we prove that the probability of irrecoverable loss for MDS codes decays exponentially for an asymptotically large number of paths. Then, optimal rate allocation problem is solved for the asymptotic case where the number of paths is large. Moreover, it is shown that in such asymptotically optimal rate allocation, each path is assigned a positive rate <i>iff</i> its quality is above a certain threshold. The quality of a path is defined as the percentage of the time it spends in the bad state. Finally, using dynamic programming, a heuristic suboptimal algorithm with polynomial runtime is proposed for rate allocation over a finite number of paths. This algorithm converges to the asymptotically optimal rate allocation when the number of paths is large. The simulation results show that the proposed algorithm approximates the optimal rate allocation (found by exhaustive search) very closely for practical number of paths, and provides significant performance improvement compared to the alternative schemes of rate allocation.</description>
    <dc:title>Path Diversity over Packet Switched Networks: Performance Analysis and Rate Allocation</dc:title>

    <dc:creator>Shervan Fashandi</dc:creator>
    <dc:creator>Shahab Gharan</dc:creator>
    <dc:creator>Amir Khandani</dc:creator>
    <dc:source>(14 May 2008)</dc:source>
    <dc:date>2008-05-18T16:02:36-00:00</dc:date>
    <prism:publicationYear>2008</prism:publicationYear>
    <prism:category>algorithms</prism:category>
    <prism:category>graph</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2624181">
    <title>Structured permuting in place on parallel disk systems</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2624181</link>
    <description>&lt;i&gt;(1996), pp. 128-139.&lt;/i&gt;</description>
    <dc:title>Structured permuting in place on parallel disk systems</dc:title>

    <dc:creator>Leonard Wisniewski</dc:creator>
    <dc:identifier>doi:10.1145/236017.236047</dc:identifier>
    <dc:source>(1996), pp. 128-139.</dc:source>
    <dc:date>2008-04-02T18:18:07-00:00</dc:date>
    <prism:publicationYear>1996</prism:publicationYear>
    <prism:startingPage>128</prism:startingPage>
    <prism:endingPage>139</prism:endingPage>
    <prism:publisher>ACM</prism:publisher>
    <prism:category>algorithms</prism:category>
    <prism:category>parallel</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2609109">
    <title>Algorithm engineering</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2609109</link>
    <description>&lt;i&gt;ACM Comput. Surv. (1999)&lt;/i&gt;</description>
    <dc:title>Algorithm engineering</dc:title>

    <dc:creator>Giuseppe Cattaneo</dc:creator>
    <dc:creator>Giuseppe Italiano</dc:creator>
    <dc:identifier>doi:10.1145/333580.333582</dc:identifier>
    <dc:source>ACM Comput. Surv. (1999)</dc:source>
    <dc:date>2008-03-28T18:31:17-00:00</dc:date>
    <prism:publicationYear>1999</prism:publicationYear>
    <prism:publicationName>ACM Comput. Surv.</prism:publicationName>
    <prism:issn>0360-0300</prism:issn>
    <prism:publisher>ACM</prism:publisher>
    <prism:category>algorithms</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2548178">
    <title>On the design of CGAL a computational geometry algorithms library</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2548178</link>
    <description>&lt;i&gt;Soft\-ware<em>dash Prac\-tice and Experience, Vol. 30, No. 11. (2000), pp. 1167-1202.</em>&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;this paper we present and discuss the design of this C++ software library.</description>
    <dc:title>On the design of CGAL a computational geometry algorithms library</dc:title>

    <dc:creator>Andreas Fabri</dc:creator>
    <dc:creator>Geert Giezeman</dc:creator>
    <dc:creator>Lutz Kettner</dc:creator>
    <dc:creator>Stefan Schirra</dc:creator>
    <dc:creator>Sven Sch&#246;nherr</dc:creator>
    <dc:source>Soft\-ware<em>dash Prac\-tice and Experience, Vol. 30, No. 11. (2000), pp. 1167-1202.</em></dc:source>
    <dc:date>2008-03-18T03:55:27-00:00</dc:date>
    <prism:publicationYear>2000</prism:publicationYear>
    <prism:publicationName>Soft\-ware<em>dash Prac\-tice and Experience</em></prism:publicationName>
    <prism:volume>30</prism:volume>
    <prism:number>11</prism:number>
    <prism:startingPage>1167</prism:startingPage>
    <prism:endingPage>1202</prism:endingPage>
    <prism:category>algorithms</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2269937">
    <title>MapReduce: Simplified Data Processing on Large Clusters</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2269937</link>
    <description>&lt;i&gt;(December 2004)&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key. Many real world tasks are expressible in this model, as shown in the paper. Programs written in this functional style are automatically parallelized and executed on a large cluster of commodity machines. The run-time system takes care of the details of partitioning the input data, scheduling the program's execution across a set of machines, handling machine failures, and managing the required inter-machine communication. This allows programmers without any experience with parallel and distributed systems to easily utilize the resources of a large distributed system. Our implementation of MapReduce runs on a large cluster of commodity machines and is highly scalable: a typical MapReduce computation processes many terabytes of data on thousands of machines. Programmers find the system easy to use: hundreds of MapReduce programs have been implemented and upwards of one thousand MapReduce jobs are executed on Google's clusters every day.</description>
    <dc:title>MapReduce: Simplified Data Processing on Large Clusters</dc:title>

    <dc:creator>Jeffrey Dean</dc:creator>
    <dc:creator>Sanjay Ghemawat</dc:creator>
    <dc:source>(December 2004)</dc:source>
    <dc:date>2008-01-21T20:16:16-00:00</dc:date>
    <prism:publicationYear>2004</prism:publicationYear>
    <prism:category>algorithms</prism:category>
    <prism:category>data_structure</prism:category>
    <prism:category>distributed</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/AbnerCYH/article/2242896">
    <title>Optimizing graph algorithms for improved cache performance</title>
    <link>http://www.citeulike.org/user/AbnerCYH/article/2242896</link>
    <description>&lt;i&gt;Parallel and Distributed Processing Symposium., Proceedings International, IPDPS 2002, Abstracts and CD-ROM (2002), pp. 32-41.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Tiling has long been used to improve cache performance. Recursion hers recently been used as a cache-oblivious method of improving cache performance. Both of these techniques are normally applied to dense linear algebra problems. We develop new implementations by means of these two techniques for the fundamental graph problem of Transitive Closure, namely the Floyd-Warshall Algorithm, and prove their optimality with respect to processor-memory traffic. Using these implementations we show up to 10&#215; improvement in execution time. We also address Dijkstra's algorithm for the single-source shortest path problem and Prim's algorithm for Minimum Spanning Tree, for which neither tiling nor recursion can be directly applied. For these algorithms, we demonstrate up to a 2&#215; improvement by using a cache friendly graph representation. Experimental results are shown for the Pentium Ill, UltraSPARC Ill, Alpha 21264, and MIPS R12000 machines using problem sizes between 1024 and 4096 vertices. We demonstrate improved cache performance using the Simplescalar simulator</description>
    <dc:title>Optimizing graph algorithms for improved cache performance</dc:title>

    <dc:creator>Joon-Sang Park</dc:creator>
    <dc:creator>M Penner</dc:creator>
    <dc:creator>VK Prasanna</dc:creator>
    <dc:identifier>doi:10.1109/IPDPS.2002.1015509</dc:identifier>
    <dc:source>Parallel and Distributed Processing Symposium., Proceedings International, IPDPS 2002, Abstracts and CD-ROM (2002), pp. 32-41.</dc:source>
    <dc:date>2008-01-17T05:02:51-00:00</dc:date>
    <prism:publicationYear>2002</prism:publicationYear>
    <prism:publicationName>Parallel and Distributed Processing Symposium., Proceedings International, IPDPS 2002, Abstracts and CD-ROM</prism:publicationName>
    <prism:startingPage>32</prism:startingPage>
    <prism:endingPage>41</prism:endingPage>
    <prism:category>algorithms</prism:category>
    <prism:category>sys_performance</prism:category>
</item>



</rdf:RDF>

