MapReduce with Deltas
- The MapReduce programming model is extended slightly in order to use deltas. Because many MapReduce jobs are being re-executed over slightly changing input, processing only those changes promises significant improvements. Reduced execution time allows for more frequent execution of tasks, yielding more up-to-date results in practical applications. In the context of compound MapReduce jobs, benefits even add up over the individual jobs, as each job gains from processing less input data. The individual steps necessary in working with deltas are being analyzed and examined for efficiency. Several use cases have been implemented and tested on top of Hadoop. The correctness of the extended programming model relies on a simple correctness criterion.
Verfasserangaben: | David Saile |
---|---|
URN: | urn:nbn:de:kola-5479 |
Betreuer: | Ralf Lämmel |
Dokumentart: | Diplomarbeit |
Sprache: | Englisch |
Datum der Fertigstellung: | 10.08.2011 |
Datum der Veröffentlichung: | 10.08.2011 |
Veröffentlichende Institution: | Universität Koblenz-Landau, Campus Koblenz, Universitätsbibliothek |
Titel verleihende Institution: | Universität Koblenz, Fachbereich 4 |
Datum der Freischaltung: | 10.08.2011 |
Freies Schlagwort / Tag: | Delta; Incremental Algorithms; MapReduce |
Institute: | Fachbereich 4 / Fachbereich 4 |
DDC-Klassifikation: | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik |
Lizenz (Deutsch): | Es gilt das deutsche Urheberrecht: § 53 UrhG |