|
|
|
Hadoop++

Nowadays, working over very large data sets (Petabytes of information) is a common reality for several enterprises. In this context, query processing is a big challenge and becomes crucial. The Apache Hadoop project has been adopted by many famous companies to query their Petabytes of information. Some examples of such enterprises are Yahoo! and Facebook. Recently, some researchers from the database community indicated that Hadoop may suffer from performance issues when running analytical queries. We believe this is not an inherent problem of the MapReduce paradigm but rather some implementation choices done in Hadoop. Therefore, the overall goal of Hadoop++ project is to improve Hadoop's performance for analytical queries. Already, our preliminary results show an improvement of Hadoop++ over Hadoop by up to a factor 20. In addition, we are currently investigating the impact of a number of other optimizations techniques.
Current Team
- Prof. Jens Dittrich
- Dr. Jorge Quiane
- Jörg Schad
- Alekh Jindal
- Stefan Schuh
- Stefan Richter
Publications
-
Jens Dittrich, Jorge-Arnulfo Quiane-Ruiz, Stefan Richter, Stefan Schuh, Alekh Jindal, Jörg Schad
Only Aggressive Elephants are Fast Elephants
VLDB 2012/PVLDB, Istanbul, Turkey.
-
Alekh Jindal, Jorge-Arnulfo Quiane-Ruiz, Jens Dittrich
Trojan Data Layouts: Right Shoes for a Running Elephant
ACM SOCC 2011, Cascais, Portugal.
-
Jorge-Arnulfo Quiane-Ruiz, Christoph Pinkel, Jörg Schad, Jens Dittrich
RAFT at Work: Speeding-Up MapReduce Applications under Task and Node Failures
SIGMOD 2011, Athens. (Demo paper) poster
-
Jorge-Arnulfo Quiane-Ruiz, Christoph Pinkel, Jörg Schad, Jens Dittrich
RAFTing MapReduce: Fast Recovery on the Raft
ICDE 2011, Hannover. TR
-
Jörg Schad
Flying Yellow Elephant: Predictable and Efficient MapReduce in the Cloud
VLDB 2010 PhD Workshop, Singapore.
-
Jens Dittrich, Jorge-Arnulfo Quiane-Ruiz, Alekh Jindal, Yagiz Kargin, Vinay Setty, and Jörg Schad
Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing)
VLDB 2010/PVLDB, Singapore. correction talk
-
Jörg Schad, Jens Dittrich, and Jorge-Arnulfo Quiane-Ruiz
Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance
VLDB 2010/PVLDB, Singapore. talk
|