Sample Projects
Hadoop-based Real World Applications Do you have any large scale data processing problem in your work or research? Can you formalize your data analysis problem? Can you use Hadoop to scale the computation? Make sure you can finish it in about one month.
Algorithm Design and Programming Project Implement a Hadoop-based Frequent itemset mining Apriori Eclat FP-tree MapReduce PLSA (EM-algorithm) Hadoop based Subgraph Matching
Analysis of Hadoop and hadoop-based system Hadoop, Giraph, Mahout… Open Source Projects Lack of good design documentation Many inefficiency bottleneck Read and analyze one of open source projects