Download presentation
Presentation is loading. Please wait.
1
Map/Reduce Discussion
Sara Javanmardi
2
To do Check all grades on EEE and make sure to have grade for
Submitted Assignment Taken Quizzes Demo for Ranking Assignment Today after discussion session
3
Business Major Those interested in extra credit points can write a blog related to large scale data analysis in the market, from the business aspect. Example
4
Step 1 Sample: WikiNgram-Skeleton.zip
The small test cluster is online at It contains small samples of both datasets (/data/ folder on hdfs)
5
To do Preprocessing Wiki markup text to transform it to raw text (done for you) Complete the Mapper and Reducer classes in the sample code Tokenize the text Keep track of ngram frequencies (n<=3)
6
Step 2 Connect to the small test cluster and test your code using your user account info This cluster is on till March 9th
7
Step 3 Connect to the big cluster and run your finalized code using your user account info This cluster is on till deadline
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.