Download presentation
Presentation is loading. Please wait.
Published byHolly Bishop Modified over 6 years ago
1
Big Data Machine Learning using Apache Spark MLlib
Mehdi Assefi , Ehsun Behravesh , Guangchi Liu , and Ahmad P. Tafti
2
Motivation Big Data World! Applications Challenges
healthcare informatics genomic data analysis text mining stochastic modeling Challenges Cost Time
3
Major Libraries
4
Major Libraries Apache Spark StreamingEnhanced situational awareness,
Apache Spark SQL, Spark GraphX, Apache Spark MLlib ,
5
Apache Spark MLlib platform independent open-source libraries distributed architecture and automatic data parallelization
6
Functions Regression dimension reduction Classification Clustering
rule extraction
7
Pathway
8
Experimental Evaluation
Datasets VMWARE Cluster environment Machine Learning Algorithms
11
Results
18
Conclusion
19
Questions?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.