Download presentation
Presentation is loading. Please wait.
1
The experiments based on CNN
Raymond ZHAO WENLONG
2
Content Summary about Dataset
The experiments based on CNN (on Amazon dataset)
3
Dataset from HP ~1k reviews for 58 laptops, and ~78k words totally
An average of ~16 reviews per laptop An average of ~82 words per review See the data: hp_data.md
4
Dataset from flipkart (in India)
~32k reviews for 408 laptops, and ~1.17M words totally Average ~79 reviews per laptop Average ~36 words per review See the data: flikkart_data.md
5
Dataset from Amazon ~7.2k reviews for 116 laptops, and ~521k words totally ~60 words in each review See the data: amazon_laptop.json
6
Summary about Dataset From Amazon, ~32k reviews for 408 laptops, and ~1.17M words totally From HP, ~1k reviews for 58 laptops, and ~78k words totally From flipkart, ~7.2k reviews for 116 laptops, and ~521k words totally ~40.2k reviews for ~ ( =) 582 laptops (duplication) ~1.67M words totally
7
Based on CNN See the source code: text_cnn.py
(some borrows from cnn.py)
8
The experiments based on CNN (on Amazon Dataset)
A bit better than SWEM-con Alg
9
TODO The experiments on all datasets RNN + LSTM ?
10
Thanks
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.