HappyAImen WANG, Chenghui SHEN, Kairan WU, Shukun

Presentation transcript:

HappyAImen
WANG, Chenghui 20552633
SHEN, Kairan 20552932
WU, Shukun 20549545
XIAO, Yuxiang 20551433
LAM, Hiufung 20476671

Content
1. Project Progress
· General review
· Introduction of sentiment analysis
· Code display
2. Final Deliverable
3. Synthesis and Suggestion for further study

General review
· 1st Update Meeting: Research on the three selected approaches was conducted:
1. SenseTime AI Summit;
2. Potential policies with an impact on SenseTime;
3. Current clients of SenseTime.
Challenge: limited publicly available information on the target companies and their clients.
· 2nd Update Meeting: Web scraping for news about SenseTime and tracking the company's project progress.
Challenge: public information is not sufficient to determine whether a project is still in progress or already closed. Nevertheless, web scraping is still a useful tool for data collection and could be the main tool in the project.

General review
What we want to do: Find out whether or not the leading AI startup SenseTime has a promotion bubble.
The way we do it: Sentiment analysis (the automated process of understanding an opinion about a given subject from written or spoken language).

General review
Steps:
1. Find all the available information about SenseTime. We use a web crawler to quickly capture the information we want from a specific website.
2. Feed the collected data into the sentiment analysis framework to determine the probability that each text is positive or negative. We use the Tencent Cloud service, i.e. the sentiment analysis API of the Tencent AI open platform, and put some effort into connecting to this API.

General review
Sentiment analysis: brief introduction
· A field within Natural Language Processing (NLP).
· Identifies and extracts opinions within text.
· Usually uses machine learning techniques.
· Modeled as a classification problem, where a classifier is fed a text and returns the corresponding category, e.g. positive, negative, or neutral.
See the final report for more specific information.
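To make the "text in, category out" framing concrete, here is a minimal sketch of a classifier with that interface. It is a toy word-list approach, not a machine learning model and not the Tencent API used in the project; the word lists and the function name `classify` are hypothetical and chosen only for illustration.

```python
# Toy lexicon-based classifier: illustrative only.
# The word lists are hypothetical; this is NOT a trained model or the Tencent API.
POSITIVE = {"good", "great", "strong", "leading", "success"}
NEGATIVE = {"bad", "weak", "risk", "loss", "bubble"}


def classify(text: str) -> str:
    """Return 'positive', 'negative', or 'neutral' for a piece of text."""
    # Lowercase each word and strip common trailing punctuation.
    words = [w.strip(".,!?;:").lower() for w in text.split()]
    # Count positive hits minus negative hits.
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

A real system would replace the word lists with a trained model, but the calling code would look the same: one text goes in, one label (or a probability) comes out.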

Code display
1. Crawl the news URLs using the Python requests library.
· Delete irrelevant news (news whose title does not contain the company name).
· Delete repeated news.
2. Extract the content of each news article from its URL using the re and BeautifulSoup libraries.
3. Use the Tencent API service to perform the sentiment analysis.
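The filtering and extraction steps can be sketched as follows. This is a minimal sketch under stated assumptions, not the project's notebook code: it works on already-downloaded data (no network calls), it treats a repeated URL as repeated news, and it swaps in the standard-library `html.parser` for BeautifulSoup so the example has no third-party dependencies. The names `filter_news`, `ParagraphText`, and `extract_text` are hypothetical.

```python
from html.parser import HTMLParser


def filter_news(items, company):
    """Keep (title, url) pairs whose title mentions the company; drop repeated URLs."""
    seen = set()
    kept = []
    for title, url in items:
        if company.lower() not in title.lower():
            continue  # irrelevant news: title lacks the company name
        if url in seen:
            continue  # repeated news: URL already collected
        seen.add(url)
        kept.append((title, url))
    return kept


class ParagraphText(HTMLParser):
    """Collect the text that appears inside <p> elements."""

    def __init__(self):
        super().__init__()
        self.depth = 0   # how many <p> elements are currently open
        self.parts = []  # raw text fragments

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self.depth:
            self.depth -= 1
            self.parts.append(" ")  # keep paragraphs separated

    def handle_data(self, data):
        if self.depth:
            self.parts.append(data)


def extract_text(html):
    """Return the paragraph text of an HTML page with whitespace normalized."""
    parser = ParagraphText()
    parser.feed(html)
    parser.close()
    return " ".join("".join(parser.parts).split())
```

The text returned by `extract_text` is what would then be sent to the sentiment analysis service.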

Code display
Leifeng: https://colab.research.google.com/drive/1NrkMyd2OMvG3dUjAI5-9UnYl1pCsUM8n
Xinhua: https://colab.research.google.com/drive/1WKOErdQjYrJsrmWUe9EvJLVMg1aDIa7S
Renmin: https://colab.research.google.com/drive/1pbART3rZhYBgL2-dTanTtesNXlE_H7Vv
PEdaily: https://colab.research.google.com/drive/1agCrF6HED4wWSTxajDnczR-GyKG3SpFL

Final Deliverable
Number of articles collected (columns: SenseTime, Malong, CloudWalk, Megvii, YITU)
Xinhua 29 10 15 8
Leifeng 39 7 25 35 5
PEdaily 18 2 3 9
Renmin 27 6
5 companies: SenseTime, Malong, CloudWalk, Megvii and YITU
4 websites: Xinhua, Leifeng, Renmin and PEdaily
320 articles

Final Deliverable
          SenseTime   Malong      CloudWalk   Megvii      YITU
Xinhua    0.6898356   0.6756691   0.6692568   0.6860859   0.7050765
Leifeng   0.6809414   0.6866832   0.6401622   0.6496635   0.6983780
PEdaily   0.7063379   0.6794848   0.7133881   0.6945139   0.6664504
Renmin    0.6794342   0.6891639   0.6462130   0.6773417   0.6693506
· Based on the crawling and the text sentiment analysis, we calculated the average score for each company on each website.
· The sentiment score of an article is its positive score.

Final Deliverable
The five companies have very close scores. SenseTime has the highest score, YITU comes second and Malong comes third. Media coverage of these three companies is very positive.
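The ranking can be checked from the per-website averages in the score table. The sketch below assumes an unweighted mean over the four websites; the slides do not say whether the overall figure was weighted by the number of articles, so treat this as one plausible way to reproduce it rather than the project's exact calculation.

```python
from statistics import mean

COMPANIES = ["SenseTime", "Malong", "CloudWalk", "Megvii", "YITU"]

# Average positive score per website (rows) and company (columns), from the table.
SCORES = {
    "Xinhua":  [0.6898356, 0.6756691, 0.6692568, 0.6860859, 0.7050765],
    "Leifeng": [0.6809414, 0.6866832, 0.6401622, 0.6496635, 0.6983780],
    "PEdaily": [0.7063379, 0.6794848, 0.7133881, 0.6945139, 0.6664504],
    "Renmin":  [0.6794342, 0.6891639, 0.6462130, 0.6773417, 0.6693506],
}

# Assumption: unweighted mean across websites for each company.
averages = {
    company: mean(row[i] for row in SCORES.values())
    for i, company in enumerate(COMPANIES)
}

# Companies ordered from highest to lowest average score.
ranking = sorted(averages, key=averages.get, reverse=True)

for company in ranking:
    print(f"{company:10s} {averages[company]:.4f}")
```

Under this assumption the top three are SenseTime, YITU and Malong, which matches the order stated on the slide.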

Final Deliverable
SenseTime and Malong have balanced scores across the four websites. However, CloudWalk, Megvii and YITU seem to be perceived differently on these four websites.
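One simple way to quantify how "balanced" a company's scores are is the spread between its highest and lowest website average. This is a sketch using the range (max minus min) as the measure, which is an assumption; the slides do not state which measure was used.

```python
COMPANIES = ["SenseTime", "Malong", "CloudWalk", "Megvii", "YITU"]

# Average positive score per website (rows) and company (columns), from the table.
SCORES = {
    "Xinhua":  [0.6898356, 0.6756691, 0.6692568, 0.6860859, 0.7050765],
    "Leifeng": [0.6809414, 0.6866832, 0.6401622, 0.6496635, 0.6983780],
    "PEdaily": [0.7063379, 0.6794848, 0.7133881, 0.6945139, 0.6664504],
    "Renmin":  [0.6794342, 0.6891639, 0.6462130, 0.6773417, 0.6693506],
}

# Spread (max - min) of each company's score across the four websites.
spreads = {}
for i, company in enumerate(COMPANIES):
    column = [row[i] for row in SCORES.values()]
    spreads[company] = max(column) - min(column)

# Companies ordered from most balanced (smallest spread) to least balanced.
by_balance = sorted(spreads, key=spreads.get)

for company in by_balance:
    print(f"{company:10s} {spreads[company]:.4f}")
```

By this measure Malong and SenseTime have the smallest spreads and CloudWalk the largest.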

Final Deliverable
To analyze SenseTime, we plotted the scores of its articles. The number of articles on SenseTime is 114. The scores of these 114 articles are concentrated in the range [0.6348, 0.7428].

Synthesis and Suggestion for further study
· Collect public opinions and comments from social media such as Sina Weibo, Twitter and Zhihu, then perform sentiment analysis on these texts and weight the text information by source to get a more objective result.
· For a specific company, perform sentiment analysis on texts from different time periods to evaluate how the public's view of the company changes during its development.
· The Tencent AI open platform sentiment analysis API was used for the sentiment analysis in this project. A tailor-made sentiment analysis model could be employed in further study.
· Other ideas: bubble model, unsupervised deep learning methods for clustering.
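The suggestion to weight text from different sources could be sketched as a weighted average of per-source sentiment scores. The source names, scores, and weights below are made up for illustration; how the weights should actually be chosen is left open by the slides.

```python
def weighted_sentiment(scores, weights):
    """Combine per-source sentiment scores into one weighted average.

    scores:  mapping of source name -> average positive score for that source
    weights: mapping of source name -> non-negative weight for that source
    """
    total = sum(weights[source] for source in scores)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(scores[source] * weights[source] for source in scores) / total


# Hypothetical example values: news coverage and social media comments.
example_scores = {"news": 0.70, "weibo": 0.50}
example_weights = {"news": 1.0, "weibo": 3.0}
combined = weighted_sentiment(example_scores, example_weights)
```

In this example the social media source counts three times as much as news, so the combined score is pulled toward 0.50.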

Thank you HappyAImen