The Team Ernesto Cortes Kipp Dunn Sar Gregorczyk Alex Schmidt

Slides:



Advertisements
Similar presentations
Introduction Lesson 1 Microsoft Office 2010 and the Internet
Advertisements

Microsoft Office Illustrated Fundamentals Unit C: Getting Started with Unit C: Getting Started with Microsoft Office 2010 Microsoft Office 2010.
MS Access.
TS 313 Multimedia Applications Welcome to TS 313 Multimedia Applications There is no audio lecture associated with this set of introduction slides Refer.
Objectives Overview Define an operating system
Pasewark & Pasewark Microsoft Office XP: Introductory Course 1 INTRODUCTION Lesson 1 – Microsoft Office XP Basics and the Internet.
April 28, 2015 Virginia Tech. Data Analytics “Analytics is the combustion engine of business, and it will be necessary for organizations that want to.
Reference and Instruction Automated Statistics Gathering and Reporting System Members: Patrick Chen (pyc7) Soo-Yung Cho (sc444) Gregg Herlacher (gah24)
Cornell University Library Instruction Statistics Reporting System Members: Patrick Chen (pyc7) Soo-Yung Cho (sc444) Gregg Herlacher (gah24) Wilson Muyenzi.
CS155b: E-Commerce Lecture 10: Feb. 13, 2003 XML and its relationship to B2B commerce Acknowledgements: R. Glushko, A. Gregory, and V. Ramachandran.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Web-based Query & Reporting System for Software User Consultant Richard Knowles Rutgers University Electrical & Computer Engineering Mentors: Amy Chen.
Desktop and Mobile Testing Miroslav Shtilianov QA Engineer Automated Testing Team Telerik QA Academy
The Co-op Database Project Who It's For At Northeastern University cooperative education is an integral part of the education experience. There is a continuous.
Access The L Line The Express Line to Learning 2007 L Line L © Wiley Publishing All Rights Reserved.
Web Archives, IDEAL, and PBL Overview Edward A. Fox Digital Library Research Laboratory Dept. of Computer Science Virginia Tech Blacksburg, VA, USA 21.
Section 1: Introducing Group Policy What Is Group Policy? Group Policy Scenarios New Group Policy Features Introduced with Windows Server 2008 and Windows.
Footer Text A Tool for Environmental Scheduling, Accountability and Performance Measurement TxECOS.
The Cluster Computing Project Robert L. Tureman Paul D. Camp Community College.
1 FlexTraining in a Nutshell Welcome to a brief introduction of the FlexTraining Total e- Learning Solution. This short sample course will outline the.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
Topical Categorization of Large Collections of Electronic Theses and Dissertations Venkat Srinivasan & Edward A. Fox Virginia Tech, Blacksburg, VA, USA.
Tweets Metadata May 4, 2015 CS Multimedia, Hypertext and Information Access Department of Computer Science Virginia Polytechnic Institute and State.
Integrate Full-Text Retrieval with Digital Archives System Reporter : Chia-Hao Lee Computer System and Communication Lab, Academia Sinica Institute of.
1 IBM Academic Initiative Introduction for Pamplin School of Business Virginia Tech – October 13, 2011 “IBM Academic Skills Cloud and Computing Education.
11 IMPLEMENTING AND MANAGING SOFTWARE UPDATE SERVICES Chapter 7.
HTML5 based Notification System for Updating E-Training Contents Yu-Doo Kim 1 and Il-Young Moon 1 1 Department of Computer Science Engineering, KoreaTech,
An Investigation into using a Document Management System Presented by: Bijal RanaSupervisor: John Ebden.
Pasewark & Pasewark Microsoft Office 2003: Introductory 1 INTRODUCTION Lesson 1 – Microsoft Office 2003 Basics and the Internet.
Pasewark & Pasewark Microsoft Office 2003: Introductory 1 INTRODUCTION Lesson 1 – Microsoft Office 2003 Basics and the Internet.
Problem Based Learning To Build And Search Tweet And Web Archives Richard Gruss Edward A. Fox Digital Library Research Laboratory Dept. of Computer Science.
Entity Framework Database Connection with ASP Notes from started/getting-started-with-ef-using-mvc/creating-an-
CHAPTER 7 Operating System Copyright © Cengage Learning. All rights reserved.
DISCOVERING COMPUTERS 2018 Digital Technology, Data, and Devices
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
SharePoint Broken Link Manager
Section 13 - Integrating with Third Party Tools
Common Crawl Mining Team: Brian Clarke, Tommy Dean, Ali Pasha, Casey Butenhoff Manager: Don Sanderson (Eastman Chemical Company) Client: Ken Denmark.
Identifying Drug Related Events from Social Media
Background Check Website for R4 OpSec, LLC
Zenodo Data Archive Irtiza Delwar, Michael Culhane, John Sizemore, Gil Turner Client: Dr. Seungwon Yang Instructor: Dr. Edward A. Fox CS 4624 Multimedia,
Migrating Oracle Forms Using Oracle Application Express
An educational system for medical billers in training
Virginia Tech Center for Drug Discovery Website Migration and Redesign
1 2 3 Here we are on the Ohio Web Library’s home page. To get to Business Source Premier, use the following steps: 1. Go to Ohio Web Library 2. Click on.
Thin Clients, RDP and Citrix.
Trail Study Kevin Cianfarini, Shane Davies, Marshall Hansen, Andrew Eason … CS4624: Multimedia, Hypertext, and Information Access Instructor: Dr. Edward.
CEED Phone App Madhur Mahajan, Zachary Hensley, Randy Liang, Sean Greynolds CS4624: Multimedia, Hypertext, and Information Access Edward A. Fox Virginia.
Maptivity Conor O’Neill, Kaz Eslami, Cody Douglass
Event Focused URL Extraction from Tweets
Getting Started.
Event Trend Detector Ryan Ward, Skylar Edwards, Jun Lee, Stuart Beard, Spencer Su CS 4624 Multimedia, Hypertext, and Information Access Instructor: Edward.
Final Presentation: Neural Network Doc Summarization
Getting Started.
News Event Detection Website Joe Acanfora, Briana Crabb, Jeff Morris
Paleontology Topic Trends
Tweet URL Analysis Guoxin Sun, Kehan Lyu, Liyan Li
Through the Fire and Flames
SharePoint Broken Link Manager
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Microsoft Office Not in Textbook.
Dell Latitude Laptop Student setup.
Team 7 → Final Presentation
NAVIGATING THE MINEFIELD
Autism Support Portal Members: Sib Quayum, Ryan Galliher, Ayumi Ritchie, Kenneth Nagies Course: Multimedia, Hypertext, and Information Access (CS 4624)
Microsoft Office Illustrated Fundamentals
Cases Admin Training.
E-learning and educational technology tools
How to install and manage exchange server 2010 OP Saklani.
Python4ML An open-source course for everyone
Presentation transcript:

The Team Ernesto Cortes Kipp Dunn Sar Gregorczyk Alex Schmidt Opinion Mining & Summarization The Team Ernesto Cortes Kipp Dunn Sar Gregorczyk Alex Schmidt Project Info Multimedia, Hypertext, and Information Access Instructor: Edward A. Fox Virginia Tech, Blacksburg VA 24061 05/01/2018 Sar

Presentation Outline Our Mission Web-Crawler Database and Web-app Summarization Demo Lessons Learned Contributions References Questions Presentation Outline Sar

Our Mission Opinion Mining Project Create a suite of tools: Web-Crawler Database Summarization Toolkit Web Server Sar

Web-Crawler (Scrapy) Current Status Future Plans Web Server Integration Documentation Future Plans Additional sources Ernesto Source: https://doc.scrapy.org/en/latest/topics/architecture.html

Database and Web Application Current Status Integration with NLP tools Updated UX and UI Future Plans Data Sanitization Crawling and NLP options Better UI and UX Kipp Relational DB because: Link information between tables Add new tables Design: By separating the data, we have the ability to easily add information to a source or a specific product without changing thousands of different entries in a table.

Summarization: Extract Reviews with highest helpfulness Build 5 corpuses for each rating level Database Extractive Summarization Keyword Extraction Lemmatize and remove stopwords LDA Topic Modeling Schmidt We have done: extractive summarization, keywords, topic modeling (similar to keyword extraction however views the document more completely to get groupings of related words instead of just individual words) Next: automate word cloud generation, webapp integration, refinement. Future: abstract summarization training and integration.

Summary Example for Dell Inspiron: WIndows 10 works beautifully on this laptop, On the flip side I think the product that I have got has some inherent issue with the in-built speakers. Especially the driver under network section with name - Intel PROSet/Wireless 3165 WiFi Driver I downloaded the above driver on a different computer and ported to this new Dell laptop via flash drive.After installing above driver, this product starts connecting to Wifi and then I felt that I can use this laptop. To correct the problem, perform the following steps (assuming your laptop will not stay connected to the internet long enough to download the updated driver): 1. However, Dells very helpful tech synced….

Final Product Product Selection Screen

Final Product Individual Product

Lessons Learned Design time is important Open-source libraries are your friends The client can be a great resource Lessons Learned Sar

Contributions Kipp Dunn: Web Application & DB Lead Alex Schmidt: Summarization Tools Lead Ernesto Cortes: Web Crawler Lead Sar Gregorczyk: Documentation Lead and Team Coordination Sar

Acknowledgements Our client: Xuan Zhang Currently taking the Ph.D program at the Computer Science Department of Virginia Tech. My research area is Natural Language Processing. The research projects I have been involved include: 1) Product defect identification based on probabilistic graphical model 2) Unsupervised events extraction based on topic modeling and named entity recognition 3) Adverse events recognition based on classification and data under-sampling

References https://mysql-net.github.io/MySqlConnector/tutorials/net-core-mvc/ Gensim: https://radimrehurek.com/gensim/ https://rare-technologies.com/text-summarization-with-gensim/ Scrapy: https://scrapy.org/ Sar

Questions? Sar Ta aon cheiste agat?