PURE Learning Plan Richard Lee, James Chen,.

Slides:



Advertisements
Similar presentations
News and Blog Analysis with Lydia Steven Skiena Dept. of Computer Science SUNY Stony Brook
Advertisements

WWW 2014 Seoul, April 8 th SNOW 2014 Data Challenge Two-level message clustering for topic detection in Twitter Georgios Petkos, Symeon Papadopoulos, Yiannis.
TEMPLATE DESIGN © Identifying Noun Product Features that Imply Opinions Lei Zhang Bing Liu Department of Computer Science,
Poster Print Size: This poster template is 24” high by 36” wide. It can be used to print any poster with a 2:3 aspect ratio including 36x54 and 48x72.
Methods in Computational Linguistics II Queens College Lecture 1: Introduction.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
IVITA Workshop Summary Session 1: interactive text analytics (Session chair: Professor Huamin Qu) a) HARVEST: An Intelligent Visual Analytic Tool for the.
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Automatic Discovery of Technology Trends from Patent Text Youngho Kim, Yingshi Tian, Yoonjae Jeong, Ryu Jihee, Sung-Hyon Myaeng School of Engineering Information.
CSCD 555 Research Methods for Computer Science
COURSE OVERVIEW ADVANCED TEXT ANALYTICS Thomas Tiahrt, MA, PhD CSC492 – Advanced Text Analytics.
Siemens Big Data Analysis GROUP 3: MARIO MASSAD, MATTHEW TOSCHI, TYLER TRUONG.
DeepDive Deep Linguistic Processing with Condor Feng Niu, Christopher Ré, and Ce Zhang Hazy Research Group University of Wisconsin-Madison
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
ELN – Natural Language Processing Giuseppe Attardi
Impact of the Toll-access vs. Open-access Resources.
2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.
Linguistics & AI1 Linguistics and Artificial Intelligence Linguistics and Artificial Intelligence Frank Van Eynde Center for Computational Linguistics.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali and Vasileios Hatzivassiloglou Human Language Technology Research Institute The.
U & I: Users & Information Lab Sept 2008  Alice Oh 
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
Community Information Service Omid Fatemieh CS 598 CXZ Department of Computer Science University of Illinois at Urbana-Champaign.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Acknowledgements Contact Information Objective An automated annotation tool was developed to assist human annotators in the efficient production of a high.
CS 6998 NLP for the Web Columbia University 04/22/2010 Analyzing Wikipedia and Gold-Standard Corpora for NER Training William Y. Wang Computer Science.
Natural language processing tools Lê Đức Trọng 1.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
TEXT ANALYTICS - LABS Maha Althobaiti Udo Kruschwitz Massimo Poesio.
Natural Language Programming David Vadas The University of Sydney Supervisor: James Curran.
Computer Skills for Economic Analysis By Greg Haffner.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 3. Word Association.
From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Iana Atanassova Research: – Information retrieval in scientific publications exploiting semantic annotations and linguistic knowledge bases – Ranking algorithms.
Information Transfer through Online Summarizing and Translation Technology Sanja Seljan*, Ksenija Klasnić**, Mara Stojanac*, Barbara Pešorda*, Nives Mikelić.
Tools for Linguistic Analysis. Overview of Linguistic Tools  Dictionaries  Linguistic Inquiry and Word Count (LIWC) Linguistic Inquiry and Word Count.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
Detection of Misinformation on Online Social Networking
Measuring Monolinguality
Sentiment analysis algorithms and applications: A survey
Google SyntaxNet “Parsey McParseface and other SyntaxNet models are some of the most complex networks that we have trained with the TensorFlow framework.
CORPUS LINGUISTICS Corpus linguistics is the study of language as expressed in samples (corpora) or "real world" text. An approach to derive at a set of.
INAGO Project Automatic Knowledge Base Generation from Text for Interactive Question Answering.
Natural Language Processing (NLP)
Supervised Machine Learning
University of Computer Studies, Mandalay
Introduction to Machine Learning and NLP
--Mengxue Zhang, Qingyang Li
Text Analytics Giuseppe Attardi Università di Pisa
Writing Analytics Clayton Clemens Vive Kumar.
Phil Durrant Debra Myhill Mark Brenchley
Text Analytics and Machine Learning Workshop
Extracting Recipes from Chemical Academic Papers
Natural Language Processing (NLP)
CS224N Section 3: Corpora, etc.
University of Illinois System in HOO Text Correction Shared Task
Engleski jezik struke 3 Sreda,
CS224N Section 3: Project,Corpora
Analyzing and Organizing Information
Stance Classification of Ideological Debates
CS249 Advanced Seminar: Learning From Text
Natural Language Processing (NLP)
Presentation transcript:

PURE Learning Plan Richard Lee, James Chen,

Project Overview Mentored by PhD candidate Jason Cho Advice from Professor Eric Meyer of the University of Illinois Department of Journalism Project in automatic bias-detection in newspaper articles Split into two parts: Similar article topic recognition Easier, but less interesting Automatic bias detection Harder, but more interesting

What Have We Learned? This is a very, very difficult problem. Project involves a variety of fields, including computer science, English, linguistics, mathematics, journalism, etc. Relatively unique problem, no thorough solution has been attempted/researched Similar research done, such as sarcasm detection, bias research (linguistics), etc.

Getting Up to Speed Tools Papers OpenNLP, Python NLTK Stanford NLP Toolkit Papers Shedding (a Thousand Points of) Light on Biased Language Recognizing stances in online debates Extracting opinion targets in a single-and cross-domain setting with conditional random fields

Ideas Investigating words that have a high correlation with bias, and seeing if the presence of them indicate bias Take presumably biased articles, use named- entity recognition to parse relevant sentences (eg, all sentences with "Obama" present) Use a combination of word-counting, part-of- speech tagging, and word sentiment determination to aggregate overall bias.