Statistical Learning Methods for Natural Language Processing on the Internet 徐丹云.

Slides:



Advertisements
Similar presentations
Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
Advertisements

Metadata in Carrot II Current metadata –TF.IDF for both documents and collections –Full-text index –Metadata are transferred between different nodes Potential.
Chapter 5: Introduction to Information Retrieval
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Modern Information Retrieval Chapter 1: Introduction
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Search Engines and Information Retrieval
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Information Retrieval in Practice
1 Information Retrieval and Web Search Introduction.
Basic IR Concepts & Techniques ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Chapter 5: Information Retrieval and Web Search
Overview of Search Engines
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
A Two Tier Framework for Context-Aware Service Organization & Discovery Wei Zhang 1, Jian Su 2, Bin Chen 2,WentingWang 2, Zhiqiang Toh 2, Yanchuan Sim.
INF 141 COURSE SUMMARY Crista Lopes. Lecture Objective Know what you know.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Information Retrieval Models - 1 Boolean. Introduction IR systems usually adopt index terms to process queries Index terms:  A keyword or group of selected.
Probabilistic Query Expansion Using Query Logs Hang Cui Tianjin University, China Ji-Rong Wen Microsoft Research Asia, China Jian-Yun Nie University of.
Chapter 6: Information Retrieval and Web Search
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
LATENT SEMANTIC INDEXING Hande Zırtıloğlu Levent Altunyurt.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Implicit User Modeling for Personalized Search Xuehua Shen, Bin Tan, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
8 December 1997Industry Day Applications of SuperTagging Raman Chandrasekar.
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
UIC at TREC 2006: Blog Track Wei Zhang Clement Yu Department of Computer Science University of Illinois at Chicago.
Image Retrieval and Ranking using L.S.I and Cross View Learning Sumit Kumar Vivek Gupta
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs Zhilin Yang 12, Jie Tang 1, William W. Cohen 2 1 Tsinghua University 2 Carnegie Mellon.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Information Retrieval in Practice
Information Retrieval in Practice
Information Organization: Overview
Search Engine Architecture
Sentiment analysis algorithms and applications: A survey
School of Computer Science & Engineering
Chinese Academy of Sciences, Beijing, China
Information Retrieval and Web Search
Personalized Social Image Recommendation
Information Retrieval and Web Search
Information Retrieval and Web Search
Lecture 24: NER & Entity Linking
Web IR: Recent Trends; Future of Web Search
Information Retrieval
Parallel Analytic Systems
John Lafferty, Chengxiang Zhai School of Computer Science
Multimedia Information Retrieval
CSE 635 Multimedia Information Retrieval
Chapter 5: Information Retrieval and Web Search
CS246: Information Retrieval
Information Retrieval and Web Design
Recuperação de Informação
Information Retrieval and Web Search
Topic: Semantic Text Mining
Presentation transcript:

Statistical Learning Methods for Natural Language Processing on the Internet 徐丹云

CCF ADL 52 12.5-7@Shenzhen ~200

Wikification and Beyond: The Challenges of Entity and Concept Grounding Heng Ji,Rensselaer Polytechnic Institute Motivation and Task Definition, A Skeletal View of Wikification Systems Key Challenges and Recent Advances, New Tasks, Trends and Applications

Wikification and Beyond: The Challenges of Entity and Concept Grounding Input: A text document d; Output: a set of pairs (mi, ti) Identifying mentions mi in d Local Inference For each mi in d: (1)Identify a set of relevant titles T(mi); (2)Rank titles ti Global Inference For each document d: (1)Consider all mi and all ti; (2) Re-rank titles ti Heng Ji,Rensselaer Polytechnic Institute

Wikification and Beyond: The Challenges of Entity and Concept Grounding Improving Wikification by Acquiring Rich Knowledge Better Meaning Representation Collaborative Title Collection Global Inference Using the Additional Knowledge Joint Mention Extraction and Linking Collective Inference Matches of Knowledge Graphs Heng Ji,Rensselaer Polytechnic Institute

Wikification and Beyond: The Challenges of Entity and Concept Grounding I don’t think Republican candidates like Romney, Newt, and Johnson have a real chance for the election.(Abstract Meaning Representation) Heng Ji,Rensselaer Polytechnic Institute candidate Person:Johnson Person:Romney Person:Newt and Political-party:Republican

Big Learning with Bayesian Methods Jun Zhu,Tsinghua University Basics, Big Learning Challenges, and Regularized Bayesian Inference Online Learning, Large-scale Topic Graph Learning and Visualization

Big Learning with Bayesian Methods Jun Zhu,Tsinghua University Concept of Baye’s Rule Approximate Bayesian Inference Markov chain Monte Carlo methods(MCMC) Examples A Bayesian Ranking Model Latent Dirichlet Allocation

Big Learning with Bayesian Methods Computationally efficient Bayesian models are becoming increasingly relevant in Big data era RegBayes: Bridges Bayesian methods, learning and optimization Offers an extra freedom to incorporate rich side information Many scalable algorithms have been developed: Online/stochastic algorithms(e.g., online BayesPA) Distributed inference algorithms(e.g, scalable CTM) Jun Zhu,Tsinghua University

Sentiment Analysis: Mining Opinions, Sentiments and Emotions Bing Liu, University Of Illinois at Chicago Sentiment Analysis Essentials Advanced Topics

Sentiment Analysis: Mining Opinions, Sentiments and Emotions Definition Opinion(entity, aspect, sentiment, holder, time) Analysis Document Sentence Entity Aspect Bing Liu, University Of Illinois at Chicago

Sentiment Analysis: Mining Opinions, Sentiments and Emotions Aspect extraction Finding frequent nouns and noun phrases Exploiting opinion and target relations Supervised learning Topic modeling Aspect sentiment classification Lexicon-based approach Bing Liu, University Of Illinois at Chicago

Sentiment Analysis: Mining Opinions, Sentiments and Emotions Advanced Topics Explicit and implicit aspects “The picture quality of this phone is great” VS ”This car is so expensive” Resource usage aspect and sentiment “This washer uses a lot of water” Coreference resolution “This phone’s sound is great. It is cheap too.” Bing Liu, University Of Illinois at Chicago

Semantic Matching in Search Jun Xu, China Academy of Sciences Semantic Matching between Query and Document Approaches to Semantic Matching in Search

Semantic Matching in Search Jun Xu, China Academy of Sciences Semantic Matching between Query and Document query document Term match Semantic match Seattle best hotel Seattle best hotels Partial Yes Pool schedule Swimming pool schedule Natural logarithm transform Logarithm transform China kong China hong kong No Why are windows so expensive Why are macs so expensive

Semantic Matching in Search Jun Xu, China Academy of Sciences Aspects of Semantic Matching Term: NY ->NY Phrase: hot dog -> hot dog Sense: utube -> youtube Topic: Microsoft Office -> Microsoft, PowerPoint, Word, Excel… Structure: how far is sun from earth -> distance between sun and earth

Semantic Matching in Search Jun Xu, China Academy of Sciences Query: michael jordan berkele Term: michael jordan berkeley Phrase: Michael Jordan berkeley Sense: michael i. jordan Topic: machine learning, berkeley Structure: michael jordan

Semantic Matching in Search Jun Xu, China Academy of Sciences Document: Homepage of Michael Jordan Phrase: Michael Jordan, Berkeley, professor Topic: machine learning, berkeley Structure: michael jordan

Semantic Matching in Search Jun Xu, China Academy of Sciences Approaches to Semantic Matching in Search Matching by Query Reformulation Matching with Term Dependency Model Matching with Translation Model Matching with Topic Model Matching Latent Space Model

From Simple Search to Search Intelligence: The Evolution of Search Engines Jianyun Nie, University of Montreal Traditional IR Models, Query and Document Expansion Advanced Methods of Intelligent IR, Mining Relations in Documents and Query logs, Mining Search Intents

From Simple Search to Search Intelligence: The Evolution of Search Engines Traditional IR Models, Query and Document Expansion Indexing Stopwords stemming Retrieval Boolean model Vector space model Probabilistic model Jianyun Nie, University of Montreal Doucument Query indexing indexing Representation (keywords) Representation (keywords) Retrieval

From Simple Search to Search Intelligence: The Evolution of Search Engines Jianyun Nie, University of Montreal Advanced Methods of Intelligent IR, Mining Relations in Documents and Query logs, Mining Search Intents Using Language Model in IR Query Expansion Inference

Machine Learning for Search Ranking and Ad Auction Tieyan Liu, Microsoft Research Machine learning for Web Search Machine learning for computational advertising

Machine Learning for Search Ranking and Ad Auction Machine learning for Web Search Learn to rank methods Regression Classification Pairwise regression Listwise Ranking Generalization theory Tieyan Liu, Microsoft Research

Machine Learning for Search Ranking and Ad Auction Machine learning for computational advertising User Click Behavior Modeling Advertiser Bidding Behavior Modeling Auction Mechanism Optimization Tieyan Liu, Microsoft Research

Thank You!