Outline 1.Introduction 2.Harvesting Classes 3.Harvesting Facts 4.Common Sense Knowledge 5.Knowledge Consolidation 6.Web Content Analytics 7.Wrap-Up from.

Slides:



Advertisements
Similar presentations
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Advertisements

Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Large-Scale Object Recognition using Label Relation Graphs Jia Deng 1,2, Nan Ding 2, Yangqing Jia 2, Andrea Frome 2, Kevin Murphy 2, Samy Bengio 2, Yuan.
Support Vector Machines and Kernels Adapted from slides by Tim Oates Cognition, Robotics, and Learning (CORAL) Lab University of Maryland Baltimore County.
Machine learning continued Image source:
UCB Computer Vision Animals on the Web Tamara L. Berg CSE 595 Words & Pictures.
Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.
Crowdsourcing 04/11/2013 Neelima Chavali ECE 6504.
Unsupervised Feature Selection for Multi-Cluster Data Deng Cai et al, KDD 2010 Presenter: Yunchao Gong Dept. Computer Science, UNC Chapel Hill.
Proceedings of the IEEE 2010 Antonio Torralba, MIT Jenny Yuen, MIT Bryan C. Russell, MIT.
How computers answer questions An introduction to machine learning Peter Barnum August 7, 2008.
A Metric-based Framework for Automatic Taxonomy Induction Hui Yang and Jamie Callan Language Technologies Institute Carnegie Mellon University ACL2009,
CSE 574 – Artificial Intelligence II Statistical Relational Learning Instructor: Pedro Domingos.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Introduction to Language Models Evaluation in information retrieval Lecture 4.
Representation learning for Knowledge Bases LivesIn BornIn LocateIn Friendship Nationality Nicole Kidman PerformIn Nationality Sydney Hugh Jackman Australia.
Word sense induction using continuous vector space models
Time-Sensitive Web Image Ranking and Retrieval via Dynamic Multi-Task Regression Gunhee Kim Eric P. Xing 1 School of Computer Science, Carnegie Mellon.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Adding Common Sense into Artificial Intelligence Common Sense Computing Initiative Software Agents Group MIT Media Lab.
Information Retrieval in Practice
Ontology Learning from Text: A Survey of Methods Source: LDV Forum,Volume 20, Number 2, 2005 Authors: Chris Biemann Reporter:Yong-Xiang Chen.
Outline 1.Introduction 2.Harvesting Classes 3.Harvesting Facts 4.Common Sense Knowledge 5.Knowledge Consolidation 6.Web Content Analytics 7.Wrap-Up in.
ConceptNet 5 Jason Gaines 1. Overview What Is ConceptNet 5? History Structure Demonstration Questions Further Information 2.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Date : 2014/09/18 Author : Niket Tandon, Gerard de Melo, Fabian Suchanek, Gerhard Weikum Source : WSDM’14 Advisor : Jia-ling Koh Speaker : Shao-Chun Peng.
PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
June 12, 2008 The University of Mississippi Design Strategy for Knowledge Base Formation to Automate a Course Map Creation Susan Lukose
Latent Semantic Analysis Hongning Wang Recap: vector space model Represent both doc and query by concept vectors – Each concept defines one dimension.
Constructing Knowledge Graph from Unstructured Text Image Source: Kundan Kumar Siddhant Manocha.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
AnalogySpace: Reducing the Dimensionality of Common Sense Knowledge Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From AAAI 2008 Robert.
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei Dept. of Computer Science, Princeton University, USA CVPR ImageNet1.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From AAAI 2008 William Pentney, Department of Computer Science & Engineering University of.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Web Search and Text Mining Lecture 5. Outline Review of VSM More on LSI through SVD Term relatedness Probabilistic LSI.
Recognition Using Visual Phrases
Tutorial: Knowledge Bases for Web Content Analytics
The Unreasonable Effectiveness of Data
Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date: From Word Representations:... ACL2010, From Frequency... JAIR 2010 Representing Word... Psychological.
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
NEIL: Extracting Visual Knowledge from Web Data Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta Carnegie Mellon University CS381V Visual Recognition -
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
A Review of Relational Machine Learning for Knowledge Graphs CVML Reading Group Xiao Lin.
Bassem Makni SML 16 Click to add text 1 Deep Learning of RDF rules Semantic Machine Learning.
Information Organization: Overview
A Brief Introduction to Distant Supervision
Wenhan Xiong, Thien Hoang, William Wang Department of Computer Science
Combining CNN with RNN for scene labeling (segmentation)
Probabilistic Data Management
Title {Calibri 36 Font Size}
HowToKB: Mining HowTo Knowledge from Online Communities
Distributed Representation of Words, Sentences and Paragraphs
Rob Fergus Computer Vision
WebChild: Harvesting and Organizing Commonsense Knowledge from Web
Commonsense Knowledge Acquisition and Applications
Knowledge Base Completion
On-going research on Object Detection *Some modification after seminar
Enriching Structured Knowledge with Open Information
Graph and Tensor Mining for fun and profit
Topological Signatures For Fast Mobility Analysis
Information Organization: Overview
VERB PHYSICS: Relative Physical Knowledge of Actions and Objects
Machine Learning for Space Systems: Are We Ready?
Presentation transcript:

Outline 1.Introduction 2.Harvesting Classes 3.Harvesting Facts 4.Common Sense Knowledge 5.Knowledge Consolidation 6.Web Content Analytics 7.Wrap-Up from the crowd from the Web from images from KBs 1

Goal: Commonsense Knowledge Apples are green, red, round, juicy, … but not fast, funny, verbose, … Pots and pans are in the kitchen or cupboard, on the stove, … but not in in the bedroom, in your pocket, in the sky, … People have only one spouse (at a time) Our goal is to find common sense knowledge such as Problem: This information is rarely mentioned explicitly on the Web "blind person": 570,000 results on Google "non-blind person": 3,300 results on Google "green cherry": 245m results "red cherry": 70m results "red cow": 31.9m results "brown cow": 31.8m results "green cow": 31.7m results 2

Verbosity Game [Speer & Havasi 2012] 3

Concept Net many inputs incl. WordNet, Verbosity game, etc. ConceptNet 5: 3.9 Mio concepts 12.5 Mio. edges oven cook restauran t satisfy hunge r follow recipe bake cake person survive eat desert sweet isA usedFor atLocation usedFor causes- Desire createdBy capableOf desires motivatedByGoal desires, capableOf atLocation usedFor isA hasProperty recievesAction motivatedByGoal [Speer & Havasi 2012] 4

Outline 1.Introduction 2.Harvesting Classes 3.Harvesting Facts 4.Common Sense Knowledge 5.Knowledge Consolidation 6.Web Content Analytics 7.Wrap-Up from the crowd √ from the Web from images from KBs 5

Pattern-Based Harvesting of Commonsense Properties Start with seed facts for apple hasProperty round dog hasAbility bark plate hasLocation table Find patterns that express these relations, such as X is very Y, X can Y, X put in/on Y, … Problem: noise and sparseness of data Solution: harness Web-scale n-gram corpora  5-grams + frequencies Confidence score: PMI (X,Y), PMI (p,(XY)), support(X,Y), … are features for regression model (N. Tandon et al.: AAAI 2011) Apply these patterns to find more facts. 6

Commonsense Properties with Semantic Types (N. Tandon et al.: WSDM 2014) 1) Compute the ranges for common-sense relations hasTaste: sweet, sour, spicy, … 2) Compute the domains for common-sense relations hasTaste: shake (milk shake), juice… 3) Compute assertions hasTaste: { shake/sweet, … } For all 3 tasks, use label propagation on a graph with few seeds from WordNet and with edges from n-gram corpus.  WebChild: 4 Mio. triples for 19 relations 7 univsch>

Universal Schema Rome,Italy Paris,France Florence,Italy … is the captial of is located in 's location in capitalOf locatedIn [Riedel et al.: HLT-NAACL’13] patterns in text relations in KB # co-occurrences sub-relations equivalent relations symmetry facts sub-patterns equivalent patterns Do matrix completion, learn: 8

Outline 1.Introduction 2.Harvesting Classes 3.Harvesting Facts 4.Common Sense Knowledge 5.Knowledge Consolidation 6.Web Content Analytics 7.Wrap-Up from the crowd √ from the Web √ from images from KBs 9 img>

ImageNet Populate WordNet classes with photos [J. Deng et al.: CVPR‘09] How: crowdsourcing for seeds, distantly supervised classifiers, object recognition (bounding boxes) in computer vision 10

NEIL Infer instances of partOf occursAt, inScene relations [X. Chen et al.: ICCV‘13] bike inScene trafficJam bike occursAt park pedals partOf bike 11

Outline 1.Introduction 2.Harvesting Classes 3.Harvesting Facts 4.Common Sense Knowledge 5.Knowledge Consolidation 6.Web Content Analytics 7.Wrap-Up from the crowd √ from the Web √ from images √ from KBs 12

Path Ranking Algorithm (PRA) isA- playsInLeague ABC Football player isA Batistuta hasTeam isInLeague (Lao et al.: EMNLP’11) plays- inLeague Observation: There are often alternative ways to walk from the subject of a fact to its object: hasTeam + isInLeague isA + isA- + playsInLeague playsInLeague(Batistuta, A) 13

Path Ranking Algorithm (PRA) isA- playsInLeague ABC Football player isA Batistuta hasTeam isInLeague (Lao et al.: EMNLP’11) plays- inLeague Use regularized logistic regression to learn a weight θ P indicating how often a path leads to the correct object: hasTeam + isInLeague60% isA + isA- + playsInLeague10% playsInLeague(Batistuta, A) 14

Path Ranking Algorithm (PRA) isA- playsInLeague ABC Football player isA Caruzzo hasTeam isInLeague (Lao et al.: EMNLP’11) playsInLeague(Caruzzo, A) ? path P Score(s,t)= Σ f P (s,t)θ P f P (s,t)=Pr(s → t ; P) 15

Learn Horn Rules Problem: Rule mining needs counter-examples, but RDF KBs are positive only. Should not use absent information as counter-examples because of the Open World Assumption. 16

Partial Completeness Assumption Partial Completeness Assumption: If we know r(x,y1),..., r(x,yn), then all other r(x,z) are false (used as "Local closed world assumption" in Google KV) [Galarraga et al.: WWW’13] 17

AMIE: Efficient Rule Mining [Galarraga et al.: WWW’13] AMIE (efficient search space traversal with pruning) 18 relwork>

More Related Work Sensor factorization (Harshman et al., '94) Probabilistic Relational Learning (Friedman et al., '99) Relational Markov Networks (Taskar et al., '02) Markov-logic Networks (Kok et al., '07) Extension of SBMs (Kemp et al., '06) (Sutskever et al., '10) Spectral clustering (undirected graphs) (Dong et al., '12) Collective matrix factorization (Nickel et al., '11) Embedding models (Bordes et al., '11, '13) (Jenatton et al., '12) (Socher et al., '13) (Wang et al., '14) (Garca-Duran et al., '14) (Weston et al., '14) Jason Weston’s AKBC’14 19

Open Problems and Grand Challenges 20 Commonsense rules beyond Horn clauses Comprehensive commonsense knowledge organized in ontologically clean manner especially for emotions and other analytics Visual knowledge with text grounding: populate concepts, typical activities & scenes could serve as training data for image & video understanding  x: type(x,spider)  numLegs(x)=8  x: type(x,animal)  hasLegs(x)  even(numLegs(x))  x: human(x)  (  y: mother(x,y)   z: father(x,z))  x: human(x)  (male(x)  female(x))