What Did We See? & WikiGIS Chris Pal University of Massachusetts A Talk for Memex Day MSR Redmond, July 19, 2006.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Interactive Evolutionary Computation Review of Applications Praminda Caleb-Solly Intelligent Computer Systems Centre University of the West of England.
Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.
Deep Learning Bing-Chen Tsai 1/21.
Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
Integrated Instance- and Class- based Generative Modeling for Text Classification Antti PuurulaUniversity of Waikato Sung-Hyon MyaengKAIST 5/12/2013 Australasian.
An Introduction to Conditional Random Field Ching-Chun Hsiao 1.
Designing Multimedia with Fuzzy Logic Enrique Diaz de Leon * Rene V. Mayorga ** Paul D. Guild *** * ITESM, Guadalajara Campus, Mexico ** Faculty of Engineering,
An Overview of Machine Learning
Conditional Random Fields - A probabilistic graphical model Stefan Mutter Machine Learning Group Conditional Random Fields - A probabilistic graphical.
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data John Lafferty Andrew McCallum Fernando Pereira.
Data Visualization STAT 890, STAT 442, CM 462
Hidden Markov Models Theory By Johan Walters (SR 2003)
Deep Learning.
Distributional Clustering of Words for Text Classification Authors: L.Douglas Baker Andrew Kachites McCallum Presenter: Yihong Ding.
Research Introspection “ICML does ICML” Andrew McCallum Computer Science Department University of Massachusetts Amherst.
Scalable Training of Mixture Models via Coresets Daniel Feldman Matthew Faulkner Andreas Krause MIT.
CPSC 422, Lecture 18Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18 Feb, 25, 2015 Slide Sources Raymond J. Mooney University of.
Scalable Text Mining with Sparse Generative Models
Diffusion Geometries, and multiscale Harmonic Analysis on graphs and complex data sets. Multiscale diffusion geometries, “Ontologies and knowledge building”
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Crash Course on Machine Learning
Data Mining Techniques
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
Multimedia Databases (MMDB)
CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Graphical models for part of speech tagging
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
A Weakly-Supervised Approach to Argumentative Zoning of Scientific Documents Yufan Guo Anna Korhonen Thierry Poibeau 1 Review By: Pranjal Singh Paper.
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
M Machine Learning F# and Accord.net. Alena Dzenisenka Software architect at Luxoft Poland Member of F# Software Foundation Board of Trustees Researcher.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
CHAPTER 8 DISCRIMINATIVE CLASSIFIERS HIDDEN MARKOV MODELS.
Conditional Random Fields for ASR Jeremy Morris July 25, 2006.
Training Conditional Random Fields using Virtual Evidence Boosting Lin Liao, Tanzeem Choudhury †, Dieter Fox, and Henry Kautz University of Washington.
1 CRANDEM: Conditional Random Fields for ASR Jeremy Morris 11/21/2008.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
John Lafferty Andrew McCallum Fernando Pereira
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Introduction to Gaussian Process CS 478 – INTRODUCTION 1 CS 778 Chris Tensmeyer.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Mismatch String Kernals for SVM Protein Classification Christina Leslie, Eleazar Eskin, Jason Weston, William Stafford Noble Presented by Pradeep Anand.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Learning Bayesian Networks for Complex Relational Data
Introducing Precictive Analytics
Computer vision: models, learning and inference
Online Multiscale Dynamic Topic Models
Deep Learning Amin Sobhani.
Machine Learning Ali Ghodsi Department of Statistics
CSC 594 Topics in AI – Natural Language Processing
Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18
CRANDEM: Conditional Random Fields for ASR
Overview of Machine Learning
Speech recognition, machine learning
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18
Ping LUO*, Fen LIN^, Yuhong XIONG*, Yong ZHAO*, Zhongzhi SHI^
presented by Thomas L. Packer
New technologies have made it possible to:
Topic: Semantic Text Mining
Speech recognition, machine learning
Presentation transcript:

What Did We See? & WikiGIS Chris Pal University of Massachusetts A Talk for Memex Day MSR Redmond, July 19, 2006

Research Questions 1.How do personal and community photo- journals and blogs interact? Spectrum from personal blogs – community portals (bliki’s) – Wiki articles (most public) User Interface & Social Computing Research 2.Can we ‘mine’ information in Blogs ? Find Blog entries that look like Wiki entries, extract information, encourage contributions? Document and Text Processing Research 3.What is the role of computer vision for location and object recognition? Can we use these methods to provide the user with relevant information?

Search Blogs and Wiki Entries

Questions About Observations

Search and Social Computing I Discover that my friend Justin also found an interesting mushroom Have I been here as well?

1. Object Recognition From Images and Text 2. Location Recognition From Images and Text Object and Location Recognition

Conditional Random Fields y t-1 y t x t y t+1 x t +1 x t y t+2 x t +2 y t+3 x t +3 said Ling a Microsoft VP … OTHER PERSON OTHER ORG TITLE … Named Entities (SFSM states) Binary Features Input Sequence Widely applicable, many positive results e.g. speech recognition Fact Extraction (from Blogs and Wikis) Address extraction Information Extraction Example

Research Result - Training a CRF Define the vector of feature values a time t Define the global feature function as The gradient of the conditional log likelihood Model expectation, i.e.Empirical expectation

Results: CRF Training NetTalk text-to-speech: Linear-chain CRF training using sparse inference 75% less training time than exact training, with no loss in accuracy Accuracy: Fixed: 85.7 KL: 91.6 Exact: 91.6

SenseCam Enhanced Blogs Produce Lots of Data for Location Recognition

Multi-Conditional Learning Motivation - Simple GMM Example Joint Conditional Multi-Conditional

Multi-Conditional Learning One motivation: Conditional Random Fields can be derived from a traditional joint model But, there are many other conditional distributions that could be defined What do we gain if we model those as well? Other combinations possible

Image Segmentation/Pixel Classification MSR Cambridge / Berkeley Data

Mixtures of Factor Analyzers Generative model for simultaneous dimensionality reduction and clustering We wish to obtain a discriminative version of this type of model discriminatively

Performance vs. Model Complexity Interesting ? Joint Optimization benefits more substantially from additional data.

Performance with More Data Training Set AccuracyTest Set Accuracy hmm…

Search Blogs of Friends

Detect and Find Expert Knowledge

Simple Exponential Family Models for Documents

Results: Document Classification

New Graphical Models for and Blogs xbxb y NbNb xsxs NsNs xrxr N r-1 Body Title Friends Words Words discussed Predicted Recipient NrNr - function - random variable - N replications N Model: Nb words in the body, Ns words in the subject, Nr recipients The graph describes the joint distribution of random variables in term of the product of local functions Scenario: Predict which friends might be interested in your new Blog entry New Idea: Plated Factor Graphs

Detect Quality Content and Encourage Knowledge Contributions

Conclusions, Present & Future Work WikiGIS – Merged Blogs, Blikis and Wikis with Microsoft Virtual Earth Merge the SenseCam with a smart Phone - Enable Intelligent Digital Assistants - Output to the television Next Steps: Location and object recognition enabling information retrieval Other Uses: Assistive Technology for the Elderly

References & Results so Far with Charles Sutton and Andrew McCallum. Sparse Forward-Backward using Minimum Divergence Beams for Fast Training of Conditional Random Fields. In proceedings of ICASSP 2006.Sparse Forward-Backward using Minimum Divergence Beams for Fast Training of Conditional Random Fields with Michael Kelm and Andrew McCallum. Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning To appear in the proceedings of ICPR 2006.Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning with Andrew McCallum, Greg Druck and Xuerui Wang. Multi-Conditional Learning: Generative/ Discriminative Training for Clustering and Classification To appear in the proceedings of AAAI Multi-Conditional Learning: Generative/ Discriminative Training for Clustering and Classification CC Prediction with graphical models To appear in the proceedings of CEAS 2006.CC Prediction with graphical models