I. Research Overview and Outcome

Slides:



Advertisements
Similar presentations
The difference between what we want and what we have got
Advertisements

Ability-Based Education at Alverno College. Proposed Outcomes for Session 1. To introduce you to Alvernos approach to designing integrative general education.
CWS: A Comparative Web Search System Jian-Tao Sun, Xuanhui Wang, § Dou Shen Hua-Jun Zeng, Zheng Chen Microsoft Research Asia University of Illinois at.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Part III. Presentation Style How you do it also matters.
INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Large-Scale Entity-Based Online Social Network Profile Linkage.
TAILS: COBWEB 1 [1] Online Digital Learning Environment for Conceptual Clustering This material is based upon work supported by the National Science Foundation.
A Probabilistic Framework for Semi-Supervised Clustering
Human Language Technologies. Issue Corporate data stores contain mostly natural language materials. Knowledge Management systems utilize rich semantic.
 Manmatha MetaSearch R. Manmatha, Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts, Amherst.
An Overview of Link Analysis Techniques for Academic Web Sites Mike Thelwall, Statistical Cybermetrics Research Group, University of Wolverhampton, UK.
Scalable Text Mining with Sparse Generative Models
Large-Scale Cost-sensitive Online Social Network Profile Linkage.
Model Driven Development reduces the problem-implementation gap by redefining the role of models and using platforms for translating and realizing the.
Developing writing/speaking skills on Yahoo discussion groups.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
JASS 2005 Next-Generation User-Centered Information Management Information visualization Alexander S. Babaev Faculty of Applied Mathematics.
Museum and Community Partnerships.
Data / Information / Knowledge Presentation by Pauline Lake Modifications by Rick Mercer Acknowledgment and Disclaimer: This presentation is supported.
Abstract Developing sign language applications for deaf people is extremely important, since it is difficult to communicate with people that are unfamiliar.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Visualizing Ontology Components through Self-Organizing.
Partnership for International Research and Education A Global Living Laboratory for Cyberinfrastructure Application Enablement Enhanced Grid Enabled Weather.
Soccer Video Analysis EE 368: Spring 2012 Kevin Cheng.
Robert Savinelli, Sam Fleischman, and Jason White University of California Santa Barbara PARTNERSHIP FOR INTERNATIONAL RESEARCH AND EDUCATION 向荣柏向荣柏 白杰森白杰森.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Partnership for International Research and Education A Global Living Laboratory for Cyberinfrastructure Application Enablement II. International Experience.
Exploring in the Weblog Space by Detecting Informative and Affective Articles Xiaochuan Ni, Gui-Rong Xue, Xiao Ling, Yong Yu Shanghai Jiao-Tong University.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Introduction to Sampling Methods Qi Zhao Oct.27,2004.
Thinking in Parallel - Introduction New Mexico Supercomputing Challenge in partnership with Intel Corp. and NM EPSCoR.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Boosting the Feature Space: Text Classification for Unstructured.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Presentation by: ABHISHEK KAMAT ABHISHEK MADHUSUDHAN SUYAMEENDRA WADKI
Using Video-Annotated Peer Review to Support the Diffusion of
Sentiment analysis algorithms and applications: A survey
? What is Institutional Repository for Rutgers University
Efficient Image Classification on Vertically Decomposed Data
Geographical Information Systems
Shortest title possible, keep it simple!
The Binary Number System
Introductory Seminar on Research: Fall 2017
Discussion and Conclusion
Parallelized Analysis Using Subdural Interictal EEG
| (269) | Western Michigan University
“ ” A case study to support testing of dynamic SPL
I.III Implementation I. Introduction III. Layers I.II Approach
Let’s Get This Course Started
Efficient Image Classification on Vertically Decomposed Data
KINAMIN Plugin - Kaseya IN A MINute
“Bayesian Identity Clustering”
Working Set: Model and Characterization
Description of the Problem
Title here Subtitle (optional) Methodology Introduction Discussion
Put your name here Name of the Department, School or College
Title of Poster Site Visit 2017 Introduction Results
Put your name here Name of the Department, School or College
Supporting Material for the Biodiversity Teaching Experiment
Put your name here Name of the Department, School or College
Deep Cross-media Knowledge Transfer
Siân Curtis, PhD OVC Evaluation Dissemination Meeting,
Title of Poster Title of Poster Title of Poster
Title of Poster Title of Poster Title of Poster
Text Categorization Berlin Chen 2003 Reference:
Title of Poster Site Visit 2018 Introduction Results
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
This material is based upon work supported by the National Science Foundation under Grant #XXXXXX. Any opinions, findings, and conclusions or recommendations.
Project Title: I. Research Overview and Outcome
Conclusion & Discussion Research purposes/ Research hypothesis
Presentation transcript:

I. Research Overview and Outcome Partnership for International Research and Education A Global Living Laboratory for Cyberinfrastructure Application Enablement Project Title: Multi-type Data Clustering Across Multiple Feature Spaces Student: Michael Armella, Ph.D Student, Florida International University FIU Advisor: Shu-Ching Chen, Professor, Florida International University PIRE International Partner Advisor: Yang Qiu Song, IBM China Research Lab National Science Foundation I. Research Overview and Outcome Introduction Documents and records stored by most companies is multi-type Multi-type data is a more complicated search task than single-type data Searching multi-type data requires the comparison of dissimilar types Problem Statement Features extracted from dissimilar types can not be compared Modeling the relationships between these types is difficult Features from dissimilar types must be linked or transformed in order to be compared Feature Extraction Text, image and video features were extracted. The features between text and multimedia are not comparable. Text Features: term frequency-inverse document frequency (tf-idf) Image Features: Color histogram, Texture histogram, Blob detection, SIFT points Video Features: Extract keyframes and extract features from keyframe images Translated Learning Translated learning [1] is process of learning in one domain and then using this knowledge in another domain The purpose of translated learning is to build a “bridge” to link one feature space (known as the “source space”) to another space (known as the “target space”) through a translator in order to migrate the knowledge from source to target The translated learning model can be represented as follows[1]: Semi-Supervised Learning Future Work Modification of HMRF to allow for application of transferred learning to create links Develop method to evaluate the effectiveness of multi-type search over single-type search Develop a full scale web system to implement the proposed techniques for web based search Hidden Markov Random Field (HMRF) [2] is a semi-supervised learning technique that takes known labeled data and pair wise attributes which link objects as either must-link or cannot-link Instead of having an underlying Markov chain, hidden Markov random fields have an underlying Markov random field The main difference with a hidden Markov model is that neighborhood is not defined in one dimension but within a network Objects which have must-links will be heavily weighted to be given the same label; objects which have cannot-links will be heavily weighted to be given different labels Conclusion Proposed multi-type search has promise and can be used to create refined search methodologies The ability to transfer features between types allows for increased application of traditional search methodologies over non-traditional data sets References [1] Wenyuan Dai, Yuqiang Chen, Gui-Rong Xue, Qiang Yang, Yong Yu. “Translated Learning: Transfer Learning across Different Feature Spaces”. In Proceedings of NIPS'2008. pp.353~360 [2] S. Basu, M. Bilenko, and R. J. Mooney, "A probabilistic framework for semi-supervised clustering," Seattle, WA, August 2004, pp. 59-68. Where y represents the features of the data instances x The training data xs is represented by features ys Figure 1. Hidden Markov Random Field [2] The test data xt is represented by features yt We create a new chain that connects the features ys to yt and allows use to map features to xt. The mapping from ys to yt is done through the creation of a translator II. International Experience Temple of Heaven - 天坛 – It is amazing to visit a temple that is near 600 years old and is one of the largest in Beijing Research Team at IBM-CRL Summer Palace - 颐和园 – Great views, very relaxing, it is easy to see why the emperor would want to spend his summers here Experience of a lifetime See a beautiful foreign country Experience new and interesting culture, food and language Leave your comfort zone and learn more about your self Meet and form collaborations with other researchers from around the world. Fragrance Hills - 香山 – Great views over western Beijing can be found if you are willing to make the 2 hour hike to the top Tsinghua View: Some days were better than others. The campus was much larger than any college campus I had ever seen Food: The food was delicious, I have never eaten so many different types of food. I tried food that I never would have at home and found it delicious III. Acknowledgement The material presented in this poster is based upon the work supported by the National Science Foundation under Grant No. OISE-0730065. Additional support was provided by the GAANN Fellowship funded by U.S. Department of Education under Grant No. P200A070543. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation or the Department of Education.