Date : 2014/06/10 Author :Shahab Kamali Frank Wm. Tompa Source : SIGIR’13 Advisor : Jia-ling Koh Speaker : Shao-Chun Peng Retrieving Documents With Mathematical.

Slides:



Advertisements
Similar presentations
1 TITLE OF THE ARTICLE Author (Institution) Co-Author (Institution) .
Advertisements

1 Modeling Query-Based Access to Text Databases Eugene Agichtein Panagiotis Ipeirotis Luis Gravano Computer Science Department Columbia University.
Chungnam National University DataBase System Lab
COSC2007 Data Structures II Chapter 10 Trees I. 2 Topics Terminology.
Slide 1 Insert your own content. Slide 2 Insert your own content.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Combining Like Terms. Only combine terms that are exactly the same!! Whats the same mean? –If numbers have a variable, then you can combine only ones.
Problem Solving & Program Design in C Sixth Edition By Jeri R. Hanly & Elliot B. Koffman 1-1.
0 - 0.
Teacher Name Class / Subject Date A:B: Write an answer here #1 Write your question Here C:D: Write an answer here.
Online Event-driven Subsequence Matching over Financial Data Streams Huanmei Wu,Betty Salzberg, Donghui Zhang Northeastern University, College of Computer.
Date : 2012/09/20 Author : Sina Fakhraee, Farshad Fotouhi Source : KEYS12 Speaker : Er-Gang Liu Advisor : Dr. Jia-ling Koh 1.
Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.
6/10/20141 Top-Down Clustering Method Based On TV-Tree Zbigniew W. Ras.
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
Traditional IR models Jian-Yun Nie.
Computing Structural Similarity of Source XML Schemas against Domain XML Schema Jianxin Li 1 Chengfei Liu 1 Jeffrey Xu Yu 2 Jixue Liu 3 Guoren Wang 4 Chi.
Boolean and Vector Space Retrieval Models
Indexing DNA Sequences Using q-Grams
Jan SedmidubskyOctober 28, 2011Scalability and Robustness in a Self-organizing Retrieval System Jan Sedmidubsky Vlastislav Dohnal Pavel Zezula On Investigating.
Chapter 30 Induction and Inductance In this chapter we will study the following topics: -Faraday’s law of induction -Lenz’s rule -Electric field induced.
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
Delta-Oriented Testing for Finite State Machines
Covering Indexes for XML Queries by Prakash Ramanan
Inconsistent Heuristics
Term Level Search Result Diversification DATE : 2013/09/11 SOURCE : SIGIR’13 AUTHORS : VAN DANG, W. BRUCE CROFT ADVISOR : DR.JIA-LING, KOH SPEAKER : SHUN-CHEN,
Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
DQR : A Probabilistic Approach to Diversified Query recommendation Date: 2013/05/20 Author: Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Source:
Entity-Centric Topic-Oriented Opinion Summarization in Twitter Date : 2013/09/03 Author : Xinfan Meng, Furu Wei, Xiaohua, Liu, Ming Zhou, Sujian Li and.
Date : 2013/09/17 Source : SIGIR’13 Authors : Zhu, Xingwei
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
1 Extending PRIX for Similarity-based XML Query Group Members: Yan Qi, Jicheng Zhao, Dan Situ, Ning Liao.
B-Tree B-Tree is an m-way search tree with the following properties:
Chapter 4 : Query Languages Baeza-Yates, 1999 Modern Information Retrieval.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Modern Information Retrieval Chapter 4 Query Languages.
Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval Microsoft Research Asia Yunhua Hu, Guomao Xin, Ruihua Song, Guoping.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Satoshi Oyama Takashi Kokubo Toru lshida 國立雲林科技大學 National Yunlin.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
1 Efficient Search Ranking in Social Network ACM CIKM2007 Monique V. Vieira, Bruno M. Fonseca, Rodrigo Damazio, Paulo B. Golgher, Davi de Castro Reis,
Querying Structured Text in an XML Database By Xuemei Luo.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
1 XSLT An Introduction. 2 XSLT XSLT (extensible Stylesheet Language:Transformations) is a language primarily designed for transforming the structure of.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
1 Using The Past To Score The Present: Extending Term Weighting Models with Revision History Analysis CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG,
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
A New Spatial Index Structure for Efficient Query Processing in Location Based Services Speaker: Yihao Jhang Adviser: Yuling Hsueh 2010 IEEE International.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
Date: 2012/11/29 Author: Chen Wang, Keping Bi, Yunhua Hu, Hang Li, Guihong Cao Source: WSDM’12 Advisor: Jia-ling, Koh Speaker: Shun-Chen, Cheng.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
Date: 2013/4/1 Author: Jaime I. Lopez-Veyna, Victor J. Sosa-Sosa, Ivan Lopez-Arevalo Source: KEYS’12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang KESOSD.
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
QUERY-PERFORMANCE PREDICTION: SETTING THE EXPECTATIONS STRAIGHT Date : 2014/08/18 Author : Fiana Raiber, Oren Kurland Source : SIGIR’14 Advisor : Jia-ling.
Click Through Rate Prediction for Local Search Results
Improving Search Relevance for Short Queries in Community Question Answering Date: 2014/09/25 Author : Haocheng Wu, Wei Wu, Ming Zhou, Enhong Chen, Lei.
Where Did You Go: Personalized Annotation of Mobility Records
إستراتيجيات ونماذج التقويم
Learning Literature Search Models from Citation Behavior
Date : 2013/1/10 Author : Lanbo Zhang, Yi Zhang, Yunfei Chen
HeteroMed: Heterogeneous Information Network for Medical Diagnosis
A Data-Driven Question Generation Model for Educational Content
Title Introduction: Discussion & Conclusion: Methods & Results:
Attention Is All You Need
Connecting the Dots Between News Article
Preference Based Evaluation Measures for Novelty and Diversity
Presentation transcript:

Date : 2014/06/10 Author :Shahab Kamali Frank Wm. Tompa Source : SIGIR’13 Advisor : Jia-ling Koh Speaker : Shao-Chun Peng Retrieving Documents With Mathematical Content

Outline 2 Introduction mathematical expressions Related Work Purpose Methods Experiments Conclusions

Introduction 3 Mathematical expressions(context-dependent rules) Content-based Presentation-based Dom Tree

Introduction 4

Related Work(Exact match) 5

Related Work(Approximate match) 6 SubexprExactMatch at least one of its subexpressions exactly matches the query some structure information is missed by transforming an expression into bags of tokens NormalizedSubExactMatch one of its normalized subexpressions matches the normalized query performance remains relatively poor

Related Work(Approximate match) 7 MIaS subtrees are normalized and transformed into tokens and a text search engine is used to index and retrieve them

Purpose 8 Mathematical Expression its appearance(or presentation) its mathematical meaning (often termed its content) how to capture the relevance of mathematical expressions, how to query them, and how to evaluate the results

Outline 9 Introduction Methods SIMILARITY SEARCH PATTERN SEARCH Experiments Conclusions

SIMILARITY SEARCH 10 Translate translated input into Presentation MathML Similarity based on tree edit distance

Tree Edit 11 T1 = (V1;E1) T2 = (V2;E2) τ is a sequence of edit operations that transforms T1 to T2 dist(T1; T2) = min {cost( τ )| τ (T1) = T2} E1 and E2 represented by trees T1 and T2.

Tree Edit(cost) 12 If λ (N1) = λ (N2) then cost(N1 → N2) = 0 If N1, N2 are leaf nodes and λ (N1) ≠ λ (N2) and λ (parent(N1)) = λ (parent(N2)), cost(N1 → N2) =C PL ( λ (parent(N1)); λ (N1); λ (N2)) mi i ji Cost= α N1 N2

Tree Edit(cost) 13 If N1, N2 are leaf nodes and λ (N1) ≠ λ (N2) and λ (parent(N1)) ≠ λ (parent(N2)) then cost(N1 → N2) =C L ( λ (N1) ; λ (N2) ) If N1, N2 are not both leaf nodes and λ (N1) ≠ λ (N2) then cost(N1 → N2) = C I ( λ (N1); λ (N2)) mi x mn 3 mi x Cost=2 β N1 N2

cost 14 α<=β<=γ X+2 and X+1 are different with cost α X+2 and X+Y are different with cost β X+2 and X+E are different with cost γ(E is a expression)

Tree Edit(cost) 15 Cost= α+β

PATTERN SEARCH 16 Translate: wild card Ranker sort results with respect to the sizes of the matched expressions in increasing order.

PATTERN SEARCH 17 a template can be defined using wildcards as non- terminals,and regular expressions to describe their relationships Number wild cards,Variable wild cards, Operator wild cards

wild cards(example) 18

Feedback 19

Outline 20 Introduction Method Experiments Conclusion

Experiment-Dataset 21 DLMF : Digital Library of Mathematics Functions

Evaluation measures 22 Consider top 10 result NFR MRR

Results 23

Outline 24 Introduction Methods Experiments Conclusions

25 categorize existing approaches to match mathematical expressions, concentrating on two paradigms that consider the structure of expressions propose a representative system for each search paradigm