1 Finding Competitive Price Yu Peng (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and Technology)

Slides:



Advertisements
Similar presentations
1 Competitive Privacy: Secure Analysis on Integrated Sequence Data Raymond Chi-Wing Wong 1, Eric Lo 2 The Hong Kong University of Science and Technology.
Advertisements

Identifying the Most Influential Data Objects with Reverse Top-k Queries By Akrivi Vlachou 1, Christos Doulkeridis 1, Kjetil Nørvag 1 and Yannis Kotidis.
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Minimizing Seed Set for Viral Marketing Cheng Long & Raymond Chi-Wing Wong Presented by: Cheng Long 20-August-2011.
1 Finding Shortest Paths on Terrains by Killing Two Birds with One Stone Manohar Kaul (Aarhus University) Raymond Chi-Wing Wong (Hong Kong University of.
Primality Testing Patrick Lee 12 July 2003 (updated on 13 July 2003)
A (1+  )-Approximation Algorithm for 2-Line-Center P.K. Agarwal, C.M. Procopiuc, K.R. Varadarajan Computational Geometry 2003.
Da Yan, Zhou Zhao and Wilfred Ng The Hong Kong University of Science and Technology.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
1 On Good and Fair Paper- Reviewer Assignment Cheng LONG, Raymond Chi-Wing Wong, Yu Peng, Liangliang Ye The Hong Kong University of Science and Technology.
RANSAC experimentation Slides by Marc van Kreveld 1.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Using Trees to Depict a Forest Bin Liu, H. V. Jagadish EECS, University of Michigan, Ann Arbor Presented by Sergey Shepshelvich 1.
Probabilistic Inference Protection on Anonymized Data
Privacy Preserving Serial Data Publishing By Role Composition Yingyi Bu 1, Ada Wai-Chee Fu 1, Raymond Chi-Wing Wong 2, Lei Chen 2, Jiuyong Li 3 The Chinese.
On Efficient Spatial Matching Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Yufei Tao (the Chinese University of Hong Kong) Ada Wai-Chee.
FlexPref: A Framework for Extensible Preference Evaluation in Database Systems Justin J. Levandoski Mohamed F. Mokbel Mohamed E. Khalefa.
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
Efficient Skyline Querying with Variable User Preferences on Nominal Attributes Raymond Chi-Wing Wong 1, Ada Wai-Chee Fu 2, Jian Pei 3, Yip Sing Ho 2,
1 Mining Favorable Facets Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Jian Pei (Simon Fraser University) Ada Wai-Chee Fu (the Chinese University.
1 Continuous k-dominant Skyline Query Processing Presented by Prasad Sriram Nilu Thakur.
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
Probabilistic Skyline Operator over sliding Windows Wan Qian HKUST DB Group.
Creating Competitive Products Qian Wan [1], Raymond Chi-Wing Wong [1], Ihab F. Ilyas [2], M. Tamer Ozsu [2], Yu Peng [1] [1] Hong Kong University of Science.
1 Efficient Algorithms for Optimal Location Queries in Road Networks Zitong Chen (Sun Yat-Sen University) Yubao Liu (Sun Yat-Sen University) Raymond Chi-Wing.

1 Hypersphere Dominance: An Optimal Approach Cheng Long, Raymond Chi-Wing Wong, Bin Zhang, Min Xie The Hong Kong University of Science and Technology Prepared.
Mining Preferences from Superior and Inferior Examples KDD’08 1.
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Approximate Frequency Counts over Data Streams Loo Kin Kong 4 th Oct., 2002.
Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach Wenjie Zhang, Xuemin Lin The University of New South Wales & NICTA Ming Hua,
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
Creating Competitive Products Qian Wan [1], Raymond Chi-Wing Wong [1], Ihab F. Ilyas [2], M. Tamer Ozsu [2], Yu Peng [1] [1] Hong Kong University of Science.
1 Exact Top-k Nearest Keyword Search in Large Networks Minhao Jiang†, Ada Wai-Chee Fu‡, Raymond Chi-Wing Wong† † The Hong Kong University of Science and.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Introduction to Matching Theory E. Maskin Jerusalem Summer School in Economic Theory June 2014.
Designing Example Critiquing Interaction Boi Faltings Pearl Pu Marc Torrens Paolo Viappiani IUI 2004, Madeira, Portugal – Wed Jan 14, 2004 LIAHCI.
Efficient Computation of Reverse Skyline Queries VLDB 2007.
K-Hit Query: Top-k Query Processing with Probabilistic Utility Function SIGMOD2015 Peng Peng, Raymond C.-W. Wong CSE, HKUST 1.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
1 Shape Segmentation and Applications in Sensor Networks Xianjin Xhu, Rik Sarkar, Jie Gao Department of CS, Stony Brook University INFOCOM 2007.
Mining Favorable Facets Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Jian Pei (Simon Fraser University) Ada Wai-Chee Fu (the Chinese University.
Efficient Processing of Top-k Spatial Preference Queries
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
1 On Optimal Worst-Case Matching Cheng Long (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
The σ-neighborhood skyline queries Chen, Yi-Chung; LEE, Chiang. The σ-neighborhood skyline queries. Information Sciences, 2015, 322: 張天彥 2015/12/05.
Privacy-preserving data publishing
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
On Top-n Reverse Top-k Queries: Variants, Algorithms, and Applications 陳良弼 Arbee L.P. Chen National Chengchi University 9/21/2012 at NCHU.
Finding skyline on the fly HKU CS DB Seminar 21 July 2004 Speaker: Eric Lo.
Other Clustering Techniques
HASE: A Hybrid Approach to Selectivity Estimation for Conjunctive Queries Xiaohui Yu University of Toronto Joint work with Nick Koudas.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
Multi-Task Assignment for CrowdSensing in Mobile Social Network Mingjun Xiao ∗, Jie Wu†, Liusheng Huang ∗, Yunsheng Wang‡, and Cong Liu§
HKU CSIS DB Seminar Skyline Queries HKU CSIS DB Seminar 9 April 2003 Speaker: Eric Lo.
Parallel Computation of Skyline Queries COSC6490A Fall 2007 Slawomir Kmiec.
Da Yan, Raymond Chi-Wing Wong, and Wilfred Ng The Hong Kong University of Science and Technology.
Abolfazl Asudeh Azade Nazi Nan Zhang Gautam DaS
CS260 Data Mining Using Tensor Methods 00 – Paper Title
Preference Query Evaluation Over Expensive Attributes
Conflict-Aware Event-Participant Arrangement
University of Crete Department Computer Science CS-562
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Continuous Density Queries for Moving Objects
Efficient Processing of Top-k Spatial Preference Queries
Fraction-Score: A New Support Measure for Co-location Pattern Mining
Presentation transcript:

1 Finding Competitive Price Yu Peng (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) Presented by Ted Prepared by Raymond Chi-Wing Wong

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 2

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table According to the spatial layout and the price information, we can generate a decision table. Consider that a customer looks for a hotel near to Sea World 3

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a customer looks for a hotel near to Sea World h 1 dominates h 3 (since h 1 is better than h 3 in terms of Distance-to- SeaWorld and Price).

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a customer looks for a hotel near to Sea World h 2 does not dominate h 3 (since h 2 has a shorter Distance- to-SeaWorld than h 3 but h 2 has a higher price than h 3.)

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a customer looks for a hotel near to Sea World Skyline: a set of hotels which are not dominated by other hotels Skyline = {h 1, h 2, h 4 } h 3 is dominated by h 1 A set of all “best” possible hotels

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ? How can we set the price of h f ?

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ?300 h f is dominated by h 2. $300 is not a competitive price.

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ?230 h f is not dominated by any hotel. $230 is a competitive price. Problem (Finding Simple Competitive Price): Given a set of existing hotels and a new hotel h f, what greatest possible price can we set for h f such that h f is in the skyline?

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ?230 h f does not dominate any hotel. In order to make sure that h f is chosen by customers with a higher probability, we would like to set the price of h f such that 1. h f is in the skyline 2. h f dominates at least K hotels where K is a user parameter. Problem (Finding Simple Competitive Price): Given a set of existing hotels and a new hotel h f, what greatest possible price can we set for h f such that h f is in the skyline?

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ?230 h f dominates one hotel (i.e., h 4 ). In order to make sure that h f is chosen by customers with a higher probability, we would like to set the price of h f such that 1. h f is in the skyline 2. h f dominates at least K hotels where K is a user parameter. 210 $210 is a 1-dominating competitive price. Problem (Finding K-Dominating Competitive Price): Given a set of existing hotels, a new hotel h f and an integer K, what greatest possible price can we set for h f such that (1) h f is in the skyline and (2) h f dominates at least K hotels. Finding K-Dominating Competitive Price is more general than Finding Simple Competitive Price. Problem (Finding Simple Competitive Price): Given a set of existing hotels and a new hotel h f, what greatest possible price can we set for h f such that h f is in the skyline?

1. Introduction In this paper, we study the problem of “finding K-dominating competitive price” Contributions: First to study this problem Most existing studies focus on how to find hotels in the skyline when the prices of the hotels are given. Propose an algorithm based on spatial properties 12

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 13

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ? A can contain more than one attraction-site {a 1, a 2, a 3 } a2a2 a3a3 Problem (Finding K-Dominating Competitive Price): Given a set of existing hotels, a new hotel h f and an integer K, what greatest possible price can we set for h f such that (1) h f is in the skyline and (2) h f dominates at least K hotels. Skyline requirement K-dominating requirement distance to a 1 … … … … distance to a 2 … … … … distance to a 3 … … … …

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 15

3. Algorithm Properties Algorithm 16

3. Algorithm Properties 2 properties for Skyline Requirement 1 property for K-dominating Requirement 17

3. Algorithm 2 Properties for Skyline Requirement Convex Hull Property Suppose that h f is in the convex hull of A. h f is in the skyline no matter what price we set for h f. 18 Spatial Layout hfhf h4h4 a1a1 h2h2 h3h3 h1h1 a2a2 a3a3 Convex hull

3. Algorithm 2 Properties for Skyline Requirement Non-Convex Hull Property Notations: I and U 19 Spatial Layout h4h4 a1a1 h2h2 h3h3 h1h1 a2a2 a3a3 hfhf I

3. Algorithm 2 Properties for Skyline Requirement Non-Convex Hull Property Notations: I and U 20 Spatial Layout h4h4 a1a1 h2h2 h3h3 h1h1 a2a2 a3a3 hfhf U

3. Algorithm 2 Properties for Skyline Requirement Non-Convex Hull Property Suppose that h f is in the convex hull of A. If we set the price to be min h ∈ I h.p, then h f is in the skyline 21 Spatial Layout h4h4 a1a1 h2h2 h3h3 h1h1 a2a2 a3a3 hfhf Suppose I contain some hotels. We set the smallest price among these hotels in I. Otherwise, we can set any price.

3. Algorithm Properties 2 properties for Skyline Requirement 1 property for K-dominating Requirement 22

3. Algorithm 1 property for K-dominating Requirement Suppose that there are at least K service-sites not in U. If we set the price to be the K-th greatest price among all hotels not in U, then h f dominates at least K hotels. 23 Spatial Layout h4h4 a1a1 h2h2 h3h3 h1h1 a2a2 a3a3 hfhf Suppose K = 2. We set the price of h f to be the 2 nd greatest price among these 3 hotels.

3. Algorithm Properties Algorithm 24

3. Algorithm find I and U find the convex hull of A (i.e., CH(A)) find the price (price sky ) based on 2 properties for the skyline requirement find the price (price dom ) based on 1 property for the K-dominating requirement price ← min{price sky, price dom } return price 25

3. Algorithm find I and U find the convex hull of A (i.e., CH(A)) find the price (price sky ) based on 2 properties for the skyline requirement find the price (price dom ) based on 1 property for the K-dominating requirement price ← min{price sky, price dom } return price 26 if h f is inside CH(A) then price sky ← ∞ else price sky ← min h ∈ I h.p price dom ← the K-th greatest price among all service-sites not in U

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 27

4. Discussion Handling Multiple Non-spatial Attributes How to set K 28

1. Introduction HotelPrice ($) h1h1 100 h2h2 250 h3h3 200 h4h H = {h 1, h 2, h 3, h 4 } A = {a 1 } hotels attraction-site (e.g., Sea World) h4h4 a1a1 h2h2 h3h3 h1h1 Spatial LayoutPrice HotelDistance-to- SeaWorld (km) Price ($) h1h h2h h3h h4h Decision-Making Table Consider that a new company wants to open a new hotel h f hfhf, h f } h f ? h f 2.0 ? A can contain more than one attraction-site {a 1, a 2, a 3 } a2a2 a3a3 Problem (Finding K-Dominating Competitive Price): Given a set of existing hotels, a new hotel h f and an integer K, what greatest possible price can we set for h f such that (1) h f is in the skyline and (2) h f dominates at least K hotels. distance to a 1 … … … … distance to a 2 … … … … distance to a 3 … … … … Each hotel can be incorporated with multiple non-spatial attributes.

4. Discussion Handling Multiple Non-spatial Attributes How to set K 30

4. Discussion How to set K Existing Models in Economic and Business Research Studying Customer Retention/Attrition (e.g., demand-and- supply model) 31

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 32

5. Empirical Study Dataset Real Dataset Surf Hotel 20,000 hotels 4 attraction-sites Synthetic Dataset Objects in North American (e.g., roads, populated places and Cultural Landmarks) from Digital Chart of the World 200,000 hotels 20 attraction-sites 33

5. Empirical Study Algorithms Our Algorithm (3-Phase) Without Index With Index Two Competitive Algorithms Blind Try a set of possible prices in an increment count of 0.01 (i.e., 0.01, 0.02, 0.03, …) For each possible price, test whether it satisfies the skyline requirement and the K-dominating requirement. Pick the greatest price which satisfies the 2 requirements Guided Similar to Blind But, try a set of possible prices of existing hotels. 34

5. Empirical Study Default Parameter K = 20 35

5. Empirical Study Measurements Price p max,s denotes price sky used in the algorithm p max,d denotes price dom used in the algorithm p max denotes the final price used in the algorithm Query Time 36

5. Empirical Study 37 Synthetic dataset

5. Empirical Study 38 Real dataset

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 39

6. Related Work Skyline One example is “spatial skyline” (VLDB06 and TODS09) Price is given Application of Using Skyline KDD08: Find Customer Preferences Based on Skylines VLDB09: Find a Set of Products which are in the Skyline Most existing studies assume that an attribute value of a service-site is given. In this paper, we study how to find an attribute value, specifically price, of a new service-site. 40

Outline 1.Introduction 2.Problem Definition 3.Algorithm Spatial Approach 4.Discussion 5.Empirical Study 6.Related Work 7.Conclusion 41

7. Conclusion Finding K-Dominating Competitive Price First one to study the problem Spatial Approach Experiments 42

Q&A 43

Backup Slides 44

5. Empirical Study 45 Synthetic dataset No. of attractions

5. Empirical Study 46 Synthetic dataset No. of hotels