Introduction Using time property and location property from lost items’ pictures, we construct the Lost and Found System which combined with image search.

Slides:



Advertisements
Similar presentations
1 DynaMat A Dynamic View Management System for Data Warehouses Vicky :: Cao Hui Ping Sherman :: Chow Sze Ming CTH :: Chong Tsz Ho Ronald :: Woo Lok Yan.
Advertisements

1 Chapter 5 : Query Processing and Optimization Group 4: Nipun Garg, Surabhi Mithal
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
A Crowd-Enabled Approach for Efficient Processing of Nearest Neighbor Queries in Incomplete Databases Samia Kabir, Mehnaz Tabassum Mahin Department of.
Answering Metric Skyline Queries by PM-tree Tomáš Skopal, Jakub Lokoč Department of Software Engineering, FMP, Charles University in Prague.
Implementation of the DDI at the Roper Center A Pilot Project on Resource Integration Marc Maynard and Hui Wang The Roper Center.
Finding Similar Music Artists for Recommendation Presented by :Abhay Goel, Prerak Trivedi.
By Liqiang Cheng, Naiqi Jin and Jason Yap. Project Description Project summary: A Geo-spatial search system that collects and combines data from various.
The Geant4 physics validation repository
Face Recognition Data Search Tool COMP6703 PRESENTATION Presented by Yan Gao u Supervisor: Professor Tom Gedeon.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
Methodology Conceptual Database Design
What is Redmine? If you search for a free project management tool most likely you will end up with Redmine. This is an open source Ruby on Rails web application,
Centralized and Client/Server Architecture and Classification of DBMS
Hadoop Team: Role of Hadoop in the IDEAL Project ●Jose Cadena ●Chengyuan Wen ●Mengsu Chen CS5604 Spring 2015 Instructor: Dr. Edward Fox.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
Implementation Yaodong Bi. Introduction to Implementation Purposes of Implementation – Plan the system integrations required in each iteration – Distribute.
COMPUTER MODELS FOR SKY IMAGE ANALYSIS OF THE INASAN ZVENIGOROD OBSERVATORY Sergei Pirogov ( Institute of Astronomy, Russian Academy of Sciences) VIIth.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
天文信息技术联合实验室 New Progress On Astronomical Cross-Match Research Zhao Qing.
Better way for sorting – heap sort , , Department of Computer Science and Information Engineering, Chung Cheng University, Chayi, Taiwan.
MySQL. Dept. of Computing Science, University of Aberdeen2 In this lecture you will learn The main subsystems in MySQL architecture The different storage.
Maximal Vector Computation in Large Data Sets The 31st International Conference on Very Large Data Bases VLDB 2005 / VLDB Journal 2006, August Parke Godfrey,
1 Converting Categories to Numbers for Approximate Nearest Neighbor Search 嘉義大學資工系 郭煌政 2004/10/20.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters Hung-chih Yang(Yahoo!), Ali Dasdan(Yahoo!), Ruey-Lung Hsiao(UCLA), D. Stott Parker(UCLA)
Garrett Poppe, Liv Nguekap, Adrian Mirabel CSUDH, Computer Science Department.
Large-scale Incremental Processing Using Distributed Transactions and Notifications Daniel Peng and Frank Dabek Google, Inc. OSDI Feb 2012 Presentation.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Google Fusion Tables: Web-Centered Data Management and Collaboration Hector Gonzalez, Alon Y. Halevy, Christian S. Jensen, Anno Langen, Jayant Madhavan,
Live Demo Augmented reality – lets see some pictures flying…Augmented reality – lets see some pictures flying… Facebook -Facebook -
Large scale IP filtering using Apache Pig and case study Kaushik Chandrasekaran Nabeel Akheel.
Efficient Computation of Reverse Skyline Queries VLDB 2007.
DataBases & Data Mining Joined Specialization Project „Data Mining Classification Tool” By Mateusz Żochowski & Jakub Strzemżalski.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Large scale IP filtering using Apache Pig and case study Kaushik Chandrasekaran Nabeel Akheel.
Efficient Processing of Top-k Spatial Preference Queries
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Query Sensitive Embeddings Vassilis Athitsos, Marios Hadjieleftheriou, George Kollios, Stan Sclaroff.
 Frequent Word Combinations Mining and Indexing on HBase Hemanth Gokavarapu Santhosh Kumar Saminathan.
Data and Knowledge Engineering Laboratory Clustered Segment Indexing for Pattern Searching on the Secondary Structure of Protein Sequences Minkoo Seo Sanghyun.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
HEMANTH GOKAVARAPU SANTHOSH KUMAR SAMINATHAN Frequent Word Combinations Mining and Indexing on HBase.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Introduction A sorting algorithm is an algorithm that puts elements of a list in a certain order. The most-used orders are numerical order. Efficient sorting.
Data Management Bin Yao (Assistant Professor) Department of computer science and engineering Shanghai Jiao Tong University.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Mehdi Kargar Department of Computer Science and Engineering
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Abstract Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for.
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Spatial Online Sampling and Aggregation
CS & CS Capstone Project & Software Development Project
Put your name here Name of the Department, School or College
Put your name here Name of the Department, School or College
A Restaurant Recommendation System Based on Range and Skyline Queries
Interpret the execution mode of SQL query in F1 Query paper
Put your name here Name of the Department, School or College
Put your name here Department of What, School or College
R-tree – Another Example (1/2)
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

Introduction Using time property and location property from lost items’ pictures, we construct the Lost and Found System which combined with image search technology. We find owners of lost items by similarity, computed in multiple dimension by using Skyline Algorithm, between items and owners. Due to the BigData century, our system is based on MapReduce architecture; having effective processing performance to reduce traditional human resource’s cost on finding items. In the poster, we will show details and operations of our system and display final results. Materials and methods (1)Nearest Neighbor algorithm NN algorithm will find the nearest one from the origin vertexes. After getting the point, it will ignore other points that can not replace the one we found. (2)Branch and Bound Skyline algorithm This algorithm stores all points into R-tree and use Heap to find out all dominant points. It solves the problem that NN algorithm will read and operate repeatedly in the multi-dimensional. (3)MapReduce Cloud architecture(fig. 1) which is proposed by Google is mainly used for massively data parallel computing. "Map" and "Reduce " are main concepts. Map is to do specified and independently operations to each element in set. Reduce is to merge a collection of elements appropriately and simplify its operation and results. Acknowledgments We thank George Chang for laboratory assistance and Phillip Chang for questionable statistical advice. Funding for this project was provided by the Ministry of Science and Technology and Data Management+ laboratory. Results After taking pictures of items that users found, users would upload pictures to the system. Our systems would run on Hadoop to get results. Figure 4. Found Interface Figure 5. Result Interface Table 1. Table 2. Conclusions By Installing our system’s APP version or using our system’s web version, users can create records for their items; user who find items can upload items’ pictures to database. After uploading lost items’ pictures, we use time property and location property from lost items’ pictures to compare with users’ items; system would find the likely users and notify them. Accurately analyze information, effectively reduce the time of processing, and correctly find items’ owners 郭柏辰 許任傑 Department of Computer Science, National Chung Cheng University, Chiayi, Taiwan Literature cited Roussopoulos, N., Kelly, S., Vincent, F. Nearest Neighbor Queries, SIGMOD Rec., D. Papadias, Y. Tao, G. Fu, B. Seeger, An Optimal and Progressive Algorithm for Skyline Queries, In Proceedings of the 2003 ACM SIGMOD International Conference on Managementof Data, J. Dean and S. Ghemawat, MapReduce: Simplified Data Processing on Large Clusters,Communications of the ACM, A. Guttman, R-trees: A Dynamic Index Structure for Spatial Searching, In Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data, For further information Please contact More information on this and related projects can be obtained at dmplus.cs.ccu.edu.tw. Effective query of lost property base on skyline and map-reduce We use the following three properties to compute the result (1)Location We record users’ paths by GPS. If user finds an item and upload its picture, we can compute from users who went through these areas. The operation is like Fig. 2, the lost item’s location is at point a and User1, User2, and User3 are likely users. We find the nearest point on each path such as point a, b, and c in Fig. 2 by KNN algorithm. Figure 2. path (2)Time Using the time property of lost items’ pictures, we compute the difference between users’ and lost items’ time. In Fig.3, the time of finding lost item is Ta and the time of discovering the item is missing is Tb. Figure 3. time (3)Image Similarity After computing the first two properties, we use PerceptualDiff API to compute the similarity between pictures from users and pictures of lost items. Integrating the above description, we can get the follow formula : Ws + Wt + Wp = 1 (100%) (1) In formula (1), weight of location is Ws, weight of time is Wt, and weight of picture is Wp. Each weight means one dimension, and we use Skyline algorithm to compute the final result. data amount(K)execution time(sec) JAVA Hadoop data amount(K)execution time(sec) JAVA Hadoop data amount(K)execution time(sec) JAVA Hadoop Table 3. According to tables 1, 2, and 3, we found that the execution time of Hadoop did not increase much when data amount got bigger; on the contrary, the execution time of JAVA increase much. Figure 1. MapReduce Architecture Figure 6. Users would receive mails when their items was found