Research Meeting 2009-11-19 Jaeseok Myung. Copyright 2009 by CEBT. Summary: TA DB: midterm grading (mean 66.04, standard deviation 18.29; previous semester mean 59.33, standard deviation 15.42).

Presentation transcript:

Research Meeting Jaeseok Myung

Summary

- TA
  - DB: midterm grading
    - Mean: 66.04, standard deviation: 18.29
    - Previous semester mean: 59.33, standard deviation: 15.42
  - WEC: classroom( ), instructor (SKT, Manager 정준용)
- Seoul National University mentoring in progress
  - Administrator workflow: due this week (3 weeks)
  - DB schema and matching module design: planned (1-2 weeks)
- Research
  - SPARQL BGP Processing with Iterative MR
    - Using finer keys for map tasks => Scalability
    - Using advanced storage for selection task => Performance
    - Using selectivity for BGP analysis
    - Using MR pipelining

Center for E-Business Technology

Using Advanced Storage for Selection Task

SELECT ?A ?B ?C ?D …
WHERE { ?A memberOf ?B. ?B type Univ. … }

Triples (BigTable):

S     P          O
a1    memberOf   b1
      type       Univ
…     …          …

Selection output:

TP_NO   A      B      …
1       a1     b1     …
2       NULL          …
…       …      …

[Figure: Selection over BigTable]
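The selection step on this slide can be sketched in a few lines of Python. This is only an illustrative in-memory version, not the implementation from the talk: the function names (`match`, `selection`) are hypothetical, the subject `b1` of the second triple is assumed because it did not survive in the transcript, and unbound variables are represented as `None` in place of NULL.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]


def is_variable(term: str) -> bool:
    """SPARQL variables start with '?'."""
    return term.startswith("?")


def match(pattern: Triple, triple: Triple) -> Optional[Dict[str, str]]:
    """Return variable bindings if the triple matches the pattern, else None."""
    binding: Dict[str, str] = {}
    for p_term, t_term in zip(pattern, triple):
        if is_variable(p_term):
            # A variable repeated inside one pattern must bind to the same value.
            if binding.setdefault(p_term, t_term) != t_term:
                return None
        elif p_term != t_term:
            return None
    return binding


def selection(triples: List[Triple], patterns: List[Triple],
              variables: List[str]) -> List[tuple]:
    """Emit one row (TP_NO, bindings...) per matching triple and pattern."""
    rows = []
    for tp_no, pattern in enumerate(patterns, start=1):
        for triple in triples:
            binding = match(pattern, triple)
            if binding is not None:
                # Variables the pattern does not mention stay None (NULL).
                rows.append((tp_no,) + tuple(binding.get(v) for v in variables))
    return rows


# Sample data; the subject of the second triple is an assumption.
triples = [("a1", "memberOf", "b1"), ("b1", "type", "Univ")]
patterns = [("?A", "memberOf", "?B"), ("?B", "type", "Univ")]
rows = selection(triples, patterns, ["?A", "?B"])
for row in rows:
    print(row)
```

This version scans every triple for every pattern, which is the full-scan cost that an indexed store is meant to avoid.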

Optimization for Selection Task

- Triple Indexing
  - SP-O, SO-P, PS-O, PO-S, OS-P, OP-S, S-PO, P-SO, O-SP
  - Example triple patterns: Jaeseok MemberOf ?X; Jaeseok ?x IDS; Jaeseok ?x ?y
  - [Figure: BigTable layouts keyed by P, by O, and by S]
  - Eliminating full table scans requires up to 3 times more disk space
  - Dictionary encoding reduces disk usage
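One way to read the index names above is "bound positions, then free positions": a pattern with S and P bound is answered from SP-O, a pattern with only S bound from S-PO. The sketch below shows that idea with in-memory dictionaries. It is an assumption about how the indexes are used, not code from the talk; `choose_index`, `build_index`, and `lookup` are hypothetical names, and the second sample triple is invented for illustration.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]
POSITIONS = "SPO"


def is_variable(term: str) -> bool:
    return term.startswith("?")


def choose_index(pattern: Triple) -> Optional[str]:
    """Name the index whose key covers exactly the bound positions."""
    bound = "".join(n for n, t in zip(POSITIONS, pattern) if not is_variable(t))
    free = "".join(n for n, t in zip(POSITIONS, pattern) if is_variable(t))
    if not bound or not free:
        return None  # nothing bound (full scan) or everything bound
    return f"{bound}-{free}"


def build_index(triples: List[Triple], name: str) -> Dict[tuple, List[tuple]]:
    """Build one index, e.g. 'SP-O' maps (s, p) to a list of (o,) tuples."""
    key_part, value_part = name.split("-")
    index: Dict[tuple, List[tuple]] = defaultdict(list)
    for triple in triples:
        by_pos = dict(zip(POSITIONS, triple))
        key = tuple(by_pos[c] for c in key_part)
        index[key].append(tuple(by_pos[c] for c in value_part))
    return index


def lookup(indexes: Dict[str, Dict[tuple, List[tuple]]],
           pattern: Triple) -> List[tuple]:
    """Answer a triple pattern with a single key lookup instead of a scan."""
    name = choose_index(pattern)
    if name is None:
        raise ValueError("pattern needs a full scan or an exact-match check")
    key = tuple(t for t in pattern if not is_variable(t))
    return indexes[name].get(key, [])


# Sample data; the second triple is hypothetical.
triples = [("Jaeseok", "memberOf", "IDS"), ("Jaeseok", "type", "Student")]
names = ("SP-O", "SO-P", "PO-S", "S-PO", "P-SO", "O-SP")
indexes = {name: build_index(triples, name) for name in names}

print(lookup(indexes, ("Jaeseok", "memberOf", "?x")))
print(lookup(indexes, ("Jaeseok", "?x", "IDS")))
print(lookup(indexes, ("Jaeseok", "?x", "?y")))
```

Each index stores every triple once, so keeping several of them multiplies the storage footprint, which is the disk-space trade-off the slide mentions.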

Optimization for Selection Task

- Implementation
  - Using Hadoop - HBase
  - Adding Data Loader Component: N-Triple => HBase
  - Implementation of Selection Tasks using HBase
  - Comparison between N-Triple and HBase
- Dictionary Encoding
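Dictionary encoding replaces each distinct term with a small integer ID, so repeated terms are stored once and the indexes hold only IDs. The following is a minimal in-memory sketch of the idea, assuming sequential IDs starting at 0; the `Dictionary` class is hypothetical and says nothing about how the encoding was actually stored in HBase.

```python
from typing import Dict, List, Tuple


class Dictionary:
    """Bidirectional mapping between RDF terms and integer IDs."""

    def __init__(self) -> None:
        self._term_to_id: Dict[str, int] = {}
        self._id_to_term: List[str] = []

    def encode(self, term: str) -> int:
        """Return the ID for a term, assigning the next free ID if it is new."""
        term_id = self._term_to_id.get(term)
        if term_id is None:
            term_id = len(self._id_to_term)
            self._term_to_id[term] = term_id
            self._id_to_term.append(term)
        return term_id

    def decode(self, term_id: int) -> str:
        """Return the term that was assigned this ID."""
        return self._id_to_term[term_id]

    def encode_triple(self, triple: Tuple[str, str, str]) -> Tuple[int, ...]:
        return tuple(self.encode(term) for term in triple)

    def decode_triple(self, ids: Tuple[int, ...]) -> Tuple[str, ...]:
        return tuple(self.decode(term_id) for term_id in ids)


dictionary = Dictionary()
first = dictionary.encode_triple(("a1", "memberOf", "b1"))
second = dictionary.encode_triple(("b1", "type", "Univ"))
print(first, second)
print(dictionary.decode_triple(second))
```

Because `b1` appears in both triples, it receives a single ID that both encoded triples share.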