Hemera KickOff October 5th, 2010 Working Group B5 Efficient management of very large volumes of information for data- intensive applications Gabriel Antoniu,

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
A centre of expertise in digital information management Tools for the Trade? Supporting Multidisciplinary Research Dr Liz Lyon, Director.
Copyright 2012 Trend Micro Inc. Raimund Genes, CTO Innovation In Cloud Security.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Data warehouse example
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
The Data Warehouse and Technology. Some Technological  Manage large amounts of data  Manage data on a diverse media  Easily index and monitor data.
Preface Exponential growth of data volume, steady drop in storage costs, and rapid increase in storage capacity Inadequacy of the sequential processing.
SpaceGRID and EGSO Satu Keski-Jaskari Maria Vappula Parallal Computing – Seminar
Yannis Ioannidis University of Athens, Hellas Digital Libraries at a Crossroads Toward the Future Generation of Digital Library Mgmt Systems.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Cyberinfrastructure Supporting Social Science Cyberinfrastructure Workshop October Chicago Geoffrey Fox
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Cluj Napoca, 28 August IEEE International Conference on Intelligent Computer Communication and Processing Digital Libraries Workshop Towards.
Building Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Semantic web technologies for secure interoperability and.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
Opensource for Cloud Deployments – Risk – Reward – Reality
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Database and Data-Intensive Systems. Data-Intensive Systems From monolithic architectures to diverse systems Dedicated/specialized systems, column stores.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Building Scalable Web Archives Florent Carpentier, Leïla Medjkoune Internet Memory Foundation IIPC GA, Paris, May 2014.
1 Enabling Webscale Research in Europe Julien Masanès European Archive Foundation Consultation Workshop, Brussels, 19/1/2010.
Exploring the Applicability of Scientific Data Management Tools and Techniques on the Records Management Requirements for the National Archives and Records.
SeLeNe - Architecture George Samaras Kyriakos Karenos Larnaca – April 2003 THE UNIVERSITY OF CYPRUS.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Data Management Information Management Knowledge Management Data and Applications Security Challenges Bhavani Thuraisingham October 2006.
DOMENICO TALIA (joint work with M. Cannataro, A. Congiusta, P. Trunfio) DEIS University of Calabria ITALY Grid-Based Data Mining and.
Secure Sensor Data/Information Management and Mining Bhavani Thuraisingham The University of Texas at Dallas October 2005.
Data Mining By Dave Maung.
Héméra: Scientific Challenges using Grid’ Christian Perez INRIA, France.
Scenarios for a Learning GRID Online Educa Nov 30 – Dec 2, 2005, Berlin, Germany Nicola Capuano, Agathe Merceron, PierLuigi Ritrovato
Tel Aviv University - Industrial Engineering Department 1 Data Grid In Engineering TOC Grid Overview The PF5 definition: A very high-speed trans-European.
Major Disciplines in Computer Science Ken Nguyen Department of Information Technology Clayton State University.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu & Gagan Agrawal Enabling.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
The Astronomy challenge: How can workflow preservation help? Susana Sánchez, Jose Enrique Ruíz, Lourdes Verdes-Montenegro, Julian Garrido, Juan de Dios.
Data Mining with Big data
CS573 Data Privacy and Security Secure data outsourcing – Combining encryption and fragmentation.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
What does the Cloud mean for Data Management: Challenges and Opportunities Akrivi Vlachou Norwegian University of Science and Technology (NTNU), Trondheim,
MEDIGRID project, DataGrid FR meeting, April 18, 2002, Johan Montagnat, WP10 ACI GRID 2002 MEDIGRID: high performance medical image processing on a computational.
Content Challenges for Open Government Dale Waldt Sr. Analyst / Consultant
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
SAPIR Search in Audio-Visual Content using P2P Information Retrival For more information visit: Support.
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
MULTIMEDIA DATA MODELS AND AUTHORING
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu and Gagan Agrawal Enabling.
Semantic (web) activity at Elsevier Marc Krellenstein VP, Search and Discovery Elsevier October 27, 2004
Developing GRID Applications GRACE Project
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Social Impacts of Data Mining 2004/12/30. Outline Is data mining a hype or a persistent growing business? Is data mining merely managers’ business or.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Update on CHEP from the Computing Speaker Committee G. Carlino (INFN Napoli) on behalf of the CSC ICB, October
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Digital Video Library - Jacky Ma.
DBSI Teaser presentation The Beckman Report On Database Research
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Data and Applications Security Developments and Directions
University of Technology
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
Topics Covered in COSC 6340 Data models (ER, Relational, XML)
PBKM: A Secure Knowledge Management Framework
Toward WIS2.0 Shigeharu NISHIKAWA Japan Meteorological Agency
Big DATA.
Example of Event-Based Video Data (Touch-down Scenario)
Additional text exploring the video clip.
Presentation transcript:

Hemera KickOff October 5th, 2010 Working Group B5 Efficient management of very large volumes of information for data- intensive applications Gabriel Antoniu, Jean-Marc Pierson

Challenges Tremendous volumes of data (up to Petabytes), increasing every year Cloud infrastructures enforce this trend Large span of diverse applications Different modalities of data: images, text, video, raw values Distributed, heterogeneous, structured or not, semantically (en-)riched, confidential Stored in DFS or DDB, Cloud storage services, Warehouses

Aim of the WG Explore research issues related to high-level services for information management (search, mining, visualisation, processing) For large volumes of distributed data Taking into account –security, efficiency and heterogeneity –applications requirements –and the execution infrastructure (grids, clouds)

Issues to be addressed Low-level: –Fault-tolerance, caching, transport, security (encryption, confidentiality), consistency, location transparency Intermediate-level: –Interoperability among storage systems –Data indexing High-level: –Data mining, data classification, data assimilation, knowledge extraction, data visualization –Metadata management

Communities involved Distributed applications Distributed systems –clusters, grids, P2P, clouds Fault-tolerant systems Databases, data mining Security Numerical algorithms

Roadmap Identify research teams –Active in the area of the WG –With experience in data-intensive applications on Aladdin-G5K –And new comers… Organize workshops and possibly schools to share and disseminate experience and knowledge