Viet Tran, Institute of Informatics, SAS, Slovakia


Research activities in Big Data
Information Retrieval, Big Data Semantics, Graphs and Networks, Semantic Search
Multi-language Text Analysis
Knowledge Modeling, Ontologies

Products
RDB2Onto: Tool for mapping relational data to ontology individuals
ACoMA: Processes communication on the server side and attaches relevant knowledge to messages
EMBET: Experience Management based on Text Notes, an active and context-sensitive recommendation system
RIDAR: Relevant Internet Data Resource Identification
WEBCRAWLER: Recursively downloads web pages

Selected projects related to Big Data: EU projects
REDIRNET: Emergency Responder Data Interoperability Network
VENIS: Virtual Enterprises by Networked Interoperability Services
EUSAS: European Urban Simulation for Asymmetric Scenarios
COMMIUS: Community-based Interoperability Utility for SMEs
ADMIRE: Advanced Data Mining and Integration Research for Europe
And many Grid-related projects (EGI-InSPIRE; EGEE-1, 2, 3; DEGREE; Int.Eu.Grid; K-Wf Grid; MediGrid; CrossGrid)

Selected projects related to Big Data: National projects
CLAN: Cloud Computing for Big Data Analytics
SMART-II: Center of Excellence for SMART technologies
SIVVP: Slovak Infrastructure for High Performance Computing
VEGA: Selected methods, approaches and tools for distributed computing
VEGA: New methods and approaches on information processing and knowledge bases

HADOOP Production cluster
16 nodes of 24 Intel Xeon cores (384 cores in total)
48 GB RAM
1 TB
Software: Hadoop, Spark

HADOOP Development cluster
10 nodes of 12 Intel Xeon cores
48 GB RAM
512 GB storage
Software: Hadoop, Spark, Hive, Pig

Ongoing infrastructure
52 computing nodes of IBM dx360 M3 (2x 6-core Intel E5645, 48 GB RAM, 2x 500 GB scratch disk) + service nodes and storage
To be shared between cloud and Hadoop/Spark
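
The core counts on the cluster slides can be checked with a little arithmetic. The sketch below is illustrative only: the `Cluster` class and the cluster labels are hypothetical names. It assumes the development cluster's "12 Intel Xeon cores" is a per-node figure, as it is for the production cluster, and it counts only the 52 computing nodes of the ongoing infrastructure, not the service nodes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    """Hypothetical helper: a cluster described by node count and cores per node."""
    name: str
    nodes: int
    cores_per_node: int

    @property
    def total_cores(self) -> int:
        return self.nodes * self.cores_per_node


clusters = [
    Cluster("production", nodes=16, cores_per_node=24),
    # Assumption: 12 cores is per node, mirroring the production slide.
    Cluster("development", nodes=10, cores_per_node=12),
    # 2 sockets x 6-core Intel E5645 per IBM dx360 M3 node; service nodes excluded.
    Cluster("ongoing", nodes=52, cores_per_node=2 * 6),
]

totals = {c.name: c.total_cores for c in clusters}
for name, cores in totals.items():
    print(f"{name}: {cores} cores")

# Per-node figures for the ongoing infrastructure are stated explicitly on the slide.
ongoing_ram_gb = 52 * 48           # 48 GB RAM per node
ongoing_scratch_gb = 52 * 2 * 500  # two 500 GB scratch disks per node
print(f"ongoing: {ongoing_ram_gb} GB RAM, {ongoing_scratch_gb} GB scratch")
```

The production figure reproduces the 384 cores stated on the slide; the other totals are derived here, not quoted from the presentation.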