Cosine similarity metric calculation on low power heterogeneous computing platform Michał Karwatowski 1,2, Sebastian Koryciak 1,2, Ernest Jamro 1,2, Agnieszka.

Slides:



Advertisements
Similar presentations
Grzegorz Wianecki, Mateusz Wójcik, Włodzimierz Funika, Marcin Smętek 1 Institute of Computer Science AGH, Mickiewicza 30, Kraków, Poland 2 Academic.
Advertisements

QuIDE was used during the Quantum Computation classes at DCS AGH The students assessed the usability with the System Usability Scale QuIDE was compared.
Implementation of the Convolution Operation on General Purpose Processors Ernest Jamro AGH Technical University Kraków, Poland.
Bryan Lahartinger. “The Apriori algorithm is a fundamental correlation-based data mining [technique]” “Software implementations of the Aprioiri algorithm.
Computes the partial dot products for only the diagonal and upper triangle of the input matrix. The vector computed by this architecture is added to the.
ECE 699: Lecture 1 Introduction to Zynq.
Technical Specification / Schedule Department of Computer Science and Engineering Michigan State University Spring 2007 Team : CSE 498, Collaborative Design.
A System Solution for High- Performance, Low Power SDR Yuan Lin 1, Hyunseok Lee 1, Yoav Harel 1, Mark Woh 1, Scott Mahlke 1, Trevor Mudge 1 and Krisztian.
Presenting: Itai Avron Supervisor: Chen Koren Final Presentation Spring 2005 Implementation of Artificial Intelligence System on FPGA.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space User Oriented Provisioning of Secure Virtualized.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
OPL: Our Pattern Language. Background Design Patterns: Elements of Reusable Object-Oriented Software o Introduced patterns o Very influential book Pattern.
Final presentation Encryption/Decryption on embedded system Supervisor: Ina Rivkin students: Chen Ponchek Liel Shoshan Winter 2013 Part A.
Utilising software to enhance your research Eamonn Hynes 5 th November, 2012.
Polish Infrastructure for Supporting Computational Science in the European Research Space Policy Driven Data Management in PL-Grid Virtual Organizations.
Advanced Grid-Enabled System for Online Application Monitoring Main Service Manager is a central component, one per each.
Agenda What can I do with LabVIEW? Why LabVIEW (text based comparison) Re-visit the State Machine Intro to common LabVIEW Architectures.
Study of the parallel techniques for dimensionality reduction and its impact on quality of the text processing algorithms Marcin Pietroń 1,2, Maciej Wielgosz.
The versatile hardware accelerator framework for sparse vector calculations Michał Karwatowski 1,2, Kazimierz Wiatr 12 1 AGH University of Science and.
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
Energy efficient calculations of text similarity measure on FPGA-accelerated computing platforms Michał Karwatowski 1,2, Paweł Russek 1,2, Maciej Wielgosz.
In each iteration macro model creates several micro modules, sends data to them and waits for the results. Using Akka Actors for Managing Iterations in.
The Java profiler based on byte code analysis and instrumentation for many-core hardware accelerators Marcin Pietroń 1,2, Michał Karwatowski 1,2, Kazimierz.
Performance Enhancement of Video Compression Algorithms using SIMD Valia, Shamik Jamkar, Saket.
Guiding Principles. Goals First we must agree on the goals. Several (non-exclusive) choices – Want every CS major to be educated in performance including.
Experience with the OpenStack Cloud for VPH Applications Jan Meizner 1, Maciej Malawski 1,2, Piotr Nowakowski 1, Paweł Suder 1, Marian Bubak 1,2 AGH University.
Company LOGO Mid semester presentation Spring 2008/9 Performed by: Alexander PavlovDavid Domb Supervisor: Mony Orbach GPS/INS Computing System.
AKOGRIMO Integration of Grid services with mobile technologies; validation in e-health, e-learning and disaster management areas CoreGRID European Grid.
Scalarm: Scalable Platform for Data Farming D. Król, Ł. Dutka, M. Wrzeszcz, B. Kryza, R. Słota and J. Kitowski ACC Cyfronet AGH KU KDM, Zakopane, 2013.
Panel 21 July, 2015 Panel Exascale computing systems in e-Infrastructures at HPCS 2015 – The International Conference on High Performance Computing & Simulation.
Floating-Point Divide and Square Root for Efficient FPGA Implementation of Image and Signal Processing Algorithms Xiaojun Wang, Miriam Leeser
FPGA Based Smoke Simulator Jonathan Chang Yun Fei Tianming Miao Guanduo Li.
KUKDM’2011, Zakopane Semantic Based Storage QoS Management Methodology Renata Słota, Darin Nikolow, Jacek Kitowski Institute of Computer Science AGH-UST,
Knowledge based Question Answering System Anurag Gautam Harshit Maheshwari.
TI-84 and TI Connect Sammi Lindstrom. Agenda Why so many cables and what are they used for? (10 minutes) What software should be on my desktop? (5 minutes)
GPS Computer Program Performed by: Moti Peretz Neta Galil Supervised by: Mony Orbach Spring 2009 Part A Presentation High Speed Digital Systems Lab Electrical.
Shouqing Hao Institute of Computing Technology, Chinese Academy of Sciences Processes Scheduling on Heterogeneous Multi-core Architecture.
Company LOGO Final presentation Spring 2008/9 Performed by: Alexander PavlovDavid Domb Supervisor: Mony Orbach GPS/INS Computing System.
Performed By: Itamar Niddam and Lior Motorin Instructor: Inna Rivkin Bi-Semesterial. Winter 2012/2013 3/12/2012.
Parameter Sweep and Resources Scaling Automation in Scalarm Data Farming Platform J. Liput, M. Paciorek, M. Wrona, M. Orzechowski, R. Slota, and J. Kitowski.
Understanding Parallel Computers Parallel Processing EE 613.
Performed by: Itamar Niddam and Lior Motorin Instructor: Inna Rivkin המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון - מכון.
Cluster Analysis Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Lecture 0. Course Introduction Prof. Taeweon Suh Computer Science Education Korea University COM509 Computer Systems.
High Speed Digital Systems Lab June 2008 Acceleration of Economic Calculation Developers: Ayal Ozer and Eyal Efrat Mentor: Michael Yampolsky Black & Scholes.
Reconfigurable Supercomputing (2) Key Issues in HPC  Leveling off of performance Traditional Scalar/Vector – long product cycles, too few vendors.
Students: Inna Sigal and Yuval Bar-Tal Supervisor: Tsachi Martsiano Spring 2015.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Chapter I: Introduction to Computer Science. Computer: is a machine that accepts input data, processes the data and creates output data. This is a specific-purpose.
Implementing RISC Multi Core Processor Using HLS Language - BLUESPEC Liam Wigdor Instructor Mony Orbach Shirel Josef Semesterial Winter 2013.
Gedae, Inc. Implementing Modal Software in Data Flow for Heterogeneous Architectures James Steed, Kerry Barnes, William Lundgren Gedae, Inc.
PL-Grid: Polish Infrastructure for Supporting Computational Science in the European Research Space 1 ESIF - The PLGrid Experience ACK Cyfronet AGH PL-Grid.
Co-Designing Accelerators and SoC Interfaces using gem5-Aladdin
Supported in part by NIST/U.S. Department of Commerce
Time-borrowing platform in the Xilinx UltraScale+ family of FPGAs and MPSoCs Ilya Ganusov, Benjamin Devlin.
From VPH-Share to PL-Grid: Atmosphere as an Advanced Frontend
DI4R Conference, September, 28-30, 2016, Krakow
ENG3050 Embedded Reconfigurable Computing Systems
Genomic Data Clustering on FPGAs for Compression
Highly Efficient and Flexible Video Encoder on CPU+FPGA Platform
Computing Disciplines Florida Gulf Coast University
FPGA Interconnection Algorithm
Implementation of a GNSS Space Receiver on a Zynq
Session 5: Computer Time: 2:30 - 3:20 pm Objectives Tasks
Introduction: Why Study Algorithms?
Welcome to the FPGA Tools Course Agenda
LANMC: LSTM-Assisted Non-Rigid Motion Correction
Nagios with The Decision Engine Implementing Passive Checks
A Survey of Interactive Execution Environments
Presentation transcript:

Cosine similarity metric calculation on low power heterogeneous computing platform Michał Karwatowski 1,2, Sebastian Koryciak 1,2, Ernest Jamro 1,2, Agnieszka Dąbrowska-Boruch 1,2, Kazimierz Wiatr 1 1 AGH University of Science and Technology, al. Mickiewicza 30, Kraków, 2 ACK Cyfronet AGH, ul. Nawojki 11, Kraków KUKDM Zakopane

Agenda FPGA based hardware accelerated computing Text similarity analysis Search algorithm implementation Results Future work 2

FPGA based hardware accelerated computing 3

Text similarity analysis 4

Text comparison 5

Hardware 6

Zynq 7

Hardware architecture 8

Compare flow 9

Compare system 10

Tests 100,000 random documents processed to vector form Zynq software solution: One and two cores ARM Cortex-A9 667 MHz Zynq PS + PL solution 8 paralel channels 100 MHz 11

Runtime comparison 12

Future work Compression High performance hardware Higher level language 13

Text comparison 14

Cluster 15

Questions 16