Multiscale, Multigranular Land Cover Classification: Performance Optimization Vijay Gandhi, Abhinaya Sinha 1 st May, 2006.

Slides:



Advertisements
Similar presentations
L3S Research Center University of Hanover Germany
Advertisements

CMSC 611: Advanced Computer Architecture Performance Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
SHREYAS PARNERKAR. Motivation Texture analysis is important in many applications of computer image analysis for classification or segmentation of images.
Introduction to MATLAB The language of Technical Computing.
Software & Services Group, Developer Products Division Copyright© 2010, Intel Corporation. All rights reserved. *Other brands and names are the property.
Mining Frequent Spatio-temporal Sequential Patterns
On the Interaction of Tiling and Automatic Parallelization Zhelong Pan, Brian Armstrong, Hansang Bae Rudolf Eigenmann Purdue University, ECE
Structured Sparse Principal Component Analysis Reading Group Presenter: Peng Zhang Cognitive Radio Institute Friday, October 01, 2010 Authors: Rodolphe.
Code Transformations to Improve Memory Parallelism Vijay S. Pai and Sarita Adve MICRO-32, 1999.
1 Spatial Big Data Challenges Intersecting Cloud Computing and Mobility Shashi Shekhar McKnight Distinguished University Professor Department of Computer.
Implementing the Probability Matrix Technique for Positron Emission Tomography By: Chris Markson Student Adviser: Dr. Kaufman.
ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.
Parallelizing Spatial Data Mining Algorithms: A case study with Multiscale and Multigranular Classification PGAS 2006 Vijay Gandhi, Mete Celik, Shashi.
Distributed Algorithms for Secure Multipath Routing
Computer Vision I Introduction Raul Queiroz Feitosa.
May 29, Final Presentation Sajib Barua1 Development of a Parallel Fast Fourier Transform Algorithm for Derivative Pricing Using MPI Sajib Barua.
Data Partitioning for Reconfigurable Architectures with Distributed Block RAM Wenrui Gong Gang Wang Ryan Kastner Department of Electrical and Computer.
Communication [Lower] Bounds for Heterogeneous Architectures Julian Bui.
Cliff Rhyne and Jerry Fu June 5, 2007 Parallel Image Segmenter CSE 262 Spring 2007 Project Final Presentation.
Parallel K-Means Clustering Based on MapReduce The Key Laboratory of Intelligent Information Processing, Chinese Academy of Sciences Weizhong Zhao, Huifang.
1 Context-Inclusive Approach to Speed-up Function Evaluation for Statistical Queries: An Extended Abstract Vijay Gandhi, James Kang, Shashi Shekhar University.
DATA LOCALITY & ITS OPTIMIZATION TECHNIQUES Presented by Preethi Rajaram CSS 548 Introduction to Compilers Professor Carol Zander Fall 2012.
CISC673 – Optimizing Compilers1/34 Presented by: Sameer Kulkarni Dept of Computer & Information Sciences University of Delaware Phase Ordering.
PROJECT MANAGER: YOUNGHOON JEON SYSTEM ARCHITECT: YOUNGHOON JUNG LANGUAGE GURU: JINHYUNG PARK SYSTEM INTEGRATOR: WONJOON SONG VALIDATION AND TESTING: AKSHAI.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
JPEG C OMPRESSION A LGORITHM I N CUDA Group Members: Pranit Patel Manisha Tatikonda Jeff Wong Jarek Marczewski Date: April 14, 2009.
Vectorization of the 2D Wavelet Lifting Transform Using SIMD Extensions D. Chaver, C. Tenllado, L. Piñuel, M. Prieto, F. Tirado U C M.
Design Space Exploration
Mining Optimal Decision Trees from Itemset Lattices Dr, Siegfried Nijssen Dr. Elisa Fromont KDD 2007.
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
INDEPENDENT COMPONENT ANALYSIS OF TEXTURES based on the article R.Manduchi, J. Portilla, ICA of Textures, The Proc. of the 7 th IEEE Int. Conf. On Comp.
1 Comparison of Principal Component Analysis and Random Projection in Text Mining Steve Vincent April 29, 2004 INFS 795 Dr. Domeniconi.
Assignment Comments. Report Structure Abstract Write at end Not just a list of contents – Not a to do list – Not a to be done list Summary of article.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
Image Classification 영상분류
Chapter 21 R(x) Algorithm a) Anomaly Detection b) Matched Filter.
Configurable, reconfigurable, and run-time reconfigurable computing.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
Fast Support Vector Machine Training and Classification on Graphics Processors Bryan Catanzaro Narayanan Sundaram Kurt Keutzer Parallel Computing Laboratory,
CISC Machine Learning for Solving Systems Problems Presented by: Alparslan SARI Dept of Computer & Information Sciences University of Delaware
IXA 1234 : C++ PROGRAMMING CHAPTER 1. PROGRAMMING LANGUAGE Programming language is a computer program that can solve certain problem / task Keyword: Computer.
LOGO A Convolution Accelerator for OR1200 Dawei Fan.
Autonomic scheduling of tasks from data parallel patterns to CPU/GPU core mixes Published in: High Performance Computing and Simulation (HPCS), 2013 International.
QCAdesigner – CUDA HPPS project
GPU Power Model Nandhini Sudarsanan Nathan Vanderby Neeraj Mishra Usha Vinodh
08/10/ NRL Hybrid QR Factorization Algorithm for High Performance Computing Architectures Peter Vouras Naval Research Laboratory Radar Division Professor.
The Hashemite University Computer Engineering Department
Image Processing A Study in Pixel Averaging Building a Resolution Pyramid With Parallel Computing Denise Runnels and Farnaz Zand.
Rate-Based Query Optimization for Streaming Information Sources Stratis D. Viglas Jeffrey F. Naughton.
Linear Analysis and Optimization of Stream Programs Masterworks Presentation Andrew A. Lamb 4/30/2003 Professor Saman Amarasinghe MIT Laboratory for Computer.
1 Double-Patterning Aware DSA Template Guided Cut Redistribution for Advanced 1-D Gridded Designs Zhi-Wen Lin and Yao-Wen Chang National Taiwan University.
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
SKMM1013 Programming for Engineers
A 2 veto for Continuous Wave Searches
Hiba Tariq School of Engineering
Parallel Programming By J. H. Wang May 2, 2017.
High Performance Computing on an IBM Cell Processor --- Bioinformatics
You can check broken videos in this slide here :
Improving java performance using Dynamic Method Migration on FPGAs
Content-based Image Retrieval
PROBLEM SOLVING CSC 111.
Multiple Instance Learning: applications to computer vision
Kenneth Moreland Edward Angel Sandia National Labs U. of New Mexico
병렬처리시스템 2005년도 2학기 채 수 환
A Block Based MAP Segmentation for Image Compression
Implementation of a De-blocking Filter and Optimization in PLX
Reporter: Wenkai Cui Institution: Tsinghua University Date:
Presentation transcript:

Multiscale, Multigranular Land Cover Classification: Performance Optimization Vijay Gandhi, Abhinaya Sinha 1 st May, 2006

2 Outline Motivation Problem Statement Related Work Our contributions Results Conclusion

3 Motivation Forest Management Identification of Timber land Habitat Management: Hardwood is an habitat for the endangered spotted owl Courtesy: Junchang Ju (Boston U)

4 Problem Statement Given: Remotely sensed image, land cover class label hierarchy Find: Produce images with land cover classes assigned at multiple spatial scales Objective: Minimize execution time Constraints: Algorithm to achieve the output is already present

5 Problem Statement illustrated Remotely Sensed Image Label Hierarchy C – Conifer H – Hardwood F – Forest

6 Related Work Current implementation in MATLAB GCC Compiler Optimization options Parallel Computing

7 Algorithm Divide input image into dyadic segments recursively Calculate the log-likelihood for each segment using Expectation Maximization Decide the label for each segment and generate images

8 Contribution Improved performance of the algorithm  conversion of existing MATLAB implementation to C programming language Identified the need for parallelizing the C implementation

9 Performance improvement Converted MATLAB implementation to C Why C?  Compiled vs. interpreted language  Compiler directives  Passing by reference MATLAB implementation: 1500 lines of code Issues: Memory management, Indexing, Matrix operations

10 Code Profiling Input Image: 128 x 128 Function # Calls% Time mix_product_elementwise 2,887,869, * mix_divide_by_scalar 2,891,482, * gen_loglike 294,  gen_loglike calculates the log-likelihood of a general class  It uses Expectation Maximization  # gen_loglike = f ( # general classes, image size, spatial scale)  # mix_product_elementwise = f ( spatial scale) * These functions are part of gen_loglike and take 25% of gen_loglike()’s execution time or 20% of overall execution time

11 Parallelizing implementation mix_product_elementwise – multiplies two vectors element-by-element C = A * B C[n] = A[n] * B[n] Use Unified Parallel C (UPC) to parallelize Execution Time ( milliseconds ) # CallsWith ParallelWithout Parallel 100, ,000,

12 Parallelizing implementation Matrix level parallelization did not help Need for parallelizing at the function level Nice to have directives that parallelize a function rather than an individual matrix operation

13 Validation Methodology Tests on real data sets  128 x 128 pixels  64 x 64 pixels Plymouth County, Massachusetts (each pixel – 30m x 30m)  Experimental setup:  Candidates: C and MATLAB implementations  Sun Solaris 512 MB RAM, 1.6 GHz

14 Experimental Validation Execution Time (Seconds)

15 Experimental Validation Execution Time (Seconds)

16 Conclusion Converted from MATLAB to C implementation Improvement in performance Identified the need for parallelizing

17 Future Work Parallelizing the code

18 References Multiscale Multigranular Land Cover Characterization, Kolaczyk E.D. et al, Journal of the American Statistical Association, 2005 Probability and Statistics, 3 rd edition, DeGroot, Schervish Computer Architecture, 2 nd edition, Patterson, Henessey Introduction to Parallel Computing, 2 nd edition, Ananth Grama et al MATLAB code - GCC Optimization - Options.htmlhttp://gcc.gnu.org/onlinedocs/gcc/Optimize- Options.html

19 Acknowledgements Dr. Shashi Shekhar Dr. Eric Kolaczyk Dr. Sucharita Gopal Dr. Junchang Ju

20

21 Problem Statement illustrated Courtesy: Junchang Ju (Boston U)

22 Profile results