Data Compression Conference 2013 Chenggang Yan, Yongdong Zhang, Feng Dai and Liang Li

Presentation transcript:

Outline  Introduction  Related Work  Proposed Method  Experimental Results  Conclusion 2

Introduction (1/2)
- HEVC coding tree unit (CTU)

Introduction (2/2)
- Local parallel method (LPM)
  - The maximum parallelism of LPM is at most 8.
  - Independent PUs (IPUs)
- Directed acyclic graph (DAG)

Related Work (1/2)
- Local parallel method (LPM) [16]
  - Motion estimation region (MER)

[16] Minhua Zhou, "AHG10: Configurable and CU-group level parallel merge/skip," JCTVC-H0082, Feb. 2012.

Related Work (2/2)
- Local parallel method (LPM)
  - M = 16 or 8

Proposed Method  A. Data Dependency Analysis  B. DAG for CTUs  C. Highly Parallel Framework 7

Proposed Method.A (1/3)
- Independent PUs (IPUs)
  - The IPU's left boundary and the MER's left boundary do not overlap.
  - The IPU's upper boundary and the MER's upper boundary do not overlap.

Proposed Method.A (2/3)

Proposed Method.A (3/3)
- Neighboring CTUs: left, upper, upper-left, upper-right

Proposed Method  A. Data Dependency Analysis  B. DAG for CTUs  C. Highly Parallel Framework 11

Proposed Method.B (1/4)
- Generate a DAG to capture the dependency relationships among CTUs.

Proposed Method.B (2/4)
- A DAG consists of a set of vertices V and a set of edges E.
  - Each data dependency is represented by an edge.
  - Once a CTU has been processed, it is removed from the DAG.

Proposed Method.B (3/4)
- Condition matrix (CM)
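The slide's figure of the condition matrix is not preserved in this transcript. As a minimal sketch, assuming the CM holds, for each CTU, the number of in-frame neighboring CTUs it depends on (left, upper, upper-left, upper-right), it could be initialized as follows. The names `DEPENDENCY_OFFSETS` and `build_condition_matrix` are hypothetical and do not come from the slides.

```python
# Offsets (row, column) of the CTUs that a given CTU is assumed to depend on:
# left, upper, upper-left, upper-right.
DEPENDENCY_OFFSETS = [(0, -1), (-1, 0), (-1, -1), (-1, 1)]


def build_condition_matrix(rows, cols):
    """Return a rows x cols matrix counting the in-frame dependencies of each CTU."""
    cm = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            for di, dj in DEPENDENCY_OFFSETS:
                ni, nj = i + di, j + dj
                # Neighbors outside the frame do not exist and are not counted.
                if 0 <= ni < rows and 0 <= nj < cols:
                    cm[i][j] += 1
    return cm


if __name__ == "__main__":
    for row in build_condition_matrix(3, 4):
        print(row)
```

Under these assumptions only the top-left CTU starts with a count of zero, so it is the only CTU that can be processed immediately.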

Proposed Method.B (4/4)

Proposed Method  A. Data Dependency Analysis  B. DAG for CTUs  C. Highly Parallel Framework 16

Proposed Method.C (1/5)

Proposed Method.C (2/5)
- Step 1: Initialize DQ and CM. DQ is a waiting queue. CM records, for each CTU, the number of related CTUs that must be processed before it.
- Step 2: When values in the CM become zero, get the corresponding coordinates and push them into DQ.

Proposed Method.C (3/5)
- Step 3: Get coordinates from DQ and process the corresponding CTUs in parallel on the many-core platform.
- Step 4: Update CM. When the CTU at coordinate (i, j) has been processed, the CM values at coordinates (i+1, j), (i+1, j-1), (i, j+1) and (i+1, j+1) are each decremented by one.
- Step 5: Repeat Steps 2 to 4 until all CTUs in the frame have been processed.
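Steps 1 to 5 can be sketched as a small scheduler. This is an illustration under stated assumptions, not the authors' implementation: it assumes (i, j) means (row, column), that each CTU depends on its left, upper, upper-left and upper-right neighbors, and that all CTUs currently in DQ are processed as one batch before CM is updated. The names `schedule_frame`, `process_ctu` and `in_frame` are hypothetical, and a thread pool merely stands in for the many-core platform.

```python
from collections import deque
from concurrent.futures import ThreadPoolExecutor

# Assumed dependencies of a CTU: left, upper, upper-left, upper-right.
DEPENDENCY_OFFSETS = [(0, -1), (-1, 0), (-1, -1), (-1, 1)]
# CTUs whose CM entry is decremented in Step 4 after (i, j) is processed:
# (i+1, j), (i+1, j-1), (i, j+1), (i+1, j+1).
SUCCESSOR_OFFSETS = [(1, 0), (1, -1), (0, 1), (1, 1)]


def in_frame(i, j, rows, cols):
    return 0 <= i < rows and 0 <= j < cols


def schedule_frame(rows, cols, process_ctu, workers=4):
    """Process every CTU of a rows x cols frame; return the batches in order."""
    # Step 1: initialize CM (dependency counts) and DQ (waiting queue).
    cm = [[sum(in_frame(i + di, j + dj, rows, cols)
               for di, dj in DEPENDENCY_OFFSETS)
           for j in range(cols)] for i in range(rows)]
    # Step 2 (initial pass): CTUs whose count is already zero are ready.
    dq = deque((i, j) for i in range(rows) for j in range(cols)
               if cm[i][j] == 0)
    batches = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while dq:  # Step 5: repeat until no CTU is left.
            # Step 3: take the ready CTUs and process them in parallel.
            batch = list(dq)
            dq.clear()
            list(pool.map(lambda ctu: process_ctu(*ctu), batch))
            batches.append(batch)
            # Step 4: update CM; Step 2: queue CTUs that reach zero.
            for i, j in batch:
                for di, dj in SUCCESSOR_OFFSETS:
                    ni, nj = i + di, j + dj
                    if in_frame(ni, nj, rows, cols):
                        cm[ni][nj] -= 1
                        if cm[ni][nj] == 0:
                            dq.append((ni, nj))
    return batches


if __name__ == "__main__":
    for n, batch in enumerate(schedule_frame(3, 4, lambda i, j: None)):
        print(n, batch)
```

Processing in synchronized batches keeps the sketch short. A real implementation might instead dispatch each CTU as soon as its CM entry reaches zero, which avoids idle cores when CTUs take different amounts of time.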

Proposed Method.C (4/5)
- Maximum parallelism of CTU
- Maximum parallelism of highly parallel framework
- Average parallelism of highly parallel framework
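The formulas on this slide are not preserved in the transcript. As a rough illustration only, CTU-level parallelism can be estimated by assigning each CTU to the earliest time step its dependencies allow. The sketch assumes unit processing time per CTU, unlimited cores, and the four neighbor dependencies (left, upper, upper-left, upper-right). It does not model parallelism inside a CTU, so it is not a reconstruction of the slide's expressions. The name `parallelism_stats` is hypothetical.

```python
# Assumed dependencies of a CTU: left, upper, upper-left, upper-right.
DEPENDENCY_OFFSETS = [(0, -1), (-1, 0), (-1, -1), (-1, 1)]


def parallelism_stats(rows, cols):
    """Return (time steps, maximum CTUs per step, average CTUs per step).

    Requires rows >= 1 and cols >= 1.
    """
    level = [[0] * cols for _ in range(rows)]
    counts = {}
    # Raster order guarantees every dependency is visited before its dependent.
    for i in range(rows):
        for j in range(cols):
            deps = [level[i + di][j + dj]
                    for di, dj in DEPENDENCY_OFFSETS
                    if 0 <= i + di < rows and 0 <= j + dj < cols]
            # A CTU can start one step after its latest dependency finishes.
            level[i][j] = 1 + max(deps) if deps else 0
            counts[level[i][j]] = counts.get(level[i][j], 0) + 1
    steps = len(counts)
    return steps, max(counts.values()), rows * cols / steps


if __name__ == "__main__":
    # Hypothetical example: a frame of 17 x 30 CTUs.
    print(parallelism_stats(17, 30))
```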

Proposed Method.C (5/5)

Experimental Results (1/5)

Experimental Results (2/5)

Experimental Results (3/5)

Experimental Results (4/5)

Experimental Results (5/5)

Conclusion (1/1)
- The highly parallel framework provides sufficient parallelism for many-core platforms.
- CTUs are parallelized in the DAG-based order.