video coding on multi-core graphics processors

Click here to load reader

Upload: deanna

Post on 23-Feb-2016

48 views

Category:

Documents


0 download

DESCRIPTION

Video Coding on Multi-core Graphics Processors. Ngai -Man Cheung, Xiaopeng Fan, Oscar C. Au, and Man-Cheung Kung. IEEE SIGNAL PROCESSING MAGAZINE MARCH 2010. Outline. Introduction Motion estimation on GPUs GPU-based ME based on loop unrolling - PowerPoint PPT Presentation

TRANSCRIPT

Video Coding on Multi-core Graphics Processors

Ngai-Man Cheung, Xiaopeng Fan, Oscar C. Au, and Man-Cheung KungVideo Coding on Multi-core Graphics ProcessorsIEEE SIGNAL PROCESSING MAGAZINE MARCH 2010OutlineIntroductionMotion estimation on GPUsGPU-based ME based on loop unrollingGPU-based ME based on rearranging the encoding orderRD-optimized intra-mode decision on GPUsMotion compensation on GPUsGPU-based fast motion estimationFast motion estimationThe GPU implementation using tilingExperimentsMy conclusionIntroduction #1H.264Motion estimation [20][22]Intra prediction [27]Motion compensation [10]

Introduction #2ChallengesData dependencyFlow control instructions can degrade the performanceif, switch, do, for, whileAdvantagesfaster

Motion estimation #1GPU-based ME based on loop unrolling [20]SADMetric matching

Motion estimation #2

Motion estimation #3Two parts

2~14Motion estimation #4GPU-based ME based on rearranging the encoding orderRD-optimized ME

40Intra-mode #1RD-optimized intra-mode decision on GPUs [27]http://www.camdemy.com/media/4371

Intra-mode #2

Intra-mode #3

Intra-mode #4

Motion compensation #1Motion compensation on GPUs [10]

Motion compensation #2

4Fast motion estimation #1Simplified unsymmetrical multi-hexagon search [42]Good tradeoff between computational complexity and coding efficiency.minimizing the Lagrangian cost

Fast motion estimation #2

Fast motion estimation #3The GPU implementation using tilingNeighboring block in another tile is unavailable.

Experiments #1

Experiments #2

Experiments #3

Experiments #4

Experiments #5

ConclusionTradeoff between speedup and performance is importance.There seems to be no study on partitioning the encoding flow between CPUs and GPUs.