massively ldpc decoding on multicore architectures present by : fakewen

Massively LDPC Decoding onMulticore Architectures

Present by : fakewen

Authors

• Gabriel Falcao• Leonel Sousa• Vitor Silva

Outline

• Introduction• BELIEF PROPAGATION• DATA STRUCTURES AND PARALLEL

COMPUTING MODELS• PARALLELIZING THE KERNELS EXECUTION• EXPERIMENTAL RESULTS

Outline

Introduction

• LDPC decoding on multicore architectures• LDPC decoders were developed on recent

multicores, such as off-the-shelf general-purpose x86 processors, Graphics Processing Units (GPUs), and the CELL Broadband Engine (CELL/B.E.).

Outline

BELIEF PROPAGATION

• Belief propagation, also known as the SPA, is an iterative algorithm for the computation of joint probabilities

LDPC Decoding

• exploit probabilistic relationships between nodes imposed by parity-check conditions that allow inferring the most likely transmitted codeword.

LDPC Decoding(cont.)

White Gaussian noise

LDPC Decoding(cont.)

Outline

DATA STRUCTURES AND PARALLEL COMPUTING MODELS

• compact data structures to represent the H matrix

Data Structures

• separately code the information about H in two independent data streams, and

remind

• rmn :是 CNm->BNn

• qnm :是 BNn->CNm

Parallel Computational Models

• Parallel Features of the General-Purpose Multicores

• Parallel Features of the GPU• Parallel Features of the CELL/B.E.

Parallel Features of the General-Purpose Multicores

• #pragma omp parallel for

Parallel Features of the GPU

Throughput

Parallel Features of the CELL/B.E.

Throughput

Outline

PARALLELIZING THE KERNELS EXECUTION

• The Multicores Using OpenMP• The GPU Using CUDA• The CELL/B.E.

The Multicores Using OpenMP

The GPU Using CUDA

• Programming the Grid Using a Thread per Node Approach

The GPU Using CUDA(cont.)

• Coalesced Memory Accesses

The CELL/B.E.

• Small Single-SPE Model(A B C)• Large Single-SPE Model

Why Single-SPE Model

• In the single-SPE model, the number of communications between PPE and SPEs is minimum and the PPE is relieved from the costly task of reorganizing data (sorting procedure in Algorithm 4) between data transfers to the SPE.

Outline

EXPERIMENTAL RESULTS

• LDPC Decoding on the General-Purpose x86 Multicores Using OpenMP

• LDPC Decoding on the CELL/B.E.– Small Single-SPE Model– Large Single-SPE Model

• LDPC Decoding on the GPU Using CUDA

LDPC Decoding on the General-Purpose x86 Multicores Using OpenMP

LDPC Decoding on the CELL/B.E.

LDPC Decoding on the CELL/B.E.(cont.)

LDPC Decoding on the GPU Using CUDA

The end

Thank you~

massively ldpc decoding on multicore architectures present by : fakewen

gpu slide

openmp slide

throughput slide

fakewen slide

complexity slide

memory access slide

bnncnm slide

h matrix slide

Documents

analysis of ldpc convolutional codes derived from ldpc block...

chương 3 ldpc

1 implementation of decoders for ldpc block codes and...

5g ldpc-v intel® fpga ip user guide · the 5g ldpc-v ip is...

presentasi ldpc

002.ldpc final 1

ldpc tutorial

massively ldpc decoding on multicore architectures

ldpc based patentus7747934

1gb 220mw ldpc decoder

simple ldpc-staircase fec scheme for fecframe ...

massively parallel assumption-based truth...

robert gallager ldpc 1963

algebra ldpc codes and

ldpc encoding

wyner-ziv coding of motion video presented by fakewen

ldpc codes: achieving the capacity of the binary erasure...

massively parallel ldpc decoding on gpu vivek tulsidas bhat...

ldpc ieee paper

one-bit ldpc message passing decoding based on...