72e7e5182a42c959cb

8/13/2019 72e7e5182a42c959cb

1/4

LOSSY TO LOSSLESS IMAGE COMPRESSION BASED ON REVERSIBLE INTEGER DCT

Lei Wang, Jiaji Wu, Licheng Jiao, Li Zhang and Guangming Shi

Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education of China,

Institute of Intelligent Information Processing, Xidian University, Xi'an 710071, P.R. China

This work is supported by the National Natural Science Foundation of China under Grant Nos.60607010, 60672125, 60672126,

60736043, 60776795; the Program for Cheung Kong Scholars and Innovative Research Team in University (PCSIRT, IRT0645); Hunan

Provincial Natural Science Foundation of China under Grant No.08JJ3123.

ABSTRACT

A progressive image compression scheme is investigated

using reversible integer discrete cosine transform (RDCT)

which is derived from the matrix factorization theory.

Previous techniques based on DCT suffer from bad

performance in lossy image compression compared with

wavelet image codec. And lossless compression methods

such as IntDCT, I2I-DCT and so on could not compare

with JPEG-LS or integer discrete wavelet transform (DWT)

based codec. In this paper, lossy to lossless image

compression can be implemented by our proposed scheme

which consists of RDCT, coefficients reorganization, bit

plane encoding, and reversible integer pre- and post-filters.Simulation results show that our method is competitive

against JPEG-LS and JPEG2000 in lossless compression.

Moreover, our method outperforms JPEG2000 (reversible

5/3 filter) for lossy compression, and the performance is

even comparable with JPEG2000 which adopted

irreversible 9/7 floating-point filter (9/7F filter).

Index TermsBlock transform, JPEG2000, lossless

compression, reversible integer DCT

1. INTRODUCTION

Image compression has been becoming increasinglyimportant with the development of aviation,

communications, internet and space techniques; especially

lossless compression becomes indispensable when there is

no loss of information is tolerable such as medical image,

remote sensing, image archiving, and satellite

communications and so on. Compression ratio and bit

distortion always contradict each other, so the techniques

pursuing for higher compression ratio with less distortion

even without information loss has been one of the popular

research issues in image compression. Fortunately a unified

representation for lossy to lossless image compression can

be provided by reversible coding, so both satisfying

recovered images at reasonable compression ratio and fully

reconstructed images can be obtained from a single

compressed file.

DCT has been applied in many international compression

standards such as JPEG, MPEG, and H.26X and so on for

its special advantages including: highly energy-compacting

capability, transforming block by block as a result of

parallel implementation and low memory requirement.

However, blocking DCT is less of considerations on the

correlations of inter-blocks and always results in blocking

artifacts at low bit rate, both of these defects effect the rate

distortion (RD) performance and visual quality of

reconstructed images. Furthermore, the DCT-based

reversible image compression was not well developed.

Hao et al. [1] have obtained very good results aboutlossless image compression using integer-DCT-based

methods; however, the simulation results are still scarce

about lossy compression. In fact, there is still a large

margin to improve the RD performance. As far as we know,

researches about DCT can be classified into two categories:

first, researches on the integer approximation of DCT

matrix for lossless compression; second, researches on

lossy compression. The first class includes: Chen et al.

proposed the low-cost 8-point Integer DCT (IntDCT) which

is based on the Walsh-Hadamard Transform (WHT) and

integer lifting [2]; Abhayaratne proposed N-point I2I-DCT

by applying recursive methods and lifting techniques,

where N is power of 2 [3]. The second class includes:Xiong presented a DCT-based embedded image coder

which was called EZDCT [4]; Tran et al.designed TDLT

(time domain lapped transform) by adding pre- and post-

filter to the DCT [5]. Although some of the above

algorithms obtained good results, but a progressive

reversible image compression scheme based on DCT with

high performance in both lossy and lossless compression

nearly doesnt exist.

In this paper, we developed a progressive reversible

image compression scheme which can realize lossy to

lossless compression. In our scheme, RDCT is used for

transforming combined with reversible integer pre-filter

which will make blocked source pixels more compatible for

block transform and post-filter for reducing blocking

1037978-1-4244-1764-3/08/$25.00 2008 IEEE ICIP 2008

8/13/2019 72e7e5182a42c959cb

2/4

artifacts. RDCT coefficients are reorganized into sub-band

structure, and then coded by context-based block coding.

Simulation experiments have been taken on benchmark

images for both lossy and lossless image compression and

our algorithm outperforms JPEG-LS [6] and JPEG2000

(5/3 filter) [7] in lossless compression, and performs better

than JPEG2000 (5/3 filter) in reversible lossy compression,

comparable with irreversible JPEG2000 (9/7 F filter).The rest of this paper is organized as follows. A brief

review about DCT and the modified matrix factorization

method will be taken in Sec. 2. In Sec. 3, the realization of

RDCT will be introduced. In the following, RDCT-based

entropy coding will be described in Sec. 4. Finally, the last

section will present the experimental results and discussions.

2. DCT AND MODIFIED MATRIX

FACTORIZATION METHOD

2.1 DCT

The type-II DCT and its inverse in one dimension are givenby the following equations:1

0

2( ) ( ) co s ((2 1) )

2

N

C k

n

kX k x n n

N

(1)1

0

2( ) ( ) co s((2 1) )

2

N

k C

n

kx k X n n

N

(2)for k = 0 N-1, where

10,

2

1 .

k

if k

else

( )x n is the input sequence of length N,

cos((2 1) )2

knN

is the discrete cosine transform kernel.

2.2 Modified Matrix Factorization Method

We will have a brief introduction about the modified matrix

factorization method in this section. Hao has proved that a

nonsingular matrix can be factorized into a product of at

most three triangular elementary reversible matrices

(TERMs) [1]. Galli and Salzo modified the method by

proposing a procedure of quasi-complete pivoting which

leads to a better integer approximation to the original float-

point transform matrix [8].

Suppose a nonsingular matrix

N NA R without loss of

generality with determinant of module 1, the decomposing

formula can be defined as

A PLUS , (3)where L and S are lower triangular matrices, U is upper

matrix and Pis the permutation matrix.

Table I: TERMs Factorized from 4-point DCT Matrix

P L

0 1 0 0 1

1 0 0 0 0.2346 1

0 0 0 1 0.4142 -0.7654 1

0 0 1 0 0.2346 0 -0.6934 1

U S1 -0.2929 -0.0137 -0.6533 1

1 0.3066 0.6533 0 1

1 0.5000 0 0 1

1 0.5307 -0.8626 0.3933 1

3 REVERSIBLE INTEGER HIERARCHICAL DCT

3.1 Reversible Integer Discrete Cosine Transform

From the formula (1), the four-point discrete cosine

transform matrix can be easily got as follows.

0.5000 0.5000 0.5000 0.5000

0.6533 0.2706 -0.2706 -0.6533

0.5000 -0.5000 -0.5000 0.5000

0.2706 -0.6533 0.6533 -0.2706

A

As we can see, the elements of the matrix are values

between -1 and 1, and this will result in float-point

transform coefficients. By calculating we can proved that

the determinant of DCT kernel matrix is equal to 1, so the

matrix factorization theory can be applied to it. Table I

tabulates the factorization result.

Now we take upper TERM Uto illustrate how to realize

reversible integer to integer transform. Suppose ,m nuU ,

then Y = UX can be realized as follows:

, ,1

,

1, 2, , 1.

N

N

m m m m m n nn m

N N N

y u x u x

y u x

where m N

(4)

And its inverse transform is:

,

,1,

/

1

1, 2, ,1.

N N N N

N

m m m n nn mm m

x y u

x y u xu

where m N N

(5)

Where

denotes rounding to the nearest integer.

Obviously, lower TERM can realize reversible integer to

integer transform in the same way and arbitrary point

integer DCT could be realized using the property of TERM.

1038

8/13/2019 72e7e5182a42c959cb

3/4

3.2 Hierarchical RDCT with integer pre- and post-

filters

In our scheme, pre-filter is added to the input signals before

RDCT acting as a flattening operator to make pixels in one

block more homogeneous to improve the efficiency of

compacting energy; while post-filter is added to the

reconstructed signals from inverse RDCT with the function

of de-blocking [5]. The same matrix factorization method is

used to the pre- and post-filter to realize reversible integer

to integer transform. The general formula of pre-filter [5]

can be defined as:

1

2

I J I 0 I JF

J -I 0 V J -I, (6)

where Iand J are identity matrix and reversal identity

matrix respectively. Vis free control matrix.

/ 2 / 2( )II T IV

M S MV J C D C J , (7)

where/ 2

II

MC and

/ 2

IV

MC stand for / 2M point type-II and

type-IV DCT matrix respectively; ,1, ,1S diag sD is a

diagonal matrix where s is a scaling factor. It should be

noticed that, the determinant of the filter matrix Fdoes not

equal to 1 and should be modified to satisfy det 1F

before factorizing.

In order to combine RDCT with wavelet-based codec,

the transform coefficients should be reorganized into tree

structure before coding [4]. With the transformed

coefficients of sub-band structure in hand, a simple way to

improve the efficiency of transform is to apply another

RDCT in the DC sub-band for further de-correlation, and

this can be defined as hierarchical RDCT. Simulation

results show that two-level RDCT could improve the PSNR

by 0.2~0.5 dB compared with one-level RDCT at the same

rate of the same image.

4. RDCT-BASED LOSSY AND LOSSLESS

ENTROPY ENCODING

Once the DCT coefficients have been organized into tree-

structure, wavelet-based codec can be used here.

An improved SPECK algorithm is designed to encode

transform coefficients. The original SPECK [9] has not

adopted high order arithmetic coding. Our algorithm

improves the SPECK by adopting context model likesJPEG2000 on its arithmetic coding for further reducing

correlations of transform coefficients. In our algorithm,

refinement pass coding adopts 3 contexts, sign coding

adopts 5 contexts, and significant coefficients and blocks

coding adopt 20 contexts.

5. RESULTS AND DISCUSSIONS

We perform experiments on still images using DWT and

RDCT combined with several progressive codec, such as

Said and Pearlmans SPIHT [10] and our coding algorithm.

The DWT in SPIHT codec is replaced by RDCT combined

with integer pre- and post-filters. Also, JPEG2000 which

adopts single layer coding stream has been included in ourexperiments for comparison. The lossy and lossless

compression performances are evaluated by PSNR (peak

signal to noise ratio) and bpp (bits per pixel) respectively.

In our simulation experiments, testing images include Lena,

Barbara, Goldhill, Baboon and Finger, all of which are

gray-scale (8bpp) images with size of 512512.

Table II: Lossy compression, PSNR comparison (in dB)

Reversible Irreversible

RDCT (8 point) RDCT (16 point)bpp

JPEG2000

(5/3) SPIHT OURS SPIHT OURS

JPEG2000

(5/3)

JPEG2000

(9/7F)

Lena 512512,8bpp

1 39.31 39.20 39.48 39.22 39.51 39.86 40.35

0.5 36.32 36.55 36.82 36.73 37.00 36.61 37.28

0.25 33.26 33.52 33.81 33.77 34.08 33.44 34.14

Barbara 512512,8bpp

1 35.81 37.05 37.44 37.53 37.95 36.11 37.17

0.5 30.86 32.55 33.03 33.47 33.87 31.03 32.29

0.25 27.36 28.75 29.20 29.85 30.26 27.41 28.39

Goldhill 512512,8bpp

1 35.88 35.98 36.33 35.99 36.39 36.21 36.59

0.5 32.74 32.93 33.21 32.95 33.26 32.91 33.25

0.25 30.09 30.44 30.69 30.46 30.79 30.31 30.54

Baboon 512512,8bpp

1 28.61 28.93 29.27 29.07 29.40 28.63 29.11

0.5 25.06 25.52 25.76 25.66 25.93 25.20 25.59

0.25 22.81 23.21 23.45 23.30 23.52 22.88 23.15

Finger 512512,8bpp

1 30.54 31.67 32.13 32.19 32.66 30.66 31.64

0.5 26.86 27.84 28.20 28.18 28.58 27.09 27.86

0.25 23.68 24.39 24.76 24.80 25.12 23.82 24.37

1039

8/13/2019 72e7e5182a42c959cb

4/4

5.1 Evaluation of lossy compression

Table II summarizes the PSNR of benchmark images with

different methods. Both 88 and 1616 RDCT have been

performed in our experiments. Reversible and irreversible

JPEG2000 have been taken for comparison. Images at

different bit rate can be recovered from a single codestream

and one can see that the results of our method are better

than JPEG2000 for most images at most bit rates. The

PSNR of our method based on RDCT on Barbara are about

2.14~3.01 dB and 0.78~1.86 dB higher than the

performances of JPEG2000 using reversible 5/3 filter and

irreversible 9/7F filter respectively. And 16 point DCTperforms about 0.1~0.4 dB better than 8 point DCT with a

little cost of the complexity increment.

Portions of Goldhill and Barbara, reconstructed from

JPEG2000 and our method based on 16-point RDCT at

0.25 bpp are illustrated in Fig.1. Block artifacts have been

reduced due to the pre- and post-filters adopted in our

method and the detailed textures are preserved better than

JPEG2000.

5.2 Evaluation of lossless compression

Table III illustrates the lossless compression results of our

method, JPEG2000, and JPEG-LS on testing images. It can

be seen that our method based on RDCT outperforms

JPEG2000 (5/3 filter) and JPEG-LS for most images.

6. CONCLUSIONS

In this paper, we present a progressive lossy to lossless

image compression scheme based on hierarchical RDCT.Simulation results show that the new scheme performs well

in both reversible lossy and lossless compression. Besides,

block transform can be implemented parallel as a result of

fast computing compatibility compared with DWT.

7. REFERENCES

[1] P. Hao and Q. Shi, Matrix factorizations for reversible

integer mapping, IEEE Trans. Signal Processing, vol.49,

pp.2314-2324, Oct. 2001.

[2] Y. Chen, S. Oraintara, and T. Nguyen, Integer discrete

cosine transform (IntDCT), inProc. 2nd Int. Conf. Inform.,

Commun. Signa. Process, Dec. 1999.

[3] G.C.K. Abhayaratne, Reversible integer-to-integermapping of N-point orthonormal block transforms. Signal

Processing,v.87 n.5 pp. 950969,2007

[4] Z. Xiong, O. Guleryuz, and M. T. Orchard, A DCT-based

embedded image coder, IEEE Signal Processing Lett., vol.

3, pp. 289290, Nov. 1996.

[5] T. D. Tran, J. Liang, and C. Tu, Lapped transform via time-

domain pre- and post-processing, IEEE Trans. Signal

Process, vol. 51, no. 6, pp. 15571571, Jun. 2003.

[6] ISO / IEC JTC1 SC29 WG1 (JPEG / JBIG), FCD 14495,

Lossless and near-lossless coding of continuous tone still

images (JPEG-LS).

[7] ISO/IEC JTC1/SC 29/WG 1 (ITU-T SG8), The JPEG 2000

Still Image Compression Standard.

[8] L. Galli and S. Salzo, Lossless hyperspectral compression

using KLT,IEEE IGARSS, vol.1, pp.313-316, Sept. 2004.

[9] Asad Islam, William A. Pearlman, Embedded and efficient

low-complexity hierarchical image coder, inProc. Visual

Communications and Image Processing'99, San Jose, CA,

USA, vol.3653, pp. 294-305, 1998.

[10] A. Said, W. A. Pearlman, A new, fast and efficient image

codec based on set partitioning in hierarchical trees, IEEE

Trans. Circuits and Systems for Video Technology, vol.6,

no.3, pp. 243-250, 1996.

Table III: Lossless Performance (in bpp)

OURSJPEG2000

(5/3)JPEG-LS

8 point 16 point

Lena 4.316 4.243 4.341 4.317

Barbara 4.786 4.863 4.689 4.558

Goldhill 4.837 4.712 4.854 4.816

Baboon 6.111 6.038 5.968 5.919

Finger 5.665 5.663 5.535 5.415

(a) Goldhill, 30.79dB (b) Barbara, 30.26 dB

(c) Goldhill, 30.09 dB (d) Barbara, 27.36 dB

(e) Goldhill, 30.54 dB (f) Barbara, 28.39 dB

Fig.1. Portion of Reconstructed Images at 0.25 bpp, our method (top)

versus JPEG2000 5/3 filter (middle) based on reversible transform and

9/7F filter (bottom).

1040

72e7e5182a42c959cb

Documents