72e7e5182a42c959cb

Upload: jin11004

Post on 04-Jun-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/13/2019 72e7e5182a42c959cb

    1/4

    LOSSY TO LOSSLESS IMAGE COMPRESSION BASED ON REVERSIBLE INTEGER DCT

    Lei Wang, Jiaji Wu, Licheng Jiao, Li Zhang and Guangming Shi

    Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education of China,

    Institute of Intelligent Information Processing, Xidian University, Xi'an 710071, P.R. China

    This work is supported by the National Natural Science Foundation of China under Grant Nos.60607010, 60672125, 60672126,

    60736043, 60776795; the Program for Cheung Kong Scholars and Innovative Research Team in University (PCSIRT, IRT0645); Hunan

    Provincial Natural Science Foundation of China under Grant No.08JJ3123.

    ABSTRACT

    A progressive image compression scheme is investigated

    using reversible integer discrete cosine transform (RDCT)

    which is derived from the matrix factorization theory.

    Previous techniques based on DCT suffer from bad

    performance in lossy image compression compared with

    wavelet image codec. And lossless compression methods

    such as IntDCT, I2I-DCT and so on could not compare

    with JPEG-LS or integer discrete wavelet transform (DWT)

    based codec. In this paper, lossy to lossless image

    compression can be implemented by our proposed scheme

    which consists of RDCT, coefficients reorganization, bit

    plane encoding, and reversible integer pre- and post-filters.Simulation results show that our method is competitive

    against JPEG-LS and JPEG2000 in lossless compression.

    Moreover, our method outperforms JPEG2000 (reversible

    5/3 filter) for lossy compression, and the performance is

    even comparable with JPEG2000 which adopted

    irreversible 9/7 floating-point filter (9/7F filter).

    Index TermsBlock transform, JPEG2000, lossless

    compression, reversible integer DCT

    1. INTRODUCTION

    Image compression has been becoming increasinglyimportant with the development of aviation,

    communications, internet and space techniques; especially

    lossless compression becomes indispensable when there is

    no loss of information is tolerable such as medical image,

    remote sensing, image archiving, and satellite

    communications and so on. Compression ratio and bit

    distortion always contradict each other, so the techniques

    pursuing for higher compression ratio with less distortion

    even without information loss has been one of the popular

    research issues in image compression. Fortunately a unified

    representation for lossy to lossless image compression can

    be provided by reversible coding, so both satisfying

    recovered images at reasonable compression ratio and fully

    reconstructed images can be obtained from a single

    compressed file.

    DCT has been applied in many international compression

    standards such as JPEG, MPEG, and H.26X and so on for

    its special advantages including: highly energy-compacting

    capability, transforming block by block as a result of

    parallel implementation and low memory requirement.

    However, blocking DCT is less of considerations on the

    correlations of inter-blocks and always results in blocking

    artifacts at low bit rate, both of these defects effect the rate

    distortion (RD) performance and visual quality of

    reconstructed images. Furthermore, the DCT-based

    reversible image compression was not well developed.

    Hao et al. [1] have obtained very good results aboutlossless image compression using integer-DCT-based

    methods; however, the simulation results are still scarce

    about lossy compression. In fact, there is still a large

    margin to improve the RD performance. As far as we know,

    researches about DCT can be classified into two categories:

    first, researches on the integer approximation of DCT

    matrix for lossless compression; second, researches on

    lossy compression. The first class includes: Chen et al.

    proposed the low-cost 8-point Integer DCT (IntDCT) which

    is based on the Walsh-Hadamard Transform (WHT) and

    integer lifting [2]; Abhayaratne proposed N-point I2I-DCT

    by applying recursive methods and lifting techniques,

    where N is power of 2 [3]. The second class includes:Xiong presented a DCT-based embedded image coder

    which was called EZDCT [4]; Tran et al.designed TDLT

    (time domain lapped transform) by adding pre- and post-

    filter to the DCT [5]. Although some of the above

    algorithms obtained good results, but a progressive

    reversible image compression scheme based on DCT with

    high performance in both lossy and lossless compression

    nearly doesnt exist.

    In this paper, we developed a progressive reversible

    image compression scheme which can realize lossy to

    lossless compression. In our scheme, RDCT is used for

    transforming combined with reversible integer pre-filter

    which will make blocked source pixels more compatible for

    block transform and post-filter for reducing blocking

    1037978-1-4244-1764-3/08/$25.00 2008 IEEE ICIP 2008

  • 8/13/2019 72e7e5182a42c959cb

    2/4

    artifacts. RDCT coefficients are reorganized into sub-band

    structure, and then coded by context-based block coding.

    Simulation experiments have been taken on benchmark

    images for both lossy and lossless image compression and

    our algorithm outperforms JPEG-LS [6] and JPEG2000

    (5/3 filter) [7] in lossless compression, and performs better

    than JPEG2000 (5/3 filter) in reversible lossy compression,

    comparable with irreversible JPEG2000 (9/7 F filter).The rest of this paper is organized as follows. A brief

    review about DCT and the modified matrix factorization

    method will be taken in Sec. 2. In Sec. 3, the realization of

    RDCT will be introduced. In the following, RDCT-based

    entropy coding will be described in Sec. 4. Finally, the last

    section will present the experimental results and discussions.

    2. DCT AND MODIFIED MATRIX

    FACTORIZATION METHOD

    2.1 DCT

    The type-II DCT and its inverse in one dimension are givenby the following equations:1

    0

    2( ) ( ) co s ((2 1) )

    2

    N

    C k

    n

    kX k x n n

    N

    (1)1

    0

    2( ) ( ) co s((2 1) )

    2

    N

    k C

    n

    kx k X n n

    N

    (2)for k = 0 N-1, where

    10,

    2

    1 .

    k

    if k

    else

    ( )x n is the input sequence of length N,

    cos((2 1) )2

    knN

    is the discrete cosine transform kernel.

    2.2 Modified Matrix Factorization Method

    We will have a brief introduction about the modified matrix

    factorization method in this section. Hao has proved that a

    nonsingular matrix can be factorized into a product of at

    most three triangular elementary reversible matrices

    (TERMs) [1]. Galli and Salzo modified the method by

    proposing a procedure of quasi-complete pivoting which

    leads to a better integer approximation to the original float-

    point transform matrix [8].

    Suppose a nonsingular matrix

    N NA R without loss of

    generality with determinant of module 1, the decomposing

    formula can be defined as

    A PLUS , (3)where L and S are lower triangular matrices, U is upper

    matrix and Pis the permutation matrix.

    Table I: TERMs Factorized from 4-point DCT Matrix

    P L

    0 1 0 0 1

    1 0 0 0 0.2346 1

    0 0 0 1 0.4142 -0.7654 1

    0 0 1 0 0.2346 0 -0.6934 1

    U S1 -0.2929 -0.0137 -0.6533 1

    1 0.3066 0.6533 0 1

    1 0.5000 0 0 1

    1 0.5307 -0.8626 0.3933 1

    3 REVERSIBLE INTEGER HIERARCHICAL DCT

    3.1 Reversible Integer Discrete Cosine Transform

    From the formula (1), the four-point discrete cosine

    transform matrix can be easily got as follows.

    0.5000 0.5000 0.5000 0.5000

    0.6533 0.2706 -0.2706 -0.6533

    0.5000 -0.5000 -0.5000 0.5000

    0.2706 -0.6533 0.6533 -0.2706

    A

    As we can see, the elements of the matrix are values

    between -1 and 1, and this will result in float-point

    transform coefficients. By calculating we can proved that

    the determinant of DCT kernel matrix is equal to 1, so the

    matrix factorization theory can be applied to it. Table I

    tabulates the factorization result.

    Now we take upper TERM Uto illustrate how to realize

    reversible integer to integer transform. Suppose ,m nuU ,

    then Y = UX can be realized as follows:

    , ,1

    ,

    1, 2, , 1.

    N

    N

    m m m m m n nn m

    N N N

    y u x u x

    y u x

    where m N

    (4)

    And its inverse transform is:

    ,

    ,1,

    /

    1

    1, 2, ,1.

    N N N N

    N

    m m m n nn mm m

    x y u

    x y u xu

    where m N N

    (5)

    Where

    denotes rounding to the nearest integer.

    Obviously, lower TERM can realize reversible integer to

    integer transform in the same way and arbitrary point

    integer DCT could be realized using the property of TERM.

    1038

  • 8/13/2019 72e7e5182a42c959cb

    3/4

    3.2 Hierarchical RDCT with integer pre- and post-

    filters

    In our scheme, pre-filter is added to the input signals before

    RDCT acting as a flattening operator to make pixels in one

    block more homogeneous to improve the efficiency of

    compacting energy; while post-filter is added to the

    reconstructed signals from inverse RDCT with the function

    of de-blocking [5]. The same matrix factorization method is

    used to the pre- and post-filter to realize reversible integer

    to integer transform. The general formula of pre-filter [5]

    can be defined as:

    1

    2

    I J I 0 I JF

    J -I 0 V J -I, (6)

    where Iand J are identity matrix and reversal identity

    matrix respectively. Vis free control matrix.

    / 2 / 2( )II T IV

    M S MV J C D C J , (7)

    where/ 2

    II

    MC and

    / 2

    IV

    MC stand for / 2M point type-II and

    type-IV DCT matrix respectively; ,1, ,1S diag sD is a

    diagonal matrix where s is a scaling factor. It should be

    noticed that, the determinant of the filter matrix Fdoes not

    equal to 1 and should be modified to satisfy det 1F

    before factorizing.

    In order to combine RDCT with wavelet-based codec,

    the transform coefficients should be reorganized into tree

    structure before coding [4]. With the transformed

    coefficients of sub-band structure in hand, a simple way to

    improve the efficiency of transform is to apply another

    RDCT in the DC sub-band for further de-correlation, and

    this can be defined as hierarchical RDCT. Simulation

    results show that two-level RDCT could improve the PSNR

    by 0.2~0.5 dB compared with one-level RDCT at the same

    rate of the same image.

    4. RDCT-BASED LOSSY AND LOSSLESS

    ENTROPY ENCODING

    Once the DCT coefficients have been organized into tree-

    structure, wavelet-based codec can be used here.

    An improved SPECK algorithm is designed to encode

    transform coefficients. The original SPECK [9] has not

    adopted high order arithmetic coding. Our algorithm

    improves the SPECK by adopting context model likesJPEG2000 on its arithmetic coding for further reducing

    correlations of transform coefficients. In our algorithm,

    refinement pass coding adopts 3 contexts, sign coding

    adopts 5 contexts, and significant coefficients and blocks

    coding adopt 20 contexts.

    5. RESULTS AND DISCUSSIONS

    We perform experiments on still images using DWT and

    RDCT combined with several progressive codec, such as

    Said and Pearlmans SPIHT [10] and our coding algorithm.

    The DWT in SPIHT codec is replaced by RDCT combined

    with integer pre- and post-filters. Also, JPEG2000 which

    adopts single layer coding stream has been included in ourexperiments for comparison. The lossy and lossless

    compression performances are evaluated by PSNR (peak

    signal to noise ratio) and bpp (bits per pixel) respectively.

    In our simulation experiments, testing images include Lena,

    Barbara, Goldhill, Baboon and Finger, all of which are

    gray-scale (8bpp) images with size of 512512.

    Table II: Lossy compression, PSNR comparison (in dB)

    Reversible Irreversible

    RDCT (8 point) RDCT (16 point)bpp

    JPEG2000

    (5/3) SPIHT OURS SPIHT OURS

    JPEG2000

    (5/3)

    JPEG2000

    (9/7F)

    Lena 512512,8bpp

    1 39.31 39.20 39.48 39.22 39.51 39.86 40.35

    0.5 36.32 36.55 36.82 36.73 37.00 36.61 37.28

    0.25 33.26 33.52 33.81 33.77 34.08 33.44 34.14

    Barbara 512512,8bpp

    1 35.81 37.05 37.44 37.53 37.95 36.11 37.17

    0.5 30.86 32.55 33.03 33.47 33.87 31.03 32.29

    0.25 27.36 28.75 29.20 29.85 30.26 27.41 28.39

    Goldhill 512512,8bpp

    1 35.88 35.98 36.33 35.99 36.39 36.21 36.59

    0.5 32.74 32.93 33.21 32.95 33.26 32.91 33.25

    0.25 30.09 30.44 30.69 30.46 30.79 30.31 30.54

    Baboon 512512,8bpp

    1 28.61 28.93 29.27 29.07 29.40 28.63 29.11

    0.5 25.06 25.52 25.76 25.66 25.93 25.20 25.59

    0.25 22.81 23.21 23.45 23.30 23.52 22.88 23.15

    Finger 512512,8bpp

    1 30.54 31.67 32.13 32.19 32.66 30.66 31.64

    0.5 26.86 27.84 28.20 28.18 28.58 27.09 27.86

    0.25 23.68 24.39 24.76 24.80 25.12 23.82 24.37

    1039

  • 8/13/2019 72e7e5182a42c959cb

    4/4

    5.1 Evaluation of lossy compression

    Table II summarizes the PSNR of benchmark images with

    different methods. Both 88 and 1616 RDCT have been

    performed in our experiments. Reversible and irreversible

    JPEG2000 have been taken for comparison. Images at

    different bit rate can be recovered from a single codestream

    and one can see that the results of our method are better

    than JPEG2000 for most images at most bit rates. The

    PSNR of our method based on RDCT on Barbara are about

    2.14~3.01 dB and 0.78~1.86 dB higher than the

    performances of JPEG2000 using reversible 5/3 filter and

    irreversible 9/7F filter respectively. And 16 point DCTperforms about 0.1~0.4 dB better than 8 point DCT with a

    little cost of the complexity increment.

    Portions of Goldhill and Barbara, reconstructed from

    JPEG2000 and our method based on 16-point RDCT at

    0.25 bpp are illustrated in Fig.1. Block artifacts have been

    reduced due to the pre- and post-filters adopted in our

    method and the detailed textures are preserved better than

    JPEG2000.

    5.2 Evaluation of lossless compression

    Table III illustrates the lossless compression results of our

    method, JPEG2000, and JPEG-LS on testing images. It can

    be seen that our method based on RDCT outperforms

    JPEG2000 (5/3 filter) and JPEG-LS for most images.

    6. CONCLUSIONS

    In this paper, we present a progressive lossy to lossless

    image compression scheme based on hierarchical RDCT.Simulation results show that the new scheme performs well

    in both reversible lossy and lossless compression. Besides,

    block transform can be implemented parallel as a result of

    fast computing compatibility compared with DWT.

    7. REFERENCES

    [1] P. Hao and Q. Shi, Matrix factorizations for reversible

    integer mapping, IEEE Trans. Signal Processing, vol.49,

    pp.2314-2324, Oct. 2001.

    [2] Y. Chen, S. Oraintara, and T. Nguyen, Integer discrete

    cosine transform (IntDCT), inProc. 2nd Int. Conf. Inform.,

    Commun. Signa. Process, Dec. 1999.

    [3] G.C.K. Abhayaratne, Reversible integer-to-integermapping of N-point orthonormal block transforms. Signal

    Processing,v.87 n.5 pp. 950969,2007

    [4] Z. Xiong, O. Guleryuz, and M. T. Orchard, A DCT-based

    embedded image coder, IEEE Signal Processing Lett., vol.

    3, pp. 289290, Nov. 1996.

    [5] T. D. Tran, J. Liang, and C. Tu, Lapped transform via time-

    domain pre- and post-processing, IEEE Trans. Signal

    Process, vol. 51, no. 6, pp. 15571571, Jun. 2003.

    [6] ISO / IEC JTC1 SC29 WG1 (JPEG / JBIG), FCD 14495,

    Lossless and near-lossless coding of continuous tone still

    images (JPEG-LS).

    [7] ISO/IEC JTC1/SC 29/WG 1 (ITU-T SG8), The JPEG 2000

    Still Image Compression Standard.

    [8] L. Galli and S. Salzo, Lossless hyperspectral compression

    using KLT,IEEE IGARSS, vol.1, pp.313-316, Sept. 2004.

    [9] Asad Islam, William A. Pearlman, Embedded and efficient

    low-complexity hierarchical image coder, inProc. Visual

    Communications and Image Processing'99, San Jose, CA,

    USA, vol.3653, pp. 294-305, 1998.

    [10] A. Said, W. A. Pearlman, A new, fast and efficient image

    codec based on set partitioning in hierarchical trees, IEEE

    Trans. Circuits and Systems for Video Technology, vol.6,

    no.3, pp. 243-250, 1996.

    Table III: Lossless Performance (in bpp)

    OURSJPEG2000

    (5/3)JPEG-LS

    8 point 16 point

    Lena 4.316 4.243 4.341 4.317

    Barbara 4.786 4.863 4.689 4.558

    Goldhill 4.837 4.712 4.854 4.816

    Baboon 6.111 6.038 5.968 5.919

    Finger 5.665 5.663 5.535 5.415

    (a) Goldhill, 30.79dB (b) Barbara, 30.26 dB

    (c) Goldhill, 30.09 dB (d) Barbara, 27.36 dB

    (e) Goldhill, 30.54 dB (f) Barbara, 28.39 dB

    Fig.1. Portion of Reconstructed Images at 0.25 bpp, our method (top)

    versus JPEG2000 5/3 filter (middle) based on reversible transform and

    9/7F filter (bottom).

    1040