[rakutentechconf2013] [c4-1] text detection in product images

Text detection in product images

10/26/2013

Naoki Chiba, Lead Scientist

Rakuten Institute of TechnologyRakuten Inc.http://rit.rakuten.co.jp/

Product images

Sales pitches in images

Applications:• Content retrieval/filtering• Recognition• Translation

RIT Text Detector

Far more accurate Works like magic

Outline

１ Text detection overview

2 Current methods

3 RIT’s approach

Outline

2 Current methods

3 RIT’s approach

Academic Research

Natural scene OCR ≠ traditional scanned OCRCamera capturedIllumination variationsPerspective distortionShort text

Source: ICDAR Text locating competition

Digital-born text Natural-scene text

Product Images - Two Purposes

1. Sales pitches

2. Product list

Text’s role is different

Product list

Sales pitch (Merchant’s names, Price, Shipping)

“Now Printing” images

Showing image unavailability, but..

NotUpdated

Text detection for product images

More accurate

Much Faster

Outline

2 Current methods

3 RIT’s approach

Current methods

1. Texture based (Classifier-based)2. Region based (Connected components)3. Hybrids

1. Texture-based method

Special texture ScanClassifier (SVM, AdaBoost or Neural network)

Problems:

• Scale/Rotation variant

• High computation

2. Region-based method

Local features (edges or color clustering)

Connected component analysisText lines and word separation

Problem:

• False candidates

Output of Stroke width transform

3. Hybrid method

Region based Edge (Stroke Width Transform) Color clustering

Classifier SVM Random Forrest

AdaBoost

Problems

1. Character/word annotationTime-consuming task

2. Transparent textHard to detect

Problem 1: Character/word annotation

Time consuming for many images

Problem 2: Transparent text

?• Weak edges (difficult to detect)

Outline

2 Current methods

3 RIT’s approach

RIT’s Approach

Text image classifier using image-wise annotation

Transparent text detection and background recovery

1. Text image classifier using image-wise annotation

• Text image detection (not char/word)– Image-wise annotation (less time)– Clustering detected regions

(measure text likeliness)

Image-wise Annotation

Draw rectangles

送料無料

Image-wiseClassify text/non-text

text non-text

Character-wise

Clustering detected regions

Region in text imagesRegion in non-text images

x Cluster center

P(C1) = 3/4

P(C4) = 0/3

Comparison

• Rakuten 500 images• Compared w/a traditional region-based method

Current Proposed0.0%

Accuracy

Better than a typical method

RIT’s Approach

Text image classifier using image-wise annotation

Transparent text detection and background recovery

2. Transparent text detection and background recovery

• Edge Detection with adaptive threshold– Image content analysis

• Background recovery– Text color/opacity estimation

Edge detection with adaptive thresholds

Less noise

Weak edges are better preserved

Texture strength

Measuring image complexity

Direction and energy: eigenvectors and eigenvalues[1]

Image patches:

Texture strength:

[1] Xiang Zhu and Peyman Milanfar, “Automatic parameter selection for denoising algorithms using a no-reference measure of image content,” IEEE transactions on image processing, pp. 3116–32, 2010.

Proposed text detection

1. Texture based (Classifier based)

SVM/Random Forest/AdaBoost2. Region based (Connected components)

Edge/Color Clustering3. Hybrids

Region (Edge Stroke Width) + Texture (AdaBoost)

System flow

Components Analysis

Detected text

Stroke width transform and Connected componentInput image Adaptive Edge

detection

Detection result

(a) constant threshold (b) proposed

System flow

Components Analysis

Detected text

Stroke width transform and Connected componentInput image

Backgroundrecovery

Adaptive Edge detection

Transparent Text

T I: observed pixel value

O: original pixel value

• 2 >= equations• Least squares solution• 2 unknown

text coloropacity

Extraction result

(b) recovered(a) original

Comparison with InPainting

Original

InPainting Rakuten

Patented!

Thank you!

Details: ACPR 2013

[rakutentechconf2013] [c4-1] text detection in product images

transparent

characterword

text detection

imagewise

rits approach

product images

connected

region based

Technology

[rakutentechconf2013] [b-3_1] challenge of rakuten ichiba

laser radar detection statistics: a comparison of coherent...

new citroËn c4 picasso & grand c4 picasso · citroËn c4...

[rakutentechconf2013] [b-3_3] rakuten category

[rakutentechconf2013] [b-4] passionate business...

[rakutentechconf2013] [e-4] fusion forensics - a critical...

[rakutentechconf2013] [d-3_2] counting big databy streaming...

[rakutentechconf2013] [a-4] the approach of event in japan...

moriseiki sl/zl150-154mc/smc/y/sy · odclampingunit c4...

1 a b c d - nice · c4 nero c4 black c4 noir c4 negro c4...

19080703 %c4%b1%c4%b1

[rakutentechconf2013] [c-2_2] developing apps for smart tvs

[rakutentechconf2013] [lt] giving life to your ideas to...

haas st-10/15/20/25/30/35(vditurret) ·...

moriseiki duraturn2030,2050,2550 · duraturn2030,2050,2550...

[rakutentechconf2013] [b-0] ux analytics - measure your roi!

new citroËn c4 picasso -...

[rakutentechconf2013] [b-3_2] dwh/hadoop in rakuten ichiba

rakutentechconf2013] [d-3_1] leofs - open the new door

citroËn c4 spacetourer grand c4 spacetourer