e-books presentation. hard copy (book) scanning ocr text document html conversion text formatting...

11
E-Books Presentation

Upload: scott-stone

Post on 03-Jan-2016

225 views

Category:

Documents


1 download

TRANSCRIPT

E-Books

Presentation

Hard Copy (Book)

Scanning

OCR

Text Document

HTML Conversion

Text Formatting

Linking

Image Insertion

Final QC

Soft Copy (JPG/TIFF)

Image Editing Using Photoshop

Process Flow

Source

Source supplied by client in two format.

1. Book 2. Jpg/Tiff image

(Hard copy) (Soft copy)

Source

Scanning

1. Hard copy is scanned at 600 dpi.

Image Editing

1. Text and Image area are separated.

2. Images are edited separately to improve quality and to reduce file size.

OCR

1. Only Text area is recognized, and converted to text document in OCR conversion software.

2. Spell checking is also done simultaneously.

HTML Conversion, Text Formatting

1. Text content after OCR is inserted in html templates which has predefined formats as Style sheet(CSS), Page Links(Back,Home,Next).

2. The text content is formatted using CSS and are proof checked again.

Linking

1. Linking between Pages and Frames

2. Search option is provided using Java Applet.

3. Bookmarks are set and linked to Index pages.

Image Insertion

1. After Images are fine tuned and they are inserted in tables of HTML pages in between text contents.

QC

1. The final HTML pages are rechecked for flaws.

2. The pages are compared with original source.

3. Page formats are Rechecked for all pages.

4. QC is maintained for maximum accuracy.

Final Output