unsd workshop – minsk - dec 2008

40
UNSD Workshop – Minsk - Dec 2008 Amir Angel Director of Government Projects Supporting National Censuses Top Image Systems Data Capture platform for Censuses

Upload: noel-galloway

Post on 30-Dec-2015

45 views

Category:

Documents


0 download

DESCRIPTION

UNSD Workshop – Minsk - Dec 2008. Supporting National Censuses. Top Image Systems Data Capture platform for Censuses. Amir Angel Director of Government Projects. Agenda. Who we are? TIS’s Platform for Censuses Questions & Answers Demo. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: UNSD Workshop – Minsk - Dec 2008

UNSD Workshop – Minsk - Dec 2008

Amir Angel Director of Government Projects

Supporting National Censuses

Top Image SystemsData Capture platform for Censuses

Page 2: UNSD Workshop – Minsk - Dec 2008

2

Agenda

Who we are? TIS’s Platform for Censuses Questions & Answers Demo

Page 3: UNSD Workshop – Minsk - Dec 2008

3

Number of people “Counted” by TIS in Censuses world wide

1,374,026,304

Page 4: UNSD Workshop – Minsk - Dec 2008

TIS’s Experience in Censuses Projects Turkey 1997 & 2000

Brazil 2000

Kenya 2000

South Africa 2001

Slovak Republic 2001

Hong Kong 2001

India 2002

Ireland 2002

Italy 2002 Cyprus 2002 Slovenia 2006 (Census of real estates) Ireland 2006 Hong Kong 2006 South Africa 2007 (Community Survey)

Thailand NSO 2008 (Community Survey)

Largest market share worldwide in census projects information

capture

Largest market share worldwide in census projects information

capture

Page 5: UNSD Workshop – Minsk - Dec 2008

TIS’s Experience in Censuses Projects

2010 RoundProjects Won: • Scottish Census 2011• Belarus 2009

Page 6: UNSD Workshop – Minsk - Dec 2008

Overview - Top Image Systems

Founded 1991

Data Extraction solutions. Specialized in Censuses

Projects.

Since 1996, traded on NASDAQ (TISA)

Since 2006, traded on TASE (TISA)

2 acquisitions in 2007

~250 employees

Page 7: UNSD Workshop – Minsk - Dec 2008

Local offices in:

Europe United Kingdom, Germany, Italy, Spain, France, Benelux

Asia Japan, Singapore, Hong Kong, Shanghai,

Guangzhou (R&D) and Australia

USA North & Latin America

Israel R&D Headquarters

Present in app. 40 countries Strong partner network worldwide Around 800 installed systems worldwide

Page 8: UNSD Workshop – Minsk - Dec 2008

eFlow platform for Censuses

Top Image SystemsData Capture platform for Censuses

Page 9: UNSD Workshop – Minsk - Dec 2008

9

The evolution of data capture in census projects

eFLOWeFLOWFrom OCR into IDR Solution

Page 10: UNSD Workshop – Minsk - Dec 2008

10

TIS’s Census Data Capture Solution

Census Data base

Suggest a Single platform for all enterprise content

Page 11: UNSD Workshop – Minsk - Dec 2008

How does eflow read data?

Top Image SystemsData Capture platform for Censuses

Page 12: UNSD Workshop – Minsk - Dec 2008

12

The Process Flow – Processing Center

CLASSIFY

Completion

RECOGNTION

EXPORT

Workflow Process

Database Host/ERP/Custom

Electronic Archive

InputOutput

Structured & unstructured information

Exception

1010 1

1010101010101010101010101010101010110101010101010101010 1101010101010101010 0

10101010101010101010101010 001 101

101010101010101010 1

010100101

ScannedDocuments

eDocs and Facsimile

Email

Page 13: UNSD Workshop – Minsk - Dec 2008

1313Scanning OCR Validation

Process integrality, implementing a work flow according to the client needs

Export

MFlexibilityctivator

Page 14: UNSD Workshop – Minsk - Dec 2008

14

Flexibility

Page 15: UNSD Workshop – Minsk - Dec 2008

Flexibility

15

Page 16: UNSD Workshop – Minsk - Dec 2008

16

The Process Flow

CLASSIFY

Completion

RECOGNTION

EXPORT

Workflow Process

Database Host/ERP/Custom

Electronic Archive

InputOutput

Structured & unstructured information

Exception

1010 1

1010101010101010101010101010101010110101010101010101010 1101010101010101010 0

10101010101010101010101010 001 101

101010101010101010 1

010100101

ScannedDocuments

eDocs and Facsimile

Email

Page 17: UNSD Workshop – Minsk - Dec 2008

17

Advanced approaches Automatic EFI Matching

– Improving template recognition station speed via the “Force EFI” mechanism, a unique barcode posted on each page

Questioner integrity

Page 18: UNSD Workshop – Minsk - Dec 2008

18

The Process Flow

CLASSIFY

Completion

RECOGNTION

EXPORT

Workflow Process

Database Host/ERP/Custom

Electronic Archive

InputOutput

Structured & unstructured information

Exception

1010 1

1010101010101010101010101010101010110101010101010101010 1101010101010101010 0

10101010101010101010101010 001 101

101010101010101010 1

010100101

ScannedDocuments

eDocs and Facsimile

Email

Page 19: UNSD Workshop – Minsk - Dec 2008

ICR

OMR OCR

Multiple Data Types

Page 20: UNSD Workshop – Minsk - Dec 2008

Recognition engines/technologies embedded in the platform

20

Page 21: UNSD Workshop – Minsk - Dec 2008

*oshua Jo*hu* J*sh*a

Joshua

ICR A ICR B ICR C

Virtual Engine example to increase recognition

Voting Method

Page 22: UNSD Workshop – Minsk - Dec 2008

Automatic approaches Auto Coding

– Coding tasks and data validations performed on the data capture platform: a ‘cost-effective’ solution

– Use one of the statistic software's in the market like ACTR (Canadian statistical software for coding some fields)

– Use Approximate Search tools for improving results via DB (Exorbyte)

Dynamic Dictionary update– Lookup and dictionaries via DB

Page 23: UNSD Workshop – Minsk - Dec 2008

Form Out

Original TIFF EFI DIF

ROI

Reduce network traffic Reduce storage media

Page 24: UNSD Workshop – Minsk - Dec 2008

24

The Process Flow

CLASSIFY

Completion

RECOGNTION

EXPORT

Workflow Process

Database Host/ERP/Custom

Electronic Archive

InputOutput

Structured & unstructured information

Exception

1010 1

1010101010101010101010101010101010110101010101010101010 1101010101010101010 0

10101010101010101010101010 001 101

101010101010101010 1

010100101

ScannedDocuments

eDocs and Facsimile

Email

Page 25: UNSD Workshop – Minsk - Dec 2008

25

Completion Station – Page Mode

Page 26: UNSD Workshop – Minsk - Dec 2008

Field Group Mode Completion

Page 27: UNSD Workshop – Minsk - Dec 2008

Business Logic & Validation

Page 28: UNSD Workshop – Minsk - Dec 2008

Identify false positives Alpha & Numeric fields Highlight for verifications Quality control for ICR

Unique Tiling stations – Checking for false positives

Page 29: UNSD Workshop – Minsk - Dec 2008

Implementing Edits

29

Page 30: UNSD Workshop – Minsk - Dec 2008

Analysis Of Current Form

Dictionaries– Owner name to actual address– Address Database

Date Of Birth : should match with Age Higher Education : Which 12th year of high school Age Of Mum : Child cannot be older than mum Religion : Detailed action Married : if not married shouldn’t have wife And more…

Page 31: UNSD Workshop – Minsk - Dec 2008

CodingComputer Assisted Coding by statistical experts as part of the data capture system (2nd level repair).

Page 32: UNSD Workshop – Minsk - Dec 2008

32

Page 33: UNSD Workshop – Minsk - Dec 2008

33

Custom stations approach

Page 34: UNSD Workshop – Minsk - Dec 2008

34

The Process Flow

CLASSIFY

Completion

RECOGNTION

EXPORT

Workflow Process

Database Host/ERP/Custom

Electronic Archive

InputOutput

Structured & unstructured information

Exception

1010 1

1010101010101010101010101010101010110101010101010101010 1101010101010101010 0

10101010101010101010101010 001 101

101010101010101010 1

010100101

ScannedDocuments

eDocs and Facsimile

Email

Page 35: UNSD Workshop – Minsk - Dec 2008

35

Customized Exported DataExamples

XML

SQL

CSV

Tab Delimited

Page 36: UNSD Workshop – Minsk - Dec 2008

36

Controller

Page 37: UNSD Workshop – Minsk - Dec 2008

37

Monitoring and Management

Page 38: UNSD Workshop – Minsk - Dec 2008

38

Modules

Statistical Data base– Statistical report to monitor the daily,

weekly, monthly rate per user/station– Quality checking using

Page 39: UNSD Workshop – Minsk - Dec 2008

Post Census Usage

Building of new Database for Census Agricultural Census Real Estate Census On going Surveys Tax Office Tourism Board Immigration Department Urban Development Board

Page 40: UNSD Workshop – Minsk - Dec 2008

40

Summery

Data capture and IDR platform (paper, electronic, mobile) and not a recognition product

Proven solution in census data capture! no need to invest time and money in new technology and vendor, minimizing the risk

Extensive experience in the design, development and implementation of real census and other high volume form processing projects. Largest market share worldwide in the processing of census projects,

Huge experience based on long researches in the Census arena

Maximum flexibility, redundancy and robust platform ensuring you meet project timetable to release census results.