USE OF ICR TECHNOLOGY FOR THE INDONESIA 2010

  • Slides: 39
Download presentation
USE OF ICR TECHNOLOGY FOR THE INDONESIA 2010 POPULATION CENSUS BPS Statistics Indonesia New

USE OF ICR TECHNOLOGY FOR THE INDONESIA 2010 POPULATION CENSUS BPS Statistics Indonesia New York, February 2011 1

Background • Population census in Indonesia is held every ten year. • Indonesia has

Background • Population census in Indonesia is held every ten year. • Indonesia has the fourth largest population and the largest archipelago. • History of data processing for population census o 1971 - OMR Technology, mainframe o 1980 – data entry, mainframe o 1990 – data entry, mainframe, distributed o 2000 – OCR technology, PC clusters o 2010 – ICR and mobile technology, PC clusters. 2

Data Processing Centers • Located in 33 Provincial Statistics Offices. VPN 3

Data Processing Centers • Located in 33 Provincial Statistics Offices. VPN 3

Flow of Documents in the Fields DESA SP 2010 L, KBC, RT, ART ENUMERATOR

Flow of Documents in the Fields DESA SP 2010 L, KBC, RT, ART ENUMERATOR KORTIM Doc Pool FIELDS KEC SP 2010 L, KBC, RT, ART Doc Pool BPS BPS DISTRIC BPS Drop Off Receiving & Handling Queuing PROVINCE Expedition Drop Off Expedition/ Next Queuing Receiving & Handling Repack Unpack & Checking Entry SP 2010 -L Queuing Unpack Repack Entry Coding 4

DPC Personnel PROVINCE Database Box & Block Sensus CODING PICKUP OFFICER - 5) Take

DPC Personnel PROVINCE Database Box & Block Sensus CODING PICKUP OFFICER - 5) Take box from queuing room - 6) Registration of pick up box - 7) Deliver box es to Coding Editing Supervisor RECEPTION AREA Box sorting In Queuing Room - 1) Download Box - 2) Put Boxes in the trolley - 3) Input received Data - 4) Arrange box in Queuing Room CODING EDITING SUPERVISOR - 8) Boxes distribution ke petugas Coding Editing - 11. 2) Check & Authorization on any pages discrepancies - 13) Update data box that finished coding editing - 11. 1) Reporting of discrepancies pages Sorting Boxes in Scanning Queue CODING EDITING OFFICER - 9) Box opening - 10) Unbind documents - 11) Pages count - 12) Coding Editing 6

Flow of Processing Documents FIELDS Drop Off Receiving & Handling Registering STAGING DROP-OFF SERVICE

Flow of Processing Documents FIELDS Drop Off Receiving & Handling Registering STAGING DROP-OFF SERVICE DOCUMENT STORAGE REPACKING FUMIGATION SCANNING Unpack & Checking Sorting VALIDATION CORRECTION & COMPLETION DOCUMENT PREPARATION Cutting DOC PREP 7

Flow of Work in DPC : Scanning & Warehouse PROVINCE Database STORAGE OFFICER -

Flow of Work in DPC : Scanning & Warehouse PROVINCE Database STORAGE OFFICER - 12) Place box refer to Put-Away Database Box & Block Sensus SCANNING PICKUP OFFICER - 1) Pickup box from Scanning Queue Box Scanning Queue STORAGE ADMIN - 2) Pickup box registration - 3) Deliver box to Scanning Supervisor SCANNING OFFICER - 5) Register # box - 6) Scan docoments DATA CAPTURE SERVER - 10) Register box - 11) Cetak Put-Away Penyimpanan REPACKING OFFICER - 7) Repacking box - 8)Register finished repack STORAGE OFFICER - 9) Trolley from Repacking to Doc Storage 8

Flow of Data in DPC BPS DPC INFORMATION TECHNOLOGY Data Tabulasi Clean Data Staging

Flow of Data in DPC BPS DPC INFORMATION TECHNOLOGY Data Tabulasi Clean Data Staging RELEASE CAPTURE SYSTEM SUPPORT APPS BPS Server Status box Lokasi box Data Staging RECEPTION SERVICE DOCUMENT STORAGE Image + data Data Validasi Correction & Completion Scanning 9

Batching System • Document batch o 1 SP 2010 KBC o Consist of =

Batching System • Document batch o 1 SP 2010 KBC o Consist of = n SP 2010 RT o Each RT consist of = n SP 2010 ART 10

Capture Process • • Fixed Form Approach High speed Auto classification & separation Accurate

Capture Process • • Fixed Form Approach High speed Auto classification & separation Accurate High Speed ICR engine Accurate High Speed OMR engine Consistency check capability Inter-page business rule validation Multipage business rules validation Low false positive & Tuning 11

Solution Components 12

Solution Components 12

Solution Components • • Guillotine PCs Server Scanner Software Data Capture Training & troubleshooting

Solution Components • • Guillotine PCs Server Scanner Software Data Capture Training & troubleshooting Template Development Distribution, installation & implementation in each DPC (33 locations) 13

Fujitsu Scanner Fi-6800 • • • Scanner Speed : 130 ppm 300 dpi Duty

Fujitsu Scanner Fi-6800 • • • Scanner Speed : 130 ppm 300 dpi Duty Cycle : 100. 000 pages/ day Resolution : 600 dpi Feeder Capacity : 500 pages Paper Size : up to A 3 Imprinter capability : Pre and Post 14

§ Guillotine, workstation, scanner 15

§ Guillotine, workstation, scanner 15

§ Data Capture Server, Validation Server 16

§ Data Capture Server, Validation Server 16

§ Server Console, Server Racks 17

§ Server Console, Server Racks 17

Scanner Allocation # DPC - BPS Offices 1 2 3 4 5 6 7

Scanner Allocation # DPC - BPS Offices 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 NAD Sumatera Utara Sumatera Barat Riau Jambi Sumatera Selatan Bengkulu Lampung Kep Bangka Belitung Kepulauan Riau DKI Jakarta Jawa Barat Jawa tengah DI Yogyakarta Jawa Timur Banten No. of Docs 4. 350. 904 12. 336. 180 4. 413. 328 5. 738. 708 2. 961. 468 7. 534. 780 1. 785. 968 7. 841. 096 1. 138. 932 1. 612. 044 10. 508. 444 37. 095. 756 30. 000 4. 399. 192 34. 511. 536 11. 114. 704 Scanner allocation 1 2 1 1 1 1 2 6 5 1 5 2 18

Scanner Allocation # DPC - BPS Offices 17 Bali 18 Nusa Tenggara Barat 19

Scanner Allocation # DPC - BPS Offices 17 Bali 18 Nusa Tenggara Barat 19 Nusa Tenggara Timur 20 Kalimantan Barat 21 Kalimantan Tengah 22 Kalimantan Selatan 23 Kalimantan Timur 24 Sulawesi Utara 25 Sulawesi Tengah 26 Sulawesi Selatan 27 Sulawesi Tenggara 28 Gorontalo 29 Sulawesi Barat 30 Maluku 31 Maluku Utara 32 Papua Barat 33 Papua No. of Docs 3. 900. 360 5. 485. 668 3. 998. 896 4. 291. 276 2. 393. 316 3. 956. 552 3. 513. 608 2. 669. 324 2. 301. 920 7. 745. 836 2. 158. 264 1. 142. 544 1. 006. 684 1. 082. 632 715. 672 837. 400 2. 400. 544 254. 579. 368 Scanner Allocation 1 1 1 1 1 2 1 1 1 1 51 19

Server, PC Allocation # DPC - BPS Offices 1 2 3 4 5 6

Server, PC Allocation # DPC - BPS Offices 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 NAD Sumatera Utara Sumatera Barat Riau Jambi Sumatera Selatan Bengkulu Lampung Kep Bangka Belitung Kepulauan Riau DKI Jakarta Jawa Barat Jawa tengah DI Yogyakarta Jawa Timur Banten Server PC 2 2 2 4 4 2 4 30 84 33 41 22 52 14 54 11 14 73 344 237 33 284 77 20

Server, PC Allocation # DPC - BPS Offices 17 Bali 18 Nusa Tenggara Barat

Server, PC Allocation # DPC - BPS Offices 17 Bali 18 Nusa Tenggara Barat 19 Nusa Tenggara Timur 20 Kalimantan Barat 21 Kalimantan Tengah 22 Kalimantan Selatan 23 Kalimantan Timur 24 Sulawesi Utara 25 Sulawesi Tengah 26 Sulawesi Selatan 27 Sulawesi Tenggara 28 Gorontalo 29 Sulawesi Barat 30 Maluku 31 Maluku Utara 32 Papua Barat 33 Papua Server PC 2 2 2 2 2 76 28 40 28 30 18 28 25 20 18 53 17 11 10 10 8 9 19 1. 775 21

Networking # DPC - BPS Offices 1 2 3 4 5 6 7 8

Networking # DPC - BPS Offices 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 NAD Sumatera Utara Sumatera Barat Riau Jambi Sumatera Selatan Bengkulu Lampung Kep Bangka Belitung Kepulauan Riau DKI Jakarta Jawa Barat Jawa tengah DI Yogyakarta Jawa Timur Banten Switch 48 node Cable (m) 1 2 1 1 2 8 6 1 7 2 380 940 410 490 300 620 220 640 190 220 830 3, 780 2, 650 410 3, 140 870 22

Networking # DPC - BPS Offices 17 Bali 18 Nusa Tenggara Barat 19 Nusa

Networking # DPC - BPS Offices 17 Bali 18 Nusa Tenggara Barat 19 Nusa Tenggara Timur 20 Kalimantan Barat 21 Kalimantan Tengah 22 Kalimantan Selatan 23 Kalimantan Timur 24 Sulawesi Utara 25 Sulawesi Tengah 26 Sulawesi Selatan 27 Sulawesi Tenggara 28 Gorontalo 29 Sulawesi Barat 30 Maluku 31 Maluku Utara 32 Papua Barat 33 Papua Switch 48 node 1 1 1 1 1 2 1 1 1 1 39 Cable (m) 360 480 360 380 260 330 280 260 630 250 190 180 160 170 21. 400 23

Data Capture Software KOFAX 24

Data Capture Software KOFAX 24

Kofax Implementation Overview Scan Recognition Correction Completion Release 25

Kofax Implementation Overview Scan Recognition Correction Completion Release 25

Software Data Capture Implementation Doc Template Management o Template Registration o Template Setting •

Software Data Capture Implementation Doc Template Management o Template Registration o Template Setting • • Registration Point Field Definition Field Formatting Multi-Engine Voting Dictionary Data Look-Up Business Rules Integrity among pages 26

Data Processing Context Municipality Statistics Office Data Entry Head Quarter RBL Quality Check Field

Data Processing Context Municipality Statistics Office Data Entry Head Quarter RBL Quality Check Field Work Listing Census RBL KBC C 1 Validate, Summarize Send SMS Statistical Coordinator Validate, Send SMS Summarize Provincial Statistics Office Scanning Recognition Correction Validation Monitoring Compiled Data Release Quality Check 27

Capture Process Flow Server Database Server Data Capture • Classification • Recognition PC Document

Capture Process Flow Server Database Server Data Capture • Classification • Recognition PC Document Review PC & Scanner PC Correction PC QUALITY CONTROL PC Completion

Document Preparation • Objective: – To cut the side of forms booklet using paper

Document Preparation • Objective: – To cut the side of forms booklet using paper guillotine – Preparing docs for scanning process 29

 • Kofax Module Scanning : – Scan batch – Page counting of document

• Kofax Module Scanning : – Scan batch – Page counting of document batch in scanning process • QC: – System ensure that the pages of the doc batch match with the registered sum of pages entry before scanning. • Classification: o System will classify based on template • Document Review: o Unrecognized doc will appear in this module o Operator may re-arrange, delete and re-scan the doc 30

Kofax - Module • Recognition: – – Data extraction from processed form unrecognized Data

Kofax - Module • Recognition: – – Data extraction from processed form unrecognized Data for Correction & Completion • Correction: – Character correction which un-recognized by system on below a set of confidence level. Correction made field by field. • Completion: – To complete all correction on one set of document in a document batch refer to validation and business rules that have set in the system • Release: – Exporting image to predefine folder and data to predefine database 31

Kofax - Correction • Sample Screen: 32

Kofax - Correction • Sample Screen: 32

Kofax – Completion • Sample Screen ENTRY PANEL IN TABULAR FORMAT TO CATEGORISED FIELD

Kofax – Completion • Sample Screen ENTRY PANEL IN TABULAR FORMAT TO CATEGORISED FIELD 33

Kofax – Completion LOCATION ID CHECKING , DATA LOOKUP TO DATABASE 34

Kofax – Completion LOCATION ID CHECKING , DATA LOOKUP TO DATABASE 34

Kofax – Completion Business Rules VERIFICATION CHILD NATIONALITY VS BIOLOGICAL FATHER &/ MOTHER Business

Kofax – Completion Business Rules VERIFICATION CHILD NATIONALITY VS BIOLOGICAL FATHER &/ MOTHER Business Rules VERIFICATION CHILD AGE W/ BIOLOGICAL MOTHER 35

Kofax - Release • Objective: – Deliver image to folder in the File Server

Kofax - Release • Objective: – Deliver image to folder in the File Server – Deliver data to database Staging BPS • Scope: – Write data to Database Staging 36

Network Architecture of Data Center 37

Network Architecture of Data Center 37

Network Integration 38

Network Integration 38

Population of Indonesia based on the Census, May 2010 (preliminary figures, Released Aug 2010)

Population of Indonesia based on the Census, May 2010 (preliminary figures, Released Aug 2010) Male (000) Female (000) Male + Female (000) 9, 508 11 8, 048 11 237, 556 39

Thank You 40

Thank You 40