 
 
 
 
 




 
The ABC of Scanning Process
  • Scanning/Digitizing

Digitizing a document is different from digitizing the information on that document. Digitizing a document means scanning it to create an image of it in electronic media. Digitizing information, for example the information on a document, means transferring it into electronic media in text format.
Scanner selection:
The type of document you will scan directly affects this selection. Photographs and color documents call for slow, low-capacity scanners. Documents such as paychecks, invoices, and agreements call for fast, high-capacity scanners. Fragile historical documents should be scanned with camera scanners to avoid damaging them.
Furthermore, the answers to questions such as the size of the documents, whether they are in color, whether they are single- or double-sided, the scanning resolution, the total number of pages, and how many pages are scanned per month determine the choice of scanner. G&G Soft's experienced service bureau consultants help you invest in the right scanner. Our scanner rental service reduces scanning costs.
Scanning/digitizing is done in three ways:

  • Transfer into electronic media by using a scanner
  • Transfer of electronic documents into the archive system
  • Conversion of existing documents into an unchangeable image format

  • Preparation of Document

This stage is labor-intensive. Documents must be prepared for scanning: removing staples and paper clips, repairing worn pages, putting pages in order, removing post-it notes, and taking documents out of folders all take a lot of time. Because organizations generally do not realize how time-consuming this work is, they fall behind their project schedules. Careful document preparation slows down the overall scanning process considerably. With the service bureau service we offer our customers, all of these tasks are carried out by our experienced scanning consultants and scanning operators.

 

  • Scanner

Scanning is the conversion of a paper document into an electronic document format.
Purpose-built document scanners should be used for document scanning. Today's multifunction printers (MFPs) can also scan, and you can use them for low-volume jobs.
Moreover, fax images received from fax servers can be exported into the document management system. However, the quality of these images is poor because of the limits of fax technology.

After documents are scanned, image quality should be checked by a second operator. Degraded images should be marked and rescanned.


Scanner Categories


Scanners can be classified by speed. However, actual throughput, especially in service bureau operations, stays below the speed stated in the brochure. This is an important point to keep in mind.
Workgroup: scan speed 10-20 pages/minute
Department: scan speed 20-50 pages/minute
Archive: scan speed 50-100 pages/minute
High-volume archive: scan speed over 100 pages/minute
Paycheck scanners: high-speed scanners with the MICR (Magnetic Ink Character Recognition) feature.
In addition, camera scanners should be used for fragile documents, and A0 large-format scanners for oversized documents.
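
The speed classes above, and the warning that real throughput stays below the brochure figure, can be sketched in Python as follows. The function names, the handling of the shared boundaries (20, 50, and 100 pages/minute are assigned to the lower class), and the default `efficiency` of 0.6 are illustrative assumptions, not figures from G&G Soft.

```python
import math


def scanner_category(pages_per_minute: float) -> str:
    """Map a rated scan speed to one of the speed classes listed above."""
    if pages_per_minute < 10:
        return "below workgroup"
    if pages_per_minute <= 20:
        return "workgroup"
    if pages_per_minute <= 50:
        return "department"
    if pages_per_minute <= 100:
        return "archive"
    return "over 100 pages/minute"


def working_days(total_pages: int, rated_ppm: float,
                 efficiency: float = 0.6, hours_per_day: float = 8.0) -> int:
    """Estimate scanning days when real throughput is below the rated speed.

    efficiency is an assumed fraction of the brochure speed actually achieved.
    """
    pages_per_day = rated_ppm * efficiency * 60 * hours_per_day
    return math.ceil(total_pages / pages_per_day)


if __name__ == "__main__":
    print(scanner_category(60))
    print(working_days(500_000, rated_ppm=60))
```

Planning with a derated speed rather than the brochure speed is one simple way to keep a project calendar realistic; the right derating factor has to be measured on the actual documents.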

Evaluating scanners by their specifications:


Feeder or flatbed: A scanner must have an automatic document feeder for high-speed scanning. Photographs and fragile documents can be scanned on flatbed scanners.
Resolution: Ranges from 100 dpi to 1600 dpi. However, high-resolution images take up a lot of space, and large images take a long time to transfer over a network. On the other hand, good resolution increases the OCR success rate. A resolution of 300 dpi is sufficient for good OCR results.
Color, black and white, grayscale: Because color images are large, color scanning is generally not preferred; however, it can be used when photographic reproduction is needed. Black and white scanning is the most commonly preferred mode because it takes up the least space. Grayscale is effective for capturing graphics on pages.
Double-sided scanning: A must for scanning both sides of a page in one pass.
Scanning capacity: This also determines the type of scanner to choose. Heavy daily scanning calls for high-capacity scanners.
OEM materials: The replacement interval and price of parts that must be changed after a certain time are also important selection factors. Parts that can be replaced without technical support should be preferred.
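
To illustrate why resolution and color mode drive image size, the short Python sketch below computes the uncompressed size of one A4 page. The function name and the bit depths (1 bit for black and white, 8 for grayscale, 24 for color) are assumptions made for this example; real files are smaller once compressed.

```python
A4_MM = (210, 297)  # ISO A4 page size in millimetres
MM_PER_INCH = 25.4


def raw_image_bytes(dpi: int, bits_per_pixel: int, page_mm=A4_MM) -> int:
    """Uncompressed size in bytes of one scanned page."""
    width_px = round(page_mm[0] / MM_PER_INCH * dpi)
    height_px = round(page_mm[1] / MM_PER_INCH * dpi)
    return width_px * height_px * bits_per_pixel // 8


if __name__ == "__main__":
    for label, bits in (("black and white", 1), ("grayscale", 8), ("color", 24)):
        for dpi in (200, 300, 600):
            mb = raw_image_bytes(dpi, bits) / 1_000_000
            print(f"{label:>15} {dpi} dpi: {mb:.1f} MB")
```

Doubling the resolution roughly quadruples the pixel count, which is why 300 dpi is a common compromise between OCR accuracy and storage.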
Image Enhancement
After documents are scanned, the images are enhanced. Processes such as deskewing, despeckling, removing punch-hole marks, and handling blank pages reduce document size and increase quality.
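
As a rough illustration of two of these steps, the Python sketch below removes isolated specks from a black and white image and flags nearly empty pages. The image is modeled as a list of rows of 0 (white) and 1 (black); the function names and the `max_ink_ratio` threshold are assumptions for this example, and production scanning software uses more sophisticated filters.

```python
def despeckle(image):
    """Return a copy in which isolated black pixels (1) are turned white (0)."""
    height = len(image)
    width = len(image[0]) if height else 0
    cleaned = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1:
                continue
            # A speck is a black pixel with no black pixel among its 8 neighbours.
            has_black_neighbour = any(
                image[ny][nx] == 1
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            )
            if not has_black_neighbour:
                cleaned[y][x] = 0
    return cleaned


def is_blank(image, max_ink_ratio=0.005):
    """Treat a page as blank when the share of black pixels is very small."""
    total = sum(len(row) for row in image)
    ink = sum(sum(row) for row in image)
    return total == 0 or ink / total <= max_ink_ratio


if __name__ == "__main__":
    page = [
        [0, 0, 0, 0],
        [0, 1, 0, 0],  # isolated speck
        [0, 0, 0, 0],
        [0, 0, 1, 1],  # two connected pixels, kept
    ]
    print(despeckle(page))
```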

  • Indexing

After documents are scanned, they should be labeled with keywords. You can find documents in the document management system by using these keywords as search criteria, which is why indexing with the right words is very important.
Index fields: These are filled in by data entry operators (customer name, date, reference number, etc.).
Barcodes: Before a batch is scanned, the index data is encoded in a barcode. After scanning, the index data is read automatically from the barcode, so no data entry operator is needed.
Regional OCR: If the quality of the scanned documents is good, index data can be extracted by OCR from a specific region of the document. This is another way to do indexing.
Searchable PDF: Selecting PDF as the output document type does not by itself produce a searchable PDF. After the document is scanned, its text, tables, and images should be preserved as scanned, and the document should be passed through OCR and converted to searchable PDF format.
Image files of documents archived in digital media in various formats (TIFF, JPEG, GIF, PDF, etc.) are passed through OCR to enable full-text search. This lets you find a document quickly using any word in the document, not just the index fields. The difference from searchable PDF is that the data obtained from OCR is kept in a separate table, whereas in a searchable PDF it is kept inside the PDF file together with the image.
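
The idea of keeping OCR text in a separate table from the image record can be sketched with Python's built-in `sqlite3` module. The table names, column names, and helper functions here are hypothetical and chosen only for illustration; they do not describe the schema of any particular document management product. The search uses a simple case-insensitive substring match rather than a real full-text index.

```python
import sqlite3


def create_archive() -> sqlite3.Connection:
    """Create an in-memory archive: index fields and OCR text in separate tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE documents (
            id INTEGER PRIMARY KEY,
            image_path TEXT NOT NULL,
            customer TEXT,
            reference_no TEXT
        );
        CREATE TABLE ocr_text (
            doc_id INTEGER NOT NULL REFERENCES documents(id),
            page INTEGER NOT NULL,
            content TEXT NOT NULL
        );
    """)
    return conn


def add_document(conn, image_path, customer, reference_no, page_texts):
    """Store the index fields, then one OCR text row per page."""
    cur = conn.execute(
        "INSERT INTO documents (image_path, customer, reference_no) VALUES (?, ?, ?)",
        (image_path, customer, reference_no),
    )
    doc_id = cur.lastrowid
    conn.executemany(
        "INSERT INTO ocr_text (doc_id, page, content) VALUES (?, ?, ?)",
        [(doc_id, n, text) for n, text in enumerate(page_texts, start=1)],
    )
    return doc_id


def find_by_word(conn, word):
    """Return image paths of documents whose OCR text contains the given text."""
    rows = conn.execute(
        "SELECT DISTINCT d.image_path FROM documents d "
        "JOIN ocr_text t ON t.doc_id = d.id "
        "WHERE t.content LIKE ? ORDER BY d.id",
        (f"%{word}%",),
    )
    return [row[0] for row in rows]
```

A short usage example: after `add_document(conn, "scan/0001.tif", "Acme", "R-1", ["Invoice for toner"])`, calling `find_by_word(conn, "invoice")` returns the path of that image even though "invoice" is not one of its index fields.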
Scanned documents can be saved in different formats:
TIFF: A commonly used industry standard.
JPG: Thanks to its compression, used especially for color documents.
PDF: Industry standard. Full-text search is possible after the PDF is processed and indexed.
GIF: Lets you display and share high-quality graphics files.
DJVU: A technology that compresses TIFF documents by a factor of 5-10 without degrading their quality.
Average file sizes of the image produced by scanning an A4 page of a text-only MS Word document:

  • Black and white, 300 DPI, TIFF format compressed with CCITT Group 4: about 68 KB.
  • Black and white, 300 DPI, TIFF compressed to DJVU format: about 12 KB. Files in this format can be opened by general applications such as ACDSee, IrfanView, Internet Explorer, and PaperWork.
  • Color, 300 DPI, TIFF format compressed with JPEG: about 878 KB.
  • Black and white, 200 DPI, TIFF format compressed with CCITT Group 4: about 45 KB.
  • Black and white, 200 DPI, TIFF compressed to DJVU format: about 9 KB.
  • Color, 200 DPI, TIFF format compressed with JPEG: about 455 KB.
  • Color, 300 DPI, JPEG format: about 892 KB.
  • Black and white, 300 DPI, JPEG format: about 819 KB.
  • Color, 200 DPI, JPEG format: about 452 KB.
  • Black and white, 200 DPI, JPEG format: about 448 KB.
  • Color, 300 DPI, PDF format: about 871 KB.
  • Black and white, 300 DPI, PDF format: about 73 KB.
  • Color, 200 DPI, PDF format: about 453 KB.
  • Black and white, 200 DPI, PDF format: about 46 KB.
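
These averages can be turned into a rough storage estimate for a whole archive. The Python sketch below uses a few of the per-page figures from the list above; the dictionary keys, the function name, and the one-million-page example are assumptions for illustration, and real averages depend on the documents being scanned.

```python
# Average KB per A4 page, copied from the list above (subset).
AVG_PAGE_KB = {
    ("bw", 300, "tiff-g4"): 68,
    ("bw", 300, "djvu"): 12,
    ("bw", 300, "pdf"): 73,
    ("color", 300, "jpeg"): 892,
    ("bw", 200, "tiff-g4"): 45,
    ("color", 200, "jpeg"): 452,
}


def archive_size_gb(pages: int, kb_per_page: float) -> float:
    """Estimated archive size in GB (1 GB = 1024 * 1024 KB)."""
    return pages * kb_per_page / (1024 * 1024)


if __name__ == "__main__":
    pages = 1_000_000  # hypothetical archive size
    for (mode, dpi, fmt), kb in AVG_PAGE_KB.items():
        print(f"{mode:>5} {dpi} dpi {fmt:<8} {archive_size_gb(pages, kb):8.1f} GB")
```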

 

  • Form Processing

Information on scanned forms is processed by ICR, OCR, and OMR engines and converted into text data. The converted data can then be exported to the desired applications.
Recognition:
OCR converts printed text into text data.
ICR converts handwritten characters into text data.
OMR reads marks such as check boxes.
Barcode recognition reads the data on barcodes.
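
As a simplified illustration of how mark reading can work, the Python sketch below decides whether a check box is marked by measuring the share of black pixels inside its region. The image model (rows of 0 and 1), the `box` coordinates, and the 0.2 threshold are assumptions for this example; it is not a description of any specific OMR engine.

```python
def checkbox_marked(image, box, threshold=0.2):
    """Return True when enough pixels inside the box are black.

    image: list of rows of 0 (white) / 1 (black)
    box: (left, top, right, bottom), right and bottom exclusive
    threshold: assumed minimum share of black pixels for a mark
    """
    left, top, right, bottom = box
    cells = [image[y][x] for y in range(top, bottom) for x in range(left, right)]
    return bool(cells) and sum(cells) / len(cells) >= threshold


if __name__ == "__main__":
    form = [
        [0, 0, 0, 0, 0, 0],
        [0, 1, 0, 0, 0, 0],
        [0, 0, 1, 0, 0, 0],
        [0, 0, 0, 0, 0, 0],
    ]
    print(checkbox_marked(form, (0, 0, 3, 3)))  # left box contains a stroke
    print(checkbox_marked(form, (3, 0, 6, 3)))  # right box is empty
```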

  • Transfer

Information and documents can easily be exported to any system, such as:
Workflow systems,
ERP systems,
Document management and archive systems,
Disk storage systems, etc.
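
One common, system-neutral way to hand index data to another application is a CSV file. The Python sketch below writes index records with the standard `csv` module; the field names and the choice of CSV are assumptions for illustration, since the import interface of the target system is not described here.

```python
import csv
import io


def export_index_csv(records, fieldnames):
    """Serialize index records (dicts) to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


if __name__ == "__main__":
    records = [
        {"image_path": "scan/0001.tif", "customer": "Acme", "reference_no": "R-1"},
    ]
    print(export_index_csv(records, ["image_path", "customer", "reference_no"]))
```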

 

 

 

        
