Text Extraction Feature

 

 

The Black Ice Printer Drivers, besides generating an image from the printed document, is also capable of extracting the text information from the printed document.

The text extracted from the document is saved as a standard text file, which can then be processed as necessary. The text file gets the same name as the image, but with the .txt extension.

 

 

Enabling Text Extraction

 

In order to configure Text Output in Black Ice printer driver, navigate to Control Panel > Devices and Printers > right click on the Black Ice Printer Driver > Printing Preferences > navigate to the Profile Manager tab and select the Text Output - Extract text from documents profile.

 

 

If the profile does not exist, navigate to Control Panel > Devices and Printers > right click on the Black Ice Printer Driver > Printing Preferences > navigate to the Text Output tab, and check the Generate Text Output option.

 

 

To retrieve the exact coordinates and styling of the text from the printed document, please select the Add font information, position and style option in the Formatting Style dropdown list. For more information, please refer to the Advanced Text Output.

 

Generate Text Output feature is able to recognize special character sequences in the printed text to extract text information in the text file and in the merge.mrg file. For more information please refer to the Mail Merge.

 

By default, the text file is generated as ANSI text, however, UNICODE text is also supported. In order to turn on the UNICODE text support, select a UNICODE option from the Character Set dropdown list.

 

For the limitations of the Generate Text Output feature, please refer to the Known Limitations.

 

For more information about the Text Output, please refer to the Text Output tab section.

 

 

Multipage and Single Page documents and Page Number delimiters

 

If the Save each page as separate file option is unchecked, and the printed document contains more than one page, all the extracted text is saved into a single text file. The text file contains delimiters for each new page in the “Page 1:”, “Page 2:” format. If the Save each page as separate file option is checked, a separate text file is generated for each image page generated by the driver.

One can disable the Page number delimiters in the text output by checking Disable Page Numbering option.

 

 

Example of how the Text Output works:

 

For Example, if we printing a test page, the output file will look as the following:

Description: AAA11E000001

 

The generated text file contains the following lines:

 

Page 1:

Windows XP

Printer Test Page

Congratulations!

If you can read this information, you have correctly installed your Black Ice TIFF 

Driver on TEST3.

The information below describes your printer driver and port settings.

Submitted Time: 5:17:00 PM 9/20/2004

Computer name: ALPAR

Printer name: Black Ice TIFF

Printer model: Black Ice TIFF Driver

Color support: No

Port name(s): IcePortMR:

Data format: NT EMF 1.003

Share name:

Location:

Comment:

Driver name: BuMDrvNT.dll

Data file: BuMIniNT.ini

Config file: BuMUifNT.dll

Driver version: 6.00

Environment: Windows NT x86

Default datatype: NT EMF 1.003

 

Additional files used by this driver:

C:\XP\System32\spool\DRIVERS\W32X86\3\BuMResNT.DLL

C:\XP\System32\spool\DRIVERS\W32X86\3\TIFF32.DLL (9, 1, 2, 0)

C:\XP\System32\spool\DRIVERS\W32X86\3\JPEG32.DLL

 

This is the end of the printer test page.