VB.NET: OCR Document & Images

How to OCR on document image using VB.NET
Yiigo provides .NET optical character recognition support included in Yiigo.Image for .NET, which can be easily added to your document imaging applications with fully-automated, dynamic, and feature-packed OCR functionalities. Find more OCR information on Yiigo.Imaging for .NET OCR Plugin.
Yiigo.Imaging for .NET OCR Plugin has those following functionalities:
  • Full page and zonal OCR
  • Extract text detail from scanned files
  • Output to searchable text document formats
  • OCR on black and white, grayscale and color images
  • Determine character, word, and line size and location
  • Detect, recognize multiple languages in a single image
For VB.NET developers, Yiigo provides online tutorials to recognize optical characters on documents and images using Yiigo.Image for .NET and .NET OCR Plugin in details. Find more OCR tutorials on OCR Support for Visual C#.NET, Web Imaging Viewer OCR Support and OCR in .NET Winforms.
OCR Image & Documents with VB.NET Sample Code

Requirements

Before starting optical character recognition in VB.NET, make sure that you have installed.

Start to OCR

  • Start your Microsoft Visual Studio;
  • Choose "Project" in "File-New" from the menu;
  • In "New Project" dialog box, choose "Visual Basic Projects";
  • Choose "Windows Application" in the "Templates" List and name it "YiigoImagingOCRVBNET";
  • In the "Solution Explorer" window from the context menu, right-click on the "References" folder, and select "Add Reference...";
  • In the "Add Reference" dialog box, select the ".NET" tab and browse to Yiigo.Imaging.Net.dll library and Yiigo.Imaging.Net.Recognition.Ocr.dll;
  • Add it to your VB.NET applications;
  • Import Yiigo .NET Image Namespace & copy those following codes:
Imports System.IO
Imports System.Drawing.Printing
Imports Yiigo.Imaging
Imports Yiigo.Imaging.Processing
Imports Yiigo.Imaging.Recognition.OCR

' Initialize OCR Engine
Dim OcrEngine As New TesseractOcrEngine()
TesseractOcrEngine.Initialize()
TesseractOcrEngine.Translate(imageCollection, "application/msword", outputPath)
TesseractOcrEngine.Shutdown()
TesseractOcrEngine.Dispose()

' Cancel OCR in Progress
TesseractOcrEngine.PageProgressCancel = "True"

' Recognize Pages
Private Sub button6_Click(sender As Object, e As System.EventArgs) Handles button6.Click
_ocrDocument.Pages.Recognize(Nothing)

Dim pdfFileName As String = "C:\Ocr1.pdf"
_ocrDocument.Save(pdfFileName, DocumentFormat.Pdf, Nothing)
System.Diagnostics.Process.Start(pdfFileName)
End Sub

' Create a Searchable PDF
Dim OcrEngine As New TesseractOcrEngine()
Dim PDFGenerator As New PDFGenerator()
PDFGenerator.Save = "C: \Ocr1.pdf"
Products
Other Plugins
Image Viewer CoreBarcode PluginPDF Read & WriteTesseract OCR PluginForm Processing PluginJBIG2 CodecJPEG2000 CodecISIS ScannerTwain ScannerDICOM ReaderCAD Scanner