itext.pdfocr.tesseract4 4.0.0

dotnet add package itext.pdfocr.tesseract4 --version 4.0.0

NuGet\Install-Package itext.pdfocr.tesseract4 -Version 4.0.0

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="itext.pdfocr.tesseract4" Version="4.0.0" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

paket add itext.pdfocr.tesseract4 --version 4.0.0

The NuGet Team does not provide support for the Paket client. Please contact its maintainers for support.

#r "nuget: itext.pdfocr.tesseract4, 4.0.0"

The #r directive can be used in F# Interactive and Polyglot Notebooks. Copy it into the interactive tool or into the script's source code to reference the package.

// Install itext.pdfocr.tesseract4 as a Cake Addin
#addin nuget:?package=itext.pdfocr.tesseract4&version=4.0.0

// Install itext.pdfocr.tesseract4 as a Cake Tool
#tool nuget:?package=itext.pdfocr.tesseract4&version=4.0.0

The NuGet Team does not provide support for the Cake client. Please contact its maintainers for support.

iText pdfOCR offers Optical Character Recognition (OCR) functionality to convert your scanned documents, PDFs and images into fully ISO-compliant PDF or PDF/A-3u files, making it possible to access and process the text they contain. The output can be configured as plain text, as a PDF with separate layers for the source image data and the recognized text, or as a flattened PDF with the layers merged.

Features:

- Powered by the open-source Tesseract 4 engine.
- Simple yet flexible API. The API is abstracted from the underlying engine, so other OCR engines can be supported with little or no effort from users.
- Supports multiple input image formats (BMP, PNM, PNG, JFIF, JPEG and TIFF).
- Text-only extraction option: iText pdfOCR can recognize text in documents and export it as a text file, which can then be used to populate external databases or be passed to other tools.

Visit our knowledge base to find code samples, manuals, documentation and more.

The API documentation is also available online.

Try our code in our developer sandbox or use our free apps, all in our iText Demo Lab.

Product	Compatible and additional computed target framework versions.
.NET Framework	net461 is compatible. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed.

.NETFramework 4.6.1
- itext.pdfocr.api (>= 4.0.0)

NuGet packages (2)

Showing the top 2 NuGet packages that depend on itext.pdfocr.tesseract4:

Package	Downloads
itext7.pdfocr.tesseract4: pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.	13.7K
itext.pdf2data: pdf2Data by Apryse allows you to extract and process data locked inside your PDF files.	193

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last updated
4.0.0	92	11/18/2024
3.0.2	1,448	2/7/2024
3.0.1	1,051	10/25/2023
