Bytescout.PDFExtractor
12.1.2.4156
dotnet add package Bytescout.PDFExtractor --version 12.1.2.4156
NuGet\Install-Package Bytescout.PDFExtractor -Version 12.1.2.4156
<PackageReference Include="Bytescout.PDFExtractor" Version="12.1.2.4156" />
paket add Bytescout.PDFExtractor --version 12.1.2.4156
#r "nuget: Bytescout.PDFExtractor, 12.1.2.4156"
// Install Bytescout.PDFExtractor as a Cake Addin
#addin nuget:?package=Bytescout.PDFExtractor&version=12.1.2.4156
// Install Bytescout.PDFExtractor as a Cake Tool
#tool nuget:?package=Bytescout.PDFExtractor&version=12.1.2.4156
Bytescout PDF Extractor SDK FREE Community Edition for .NET, ASP.NET, ActiveX - extract data from PDF documents
Product | Versions (compatible and additional computed target frameworks) |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 is compatible. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Framework | net20 is compatible. net35 was computed. net40 is compatible. net403 was computed. net45 was computed. net451 was computed. net452 was computed. net46 was computed. net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
- .NETCoreApp 2.0
  - Microsoft.Windows.Compatibility (>= 2.0.0)
NuGet packages (1)
Showing the top 1 NuGet packages that depend on Bytescout.PDFExtractor:
Package | Downloads |
---|---|
BizDoc.Applications.Invoice-Scan: Invoice for BizDoc | |
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
13.4.1.4801 | 5,152 | 7/24/2023 |
13.4.1.4792 | 1,592 | 7/20/2023 |
13.4.1.4787 | 1,835 | 7/14/2023 |
13.4.0.4760 | 2,410 | 5/24/2023 |
13.4.0.4755 | 1,635 | 5/24/2023 |
13.4.0.4734 | 1,985 | 5/9/2023 |
13.4.0.4727 | 1,823 | 5/2/2023 |
13.4.0.4717 | 1,713 | 4/17/2023 |
13.3.0.4514 | 6,034 | 9/27/2022 |
13.2.1.4489 | 6,983 | 6/13/2022 |
13.2.0.4485 | 5,535 | 6/7/2022 |
13.1.1.4480 | 4,464 | 5/25/2022 |
13.1.0.4386 | 27,031 | 1/25/2022 |
13.0.1.4281 | 3,028 | 11/8/2021 |
13.0.0.4254 | 2,057 | 10/4/2021 |
12.1.5.4183 | 2,833 | 7/5/2021 |
12.1.5.4181 | 1,795 | 7/5/2021 |
12.1.4.4171 | 2,124 | 6/17/2021 |
12.1.4.4169 | 1,613 | 6/17/2021 |
12.1.3.4167 | 2,007 | 6/16/2021 |
12.1.2.4156 | 1,947 | 5/28/2021 |
12.1.1.4149 | 2,030 | 5/26/2021 |
12.1.1.4145 | 1,847 | 5/26/2021 |
12.1.0.4136 | 2,145 | 5/18/2021 |
12.0.0.4062 | 2,610 | 2/8/2021 |
11.3.0.3983 | 2,783 | 10/26/2020 |
11.2.1.3959 | 3,975 | 9/1/2020 |
11.2.1.3929 | 2,606 | 7/14/2020 |
11.2.1.3926 | 2,070 | 7/9/2020 |
11.2.0.3919 | 2,304 | 6/30/2020 |
11.1.0.3869 | 4,789 | 4/10/2020 |
11.1.0.3864 | 2,290 | 4/4/2020 |
11.1.0.3849 | 2,409 | 3/27/2020 |
11.1.0.3845 | 2,163 | 3/19/2020 |
11.0.0.3834 | 2,278 | 3/6/2020 |
11.0.0.3832 | 2,309 | 3/4/2020 |
11.0.0.3830 | 2,371 | 3/4/2020 |
11.0.0.3815 | 2,337 | 2/21/2020 |
11.0.0.3805 | 2,443 | 2/11/2020 |
10.8.0.3758 | 3,495 | 12/19/2019 |
10.8.0.3750 | 2,322 | 12/17/2019 |
10.8.0.3744 | 2,262 | 12/12/2019 |
10.8.0.3741 | 2,061 | 12/10/2019 |
10.8.0.3736 | 2,280 | 12/6/2019 |
10.8.0.3732 | 2,198 | 12/4/2019 |
10.7.2.3710 | 2,756 | 11/13/2019 |
10.7.1.3705 | 2,245 | 11/11/2019 |
10.7.0.3697 | 2,370 | 11/2/2019 |
10.6.0.3666 | 3,575 | 10/1/2019 |
10.5.0.3637 | 3,031 | 9/2/2019 |
10.4.0.3618 | 2,688 | 8/15/2019 |
10.4.0.3613 | 2,241 | 8/13/2019 |
10.4.0.3602 | 2,580 | 8/7/2019 |
10.3.0.3566 | 2,900 | 7/2/2019 |
10.2.0.3548 | 3,544 | 6/13/2019 |
10.2.0.3534 | 2,183 | 6/11/2019 |
10.2.0.3525 | 2,266 | 6/7/2019 |
10.2.0.3514 | 2,252 | 5/28/2019 |
10.1.0.3444 | 2,705 | 4/5/2019 |
10.1.0.3439 | 1,989 | 4/4/2019 |
10.0.0.3429 | 2,058 | 3/25/2019 |
10.0.0.3427 | 2,064 | 3/25/2019 |
10.0.0.3424 | 2,013 | 3/23/2019 |
10.0.0.3423 | 1,934 | 3/23/2019 |
10.0.0.3422 | 1,975 | 3/23/2019 |
10.0.0.3421 | 1,947 | 3/21/2019 |
9.4.0.3398 | 1,978 | 3/12/2019 |
9.3.0.3366 | 4,027 | 2/12/2019 |
9.3.0.3357 | 1,786 | 2/4/2019 |
9.3.0.3354 | 1,747 | 1/31/2019 |
9.2.0.3293 | 3,509 | 11/20/2018 |
9.2.0.3262 | 2,057 | 10/24/2018 |
9.2.0.3259 | 1,698 | 10/24/2018 |
9.1.0.3170 | 2,960 | 7/26/2018 |
9.1.0.3167 | 2,051 | 7/18/2018 |
9.1.0.3165 | 1,859 | 7/18/2018 |
9.1.0.3163 | 1,867 | 7/18/2018 |
9.0.0.3095 | 3,239 | 4/23/2018 |
9.0.0.3087 | 2,267 | 4/13/2018 |
9.0.0.3080 | 2,127 | 4/11/2018 |
8.8.1.3046 | 2,980 | 2/20/2018 |
8.8.1.3025 | 3,118 | 1/29/2018 |
8.8.0.3021 | 2,200 | 1/23/2018 |
8.7.0.2981 | 15,371 | 11/8/2017 |
8.6.0.2917 | 3,029 | 8/2/2017 |
8.6.0.2912 | 1,883 | 8/1/2017 |
8.5.0.2863 | 2,217 | 6/9/2017 |
8.5.0.2861 | 2,335 | 6/8/2017 |
8.5.0.2856 | 2,080 | 6/1/2017 |
8.4.1.2829 | 6,483 | 4/12/2017 |
8.4.0.2821 | 2,155 | 3/29/2017 |
8.3.0.2809 | 2,991 | 3/13/2017 |
8.3.0.2806 | 1,941 | 3/12/2017 |
8.3.0.2803 | 1,970 | 3/6/2017 |
8.3.0.2801 | 1,978 | 3/6/2017 |
8.3.0.2800 | 1,967 | 3/6/2017 |
8.3.0.2798 | 2,040 | 3/6/2017 |
8.3.0.2796 | 1,948 | 3/6/2017 |
8.3.0.2794 | 1,967 | 3/6/2017 |
8.2.0.2699 | 2,423 | 1/11/2017 |
8.1.1.2606 | 3,432 | 10/25/2016 |
8.1.0.2600 | 2,003 | 10/21/2016 |
8.0.0.2542 | 2,313 | 9/1/2016 |
8.0.0.2541 | 2,047 | 9/1/2016 |
8.0.0.2528 | 2,077 | 8/23/2016 |
8.0.0.2523 | 2,175 | 8/19/2016 |
7.0.0.2493 | 32,387 | 6/27/2016 |
7.0.0.2489 | 1,758 | 6/27/2016 |
7.0.0.2480 | 4,457 | 6/10/2016 |
7.0.0.2474 | 5,345 | 5/26/2016 |
6.30.0.2421 | 2,094 | 3/24/2016 |
6.20.0.2354 | 2,247 | 1/20/2016 |
6.12.0.2239 | 5,155 | 9/22/2015 |
5.20.0.1871 | 2,700 | 2/5/2015 |
5.0.0.1626 | 2,556 | 8/14/2014 |
4.0.0.1487 | 1,896 | 5/31/2014 |
3.40.0.1349 | 2,159 | 3/11/2014 |
3.20.0.1092 | 2,318 | 8/5/2013 |
3.20.0.1075 | 3,185 | 7/12/2013 |
3.10.0.1051 | 2,105 | 6/29/2013 |
3.0.0.839 | 2,080 | 3/26/2013 |
2.50.0.769 | 2,091 | 2/25/2013 |
Bytescout PDF Extractor SDK FREE Community Edition for .NET, ASP.NET, ActiveX.
ByteScout, Inc. (c) 2008-2021.
FREE COMMUNITY EDITION
Compatibility: .NET Framework 2.0 or later; .NET Core 2.0 or later.
Works with: .NET, ASP.NET, ActiveX, Visual Basic 6, Classic ASP, Delphi and others.
Features:
- Extracts data from PDF files in TXT, CSV, XML, XLS, XLSX, JSON formats;
- Extracts embedded images, files and attachments from PDF files;
- Splits and merges PDF files, extracts a single page or range of pages;
- Extracts data from a whole document page or a specified rectangular region;
- Extracts PDF document information (author, subject, producer, etc.);
- Detects tables;
- Searches text inside document with regex support;
- Extracts data from PDF forms;
- Reads text from scanned PDF documents using OCR (Optical Character Recognition);
- Provides an ActiveX interface for use from legacy programming languages (Visual Basic 6, Delphi) and scripting languages (VBScript, JScript and others);
- And much more...
NOTE: Community Edition and Trial/Enterprise Edition may differ in features.
History of changes:
12.1.0.4136 (May 18, 2021)
==========================
+ Added property 'TextExtractor.FuzzySearch' that enables a 'fuzzy' text search algorithm. It finds 'approximately equal' strings.
+ Added 'DocumentSplitter2' class that splits document by found text.
+ Added 'CSVExtractor.NormalizeCSV' property. It makes CSV data produced from different document pages contain the same number of columns.
+ Added property 'JSONExtractor.OutputStructure' that changes the structure of the generated JSON to one of the predefined variants for easier postprocessing.
+ Added property 'JSONExtractor.OutputTransformation' that applies a JSONPath expression to the generated JSON.
+ Added property 'OCRPageCount' to extractor classes that contains number of pages for which OCR was performed.
+ 'JSONExtractor' and 'XMLExtractor' now add to the generated JSON and XML result the number of processed pages and the number of pages for which OCR was performed.
+ Added property 'OCRDetectLines' to extractor classes that improves column detection in scanned documents.
+ Added property 'ConsiderBackgroundColors' to extractor classes that enables detection of background color under text objects. It may help to improve row and column detection in tables without borders but with color stripes.
+ Added properties 'DocumentMerger.GenerateBookmarks' and 'DocumentMerger.BookmarkTitles' to enable automatic generation of bookmarks pointing to the merged parts.
= Improved PDF optimization in 'DocumentSplitter'.
= 'DocumentMerger' now uses the first input document as the base for the merged document. This preserves document information properties and outlines.
= DocumentMerger: added support for profiles.
= MultimediaExtractor: added support for more media types.
- 'TextExtractor.FindAll()' method was ignoring the case sensitivity option.
- Fixed issue with junk empty temporary files generated during OCR.
= Improved parsing of PDF documents.
= Other minor fixes and improvements.
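The effect described above for 'CSVExtractor.NormalizeCSV' can be illustrated independently of the SDK. The following Python sketch is not the SDK's implementation; it only shows one plausible way to give per-page CSV output a common column count, by padding short rows with empty cells. The function name `normalize_rows`, its input shape (one CSV string per page), and the padding strategy are all assumptions made for illustration.

```python
import csv
import io


def normalize_rows(pages):
    """Pad every row on every page to the widest row found across all pages.

    `pages` is a list of CSV strings, one per document page (hypothetical input).
    Returns one CSV string in which every row has the same number of columns.
    """
    # Parse each page's CSV text into a list of rows.
    parsed = [list(csv.reader(io.StringIO(text))) for text in pages]
    # Find the widest row across all pages (0 if there are no rows at all).
    width = max((len(row) for page in parsed for row in page), default=0)

    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for page in parsed:
        for row in page:
            # Pad short rows with empty cells so column counts match.
            writer.writerow(row + [""] * (width - len(row)))
    return out.getvalue()
```

With a two-column page followed by a three-column page, the rows of the first page gain one trailing empty cell, so a downstream consumer can treat the whole result as a single rectangular table.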
12.0.0.4062 (February 8, 2021)
==============================
+ Added public 'BaseExtractor.ExtractionArea' property (in addition to 'SetExtractionArea()' method) for more intuitive use.
= Added the new property 'ColumnDetectionByTextAlignment' to extractors that affects the detection of table columns that have no separating lines between them.
+ Added support for simplified profiles.
+ DocumentOptimizer: Added the property 'OptimizationOptions.GrayscaleImages' that converts all color images to grayscale.
+ UnsearchablePDFMaker: Added the new property 'KeepSkippedPages' that keeps pages excluded from the processing in the output document.
+ UnsearchablePDFMaker: Added the new property 'Grayscale' that converts all processed pages to grayscale.
+ Added the property 'BaseTextExtractor.TextAnalysisCorruptedTextThreshold' to fine-tune the text analysis.
= Member names in profiles are case-insensitive now.
= Improved filtering of invisible objects.
= Improved detection of bold fonts.
= Improved OCR rotation detection.
= Added missing OCR mode 'OCRMode.TextFromVectorsAndRepairedFonts'.
= RTL fonts detection is now enabled by default.
= JSON extractor now generates clean JSON (without the @ and # characters for attributes).
= Improved support for external Chinese fonts.
= Improved positioning of rotated PDF objects.
= Damaged CCITT and JBIG2 images are now skipped during rendering to avoid crashes.
= SearchablePDFMaker: improved OCR when 'DiscardExistingDocumentText' is enabled.
= 'SearchablePDFMaker.GetPageOCRCells()' now detects text color.
= OCR in all extractors now detects text color if the 'ConsiderFontColors' property is enabled.
= 'LineGroupingMode.JoinOrphanedRows' now separates rows of different color if 'ConsiderFontColors' property is enabled.
- InfoExtractor: Fixed a crash if the input document is an image.
- Fixed OCR crash on rotated text.
- 'IsOCRRecommendedForPage()' now skips text objects outside the page crop box.
= Improved parsing of PDF documents.
= Other minor fixes and improvements.
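For consumers that still hold JSON produced before the "clean JSON" change in this release, the difference can be sketched outside the SDK. The Python snippet below is an illustration only: it assumes the older output marked attribute-like keys with a leading "@" or "#", which is what the note above implies, and the key names in the usage example are hypothetical rather than taken from real SDK output.

```python
def strip_markers(node):
    """Recursively remove leading "@" and "#" characters from dictionary keys.

    Assumption: marker characters appear only as key prefixes. If a dictionary
    holds both "@name" and "name", the later key overwrites the earlier one.
    """
    if isinstance(node, dict):
        return {key.lstrip("@#"): strip_markers(value) for key, value in node.items()}
    if isinstance(node, list):
        return [strip_markers(item) for item in node]
    # Scalars (strings, numbers, booleans, None) are returned unchanged.
    return node


# Hypothetical old-style fragment; the key names are illustrative only.
old_style = {"@fontName": "Arial", "#text": "Total", "cells": [{"@x": 1}]}
clean = strip_markers(old_style)
```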
11.3.0.3983 (October 26, 2020)
==============================
+ DocumentSplitter: Added support for regions with inverted page numbers. For example, "!1" means "the last page", "!1-!3" or "!3-" means "last three pages".
+ DocumentSplitter: Added support for "*" split range that means "split every single page".
+ Added 'InfoExtractor.Metadata' property that gets XMP metadata from the document.
= Improved joining of multi-line cells in tables without borders ('LineGroupingMode.JoinOrphanedRows' mode).
= Improved detection of OCR language file versions.
= Improved .NET Core 2.0 compatibility.
= Improved unwrapping of multi-line cell text.
- Fixed issue when invisible vector drawings were causing unwanted separation of text objects.
- Fixed extraction from area when running OCR against image file (not PDF!).
= Improved parsing of PDF documents.
- Other minor fixes and improvements.
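The split-range notation introduced in this release ("!1", "!1-!3", "!3-", "*") can be made concrete with a small resolver. This Python sketch is not the 'DocumentSplitter' implementation; it only interprets the examples given above, and the function names, the 1-based numbering, and the handling of plain ranges such as "2-4" are assumptions.

```python
def resolve_page(token, page_count):
    """Convert one page token to a 1-based page number.

    A leading "!" counts from the end: "!1" is the last page.
    """
    if token.startswith("!"):
        return page_count - int(token[1:]) + 1
    return int(token)


def resolve_range(spec, page_count):
    """Return the parts described by `spec` as lists of 1-based page numbers."""
    spec = spec.strip()
    if spec == "*":
        # Every page becomes its own part.
        return [[n] for n in range(1, page_count + 1)]
    if "-" in spec:
        start_tok, end_tok = spec.split("-", 1)
        start = resolve_page(start_tok, page_count)
        # An open-ended range such as "!3-" runs to the last page.
        end = resolve_page(end_tok, page_count) if end_tok else page_count
        lo, hi = sorted((start, end))
        pages = list(range(lo, hi + 1))
    else:
        pages = [resolve_page(spec, page_count)]
    # Reject pages that fall outside the document.
    if any(p < 1 or p > page_count for p in pages):
        raise ValueError(f"range {spec!r} is outside 1..{page_count}")
    return [pages]
```

For a ten-page document, `resolve_range("!1-!3", 10)` and `resolve_range("!3-", 10)` both resolve to pages 8 through 10, matching the "last three pages" reading given in the release note.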
11.2.0.3919 (June 20, 2020)
===========================
+ 'MultimediaExtractor' now supports extraction of 3D-animation objects.
- 'TextExtractor.Find()' now keeps original font names in found object information.
= Improved column detection in 'ColumnDetectionMode.Borders' mode.
- 'SearchablePDFMaker' did not process vector-only pages. Fixed now.
= Improved regex text search in 'TextExtractor'.
+ Added 'DetectUnderlineTextStyle' and 'DetectStrikeoutTextStyle' properties to 'JSONExtractor' and 'XMLExtractor'.
+ Added 'OCRWhiteList' and 'OCRBlackList' properties to extractors.
+ Added 'Invert' OCR preprocessing filter.
+ Added 'Scale' OCR preprocessing filter.
= Improved joining of multi-line cells in tables without borders ('LineGroupingMode.JoinOrphanedRows' mode).
= Improved performance of 'ImageExtractor'.
+ Added page rectangles to 'InfoExtractor'.
= Improved 'OCRAnalyzer'.
= Improved automatic deletion of duplicated text objects during the extraction.
- Fixed extraction issues in .NET Core version.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.
11.1.0.3845 (March 19, 2020)
============================
+ Added 'OCROverallConfidence' property to all extractors.
+ SearchablePDFMaker: Added 'KeepOriginalRotation' property.
- SearchablePDFMaker: fixed crash on mixed English-Arabic text recognition.
+ PDF Multitool: Added "Developer Tools" sub-menu to the context menu.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.
...