ExcavatorSharp is a multi-threaded server for scraping web data. It converts HTML code into a structured array of object data. The library allows scraping data from multiple sites simultaneously, in parallel, within a single running application. Create scraping tasks and perform data extraction on a schedule.
The library can be used separately as crawler or parser. Work with sitemap, robots.txt is supported. It supports working with gzip / deflate compression. Work with links by masks, automatic link re-indexing and so on.
See the version list below for details.
Install-Package ExcavatorSharp.WebScraper.x64 -Version 1.0.0
dotnet add package ExcavatorSharp.WebScraper.x64 --version 1.0.0
<PackageReference Include="ExcavatorSharp.WebScraper.x64" Version="1.0.0" />
paket add ExcavatorSharp.WebScraper.x64 --version 1.0.0
First public release of ExcavatorSharp library
This package is not used by any popular GitHub repositories.