IronWebScraper 4.0.4.3

There is a newer version of this package available.
See the version list below for details.

dotnet add package IronWebScraper --version 4.0.4.3

NuGet\Install-Package IronWebScraper -Version 4.0.4.3

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="IronWebScraper" Version="4.0.4.3" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

paket add IronWebScraper --version 4.0.4.3

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: IronWebScraper, 4.0.4.3"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

// Install IronWebScraper as a Cake Addin
#addin nuget:?package=IronWebScraper&version=4.0.4.3

// Install IronWebScraper as a Cake Tool
#tool nuget:?package=IronWebScraper&version=4.0.4.3

Iron Web Scraper is a C# web scraping library that lets developers simulate & automate human browsing behavior to extract content, files & images from web applications as native .NET objects. Iron Web Scraper manages politeness & multithreading in the background, keeping the developer's own application easy to understand & maintain.

Iron Web Scraper can be used to migrate content from existing websites, build search indexes, and monitor changes to website structure & content. Its functionality includes:
» Read & extract structured content from web pages using the HTML DOM, JavaScript, XPath, and jQuery-style CSS selectors.
» Fast multithreading allows hundreds of simultaneous requests.
» Politely avoid overloading remote servers using IP/domain-level throttling & optionally respecting robots.txt.
» Manage multiple identities, DNS, proxies, user agents, request methods, custom headers, cookies & logins.
» Data exported from websites becomes native C# objects which can be stored or used immediately.
» Exceptions are managed in all but the developer's own code. Errors and captchas are automatically retried on failure.
» Save, pause, resume, and autosave scrape jobs.
» Built-in web cache allows for action replay, crash recovery, and querying existing web scrape data. Change scrape logic on the fly, then replay the job without internet traffic.
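
To make the domain-level throttling idea above concrete, here is a minimal sketch that uses only standard .NET types. It is not Iron Web Scraper's API or its implementation: the `DomainThrottle` class, its `RunAsync` method, and the one-request-at-a-time-per-host policy are all assumptions chosen for illustration.

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical helper (NOT part of Iron Web Scraper): enforces a minimum delay
// between requests to the same host, while different hosts proceed in parallel.
public sealed class DomainThrottle
{
    private readonly TimeSpan _minDelay;

    // One gate per host, so requests to the same host run one at a time.
    private readonly ConcurrentDictionary<string, SemaphoreSlim> _gates =
        new ConcurrentDictionary<string, SemaphoreSlim>();

    // Completion time of the most recent request to each host.
    private readonly ConcurrentDictionary<string, DateTime> _lastRequest =
        new ConcurrentDictionary<string, DateTime>();

    public DomainThrottle(TimeSpan minDelay)
    {
        _minDelay = minDelay;
    }

    public async Task<T> RunAsync<T>(Uri url, Func<Task<T>> request)
    {
        SemaphoreSlim gate = _gates.GetOrAdd(url.Host, _ => new SemaphoreSlim(1, 1));
        await gate.WaitAsync();
        try
        {
            DateTime last;
            if (_lastRequest.TryGetValue(url.Host, out last))
            {
                // Wait out whatever remains of the minimum delay for this host.
                TimeSpan wait = _minDelay - (DateTime.UtcNow - last);
                if (wait > TimeSpan.Zero)
                {
                    await Task.Delay(wait);
                }
            }

            try
            {
                return await request();
            }
            finally
            {
                // Record the time even if the request threw, so retries are spaced too.
                _lastRequest[url.Host] = DateTime.UtcNow;
            }
        }
        finally
        {
            gate.Release();
        }
    }
}
```

A caller would wrap each fetch, for example `await throttle.RunAsync(uri, () => FetchAsync(uri))`, where `FetchAsync` stands in for whatever download method the application already uses. Keying the gate on `Uri.Host` keeps the sketch short; throttling per IP address, as the feature list also mentions, would need a DNS lookup first.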

Requires .NET Framework 4.5.2. Licensing & support are available for commercial deployments. For code examples, documentation & more, visit http://ironsoftware.com/cshapr/webscraper. For support, please email us at developers@ironsoftware.com.

Product	Compatible and additional computed target framework versions.
.NET Framework	net452 is compatible. net46 was computed. net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed.

This package has no dependencies.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last updated
2024.11.7	99	11/6/2024
2024.10.1	204	10/1/2024
2024.9.1	219	9/9/2024
2024.8.4	428	7/30/2024
2024.7.1	835	7/5/2024
2024.6.1	452	6/4/2024
2024.5.8	694	4/29/2024
2024.4.4	254	4/4/2024
2024.3.6	342	3/8/2024
2024.2.2	557	1/29/2024
2024.1.3	893	12/29/2023
2023.12.4	932	11/27/2023
2023.11.9	879	10/27/2023
2023.10.8	837	10/6/2023
2023.10.7	791	9/29/2023
2023.9.7	1,032	8/31/2023
2023.8.1	5,803	7/31/2023
2023.7.3	1,196	7/3/2023
2023.6.1	1,253	5/29/2023
2023.5.1	1,123	5/1/2023
2023.4.13	3,746	3/29/2023
2023.2.2	1,425	2/27/2023
2023.1.22	3,282	1/19/2023
2021.11.0	32,812	10/29/2021
2021.9.0	1,837	8/24/2021
2021.6.5	1,498	6/7/2021
4.0.4.3	25,388	4/14/2017
4.0.4	2,210	3/30/2017

Yield method changes to Scrape

Autosave is now instantaneous, rather than scheduled

Performance improved on huge scrape jobs (1 million+ pages)