Exoscan 4.0.0

dotnet add package Exoscan --version 4.0.0

NuGet\Install-Package Exoscan -Version 4.0.0

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="Exoscan" Version="4.0.0" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

paket add Exoscan --version 4.0.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: Exoscan, 4.0.0"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

// Install Exoscan as a Cake Addin
#addin nuget:?package=Exoscan&version=4.0.0

// Install Exoscan as a Cake Tool
#tool nuget:?package=Exoscan&version=4.0.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Interface	Description
IScheduler	Reading and writing from the job queue. By default, the in-memory queue is used, but you can provider your implementation
IVisitedLinkTracker	Tracker of visited links. A default implementation is an in-memory tracker. You can provide your own for Redis, MongoDB, etc.
IPageLoader	Loader that takes URL and returns HTML of the page as a string
IContentParser	Takes HTML and schema and returns JSON representation (JObject).
ILinkParser	Takes HTML as a string and returns page links
IScraperSink	Represents a data store for writing the results of web scraping. Takes the JObject as parameter
ISpider	A spider that does the crawling, parsing, and saving of the data

Project	Description
Exoscan	Library for web scraping
Exoscan.ScraperWorkerService	Example of using Exoscan library in a Worker Service .NET project.
Exoscan.DistributedScraperWorkerService	Example of using Exoscan library in a distributed way wih Azure Service Bus
Exoscan.AzureFuncs	Example of using Exoscan library with serverless approach using Azure Functions
Exoscan.ConsoleApplication	Example of using Exoscan library with in a console application

Version	Downloads	Last updated
4.0.2	539	11/29/2022
4.0.1	317	11/18/2022
4.0.0	316	11/18/2022

Exoscan 4.0.0

Exoscan

Overview

Install

Requirements

📋 Example:

Features:

Usage examples

API overview

SPA parsing example

Persist the progress locally

Authorization

Distributed web scraping with Serverless approach

StartScrapting

ExoscanSpider

Extensibility

Adding a new sink to persist your data

Intrefaces

Main entities

Repository structure

Coming soon:

Features under consideration

net6.0

NuGet packages

GitHub repositories