1. HtmlAgilityPack

    This is an agile HTML parser that builds a read/write DOM and supports plain XPath or XSLT (you don't have to understand XPath or XSLT to use it). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant of "real world" malformed HTML. The object model is very similar to...

  2. MisterHexCrawler

    A simple web crawler that returns an IObservable, using Reactive Extensions (Rx) and async/await.

  3. totalrecall

    Crawl and index your (static) ASP.NET website for searching using an MSBuild target. Simple search query interface. Uses Lucene.Net and NCrawler.

  4. Data Extracting SDK

    A library that lets you extract data from text or web sources, analyze text content, and write your own data mining and extraction applications and services.

  5. Spidy

    An F# web crawler

  6. Tenteikura

    A minimal multithreaded web crawler

  7. Advance.DB.Crawler

    This is my sample package

  8. Sitecore Search Contrib Crawler

    The crawling/indexing component of the Sitecore Search Contrib project

  9. SocialSense

    Crawler for the main social networks

  10. Sitecore Search Contrib Searcher

    The searching/querying component of the Sitecore Search Contrib project