From fb7a0badfdf8ac22b4483dc524ec50d458d745e0 Mon Sep 17 00:00:00 2001 From: Mikhail Faraponov <11322032+moredure@users.noreply.github.com> Date: Mon, 23 Aug 2021 20:57:32 +0300 Subject: [PATCH] Add BUbiNG --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index af6cdd0..4ff07ea 100644 --- a/README.md +++ b/README.md @@ -48,6 +48,7 @@ A collection of awesome web crawler,spider and resources in different languages. * [ACHE Crawler](https://github.com/ViDA-NYU/ache) - An easy to use web crawler for domain-specific search. * [Apache Nutch](http://nutch.apache.org/) - Highly extensible, highly scalable web crawler for production environment. * [anthelion](https://github.com/yahoo/anthelion) - A plugin for Apache Nutch to crawl semantic annotations within HTML pages. +* [BUbiNG](https://github.com/LAW-Unimi/BUbiNG) - The LAW next generation crawler. * [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler. * [JSoup](http://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML. * [websphinx](http://www.cs.cmu.edu/~rcm/websphinx/) - Website-Specific Processors for HTML information extraction.