WARCProcessor is a platform independent integrative tool providing specific support to scientists that need to perform experiments in the field of web spam research. In detail, the developed application is specialized in the generation of curated corpus for training and validation purposes, widely supports the WARC format and allows the execution of user workflows using both GUI and command line.
WARCProcessor is open-source, being freely available to the scientific community, provides transparent deployment of new versions and can be executed on any computer without the need of downloading and installing additional packages.