A Python package & command-line tool to gather text on the Web — Trafilatura 2.0.0 documentation
Trafilatura is a Python package and command-line tool designed to gather text on the Web. Its main applications are web crawling, downloads, scraping, and extraction of main texts, comments and metadata.