Web scraping and crawling tools for extracting data from websites
| # | Repository | Stars | |
|---|---|---|---|
| 1 | puppeteer/puppeteer JavaScript API for Chrome and Firefox | 93.9K | |
| 2 | microsoft/playwright Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API. | 85.1K | |
| 3 | scrapy/scrapy Scrapy, a fast high-level web crawling & scraping framework for Python. | 61K | |
| 4 | cheeriojs/cheerio The fast, flexible, and elegant library for parsing and manipulating HTML and XML. | 30.2K |