scraping
crawling
crawler
web-scraping
nodejs
puppeteer
python
framework
hacktoberfest
web-scraping-python
npm
web-crawling
apify
web-crawler
javascript
headless
playwright
automation
scraper
headless-chrome
mongodb
mongoose
chrome-headless
scrapy/scrapy503日前50.1k
Scrapy, a fast high-level web crawling & scraping framework for Python.
apify/crawlee502日前11.4k
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.