scraping

crawling
crawler
web-scraping
nodejs
puppeteer
python
framework
hacktoberfest
web-scraping-python
npm
web-crawling
apify
web-crawler
javascript
headless
playwright
automation
scraper
headless-chrome
mongodb
mongoose
chrome-headless

scrapy/scrapy
503 days ago, 50.1k

Scrapy, a fast high-level web crawling & scraping framework for Python.

apify/crawlee
502 days ago, 11.4k

Crawlee: a web scraping and browser automation library for Node.js for building reliable crawlers in JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP, in both headful and headless mode, with proxy rotation.

emadehsan/thal
506 days ago, 2.4k

Getting started with Puppeteer and Chrome Headless for Web Scraping