/

scraper

automation
headless
crawler
web-scraping
webscraping
monitoring
feedgenerator
twitter-streaming
huginn
twitter
notifications
rss
feed
agent
hacktoberfest
html
jquery
dom
selector
parser
cheerio
htmlparser2
htmlparser
nodejs
npm
web-crawling
apify
web-crawler
javascript
playwright
headless-chrome
puppeteer

huginn/huginn
502日前40.6k

Create agents that monitor and act on your behalf. Your agents are standing by!

cheeriojs/cheerio
502日前27.5k

The fast, flexible, and elegant library for parsing and manipulating HTML and XML.

apify/crawlee
502日前11.4k

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

pwxcoo/chinese-xinhua
501日前10.5k

:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。

guyueyingmu/avbook
504日前9.2k

AV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database