apify/crawlee
steady
Crawlee: A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
TypeScript
Stars: 24,057
Forks: 1,466
Open issues: 136
24h: +15 (+0.1%)
7d: +103 (+0.4%)
Star history (7 days)
Last pushed: 5h ago