Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
601.
A web scraping CLI made for AI that is idempotent (github.com/clemlesne)
81 points
clemlesne
2 years ago
31 comments
602.
Scrape: A simple, higher level interface for Go web scraping (github.com/yhat)
79 points
ericchiang
11 years ago
15 comments
603.
Pjscrape: A web-scraping framework written in JS using PhantomJS and jQuery (nrabinowitz.github.com)
79 points
jamesjyu
15 years ago
8 comments
604.
Show HN: Apify SDK – A scalable web crawling and scraping library for JavaScript (github.com/apifytech)
78 points
jancurn
8 years ago
8 comments
605.
CurlyQ: Command line helper for curl and web scraping (github.com/ttscoff)
67 points
ingve
2 years ago
19 comments
606.
Search-Script-Scrape: Web scraping exercises in Python 3 for data journalists (github.com/compjour)
61 points
danso
11 years ago
12 comments
607.
Pro scraping with Node.JS (github.com/chriso)
60 points
chrisohara
15 years ago
18 comments
608.
Show HN: Pipet – CLI tool for scraping and extracting data online, with pipes (github.com/bjesus)
49 points
yoavm
2 years ago
7 comments
609.
Diffgram scraping emails from commits on GH to send spam (web.archive.org)
40 points
jkittner
4 years ago
4 comments
610.
Paperjs: scriptographer ported to javascript (github.com/paperjs)
27 points
th0ma5
15 years ago
8 comments
611.
Show HN: ScreenSlicer – Automatic, zero-config web scraping (github.com/MachinePublishers)
26 points
logn
12 years ago
16 comments
612.
Opencart illegally stripping license and attribution from reused code (github.com/opencart)
22 points
mouhtasi
12 years ago
6 comments
613.
An open source API for web scraping (github.com/owainlewis)
19 points
owainlewis
11 years ago
10 comments
614.
Gnews – minimalistic JavaScript library for Google News scraping (github.com/DatanewsOrg)
15 points
caballeto
6 years ago
3 comments
615.
Show HN: Obscura – V8-powered headless browser for scraping and AI agents (github.com/h4ckf0r0day)
15 points
jryio
2 months ago
discuss
616.
Scraping Reddit with Akka Streams 1.0M2 (github.com/pkinsky)
14 points
pkinsky
11 years ago
discuss
617.
Show HN: Transistor, a Python web scraping framework for intelligent use cases (github.com/bomquote)
12 points
bobjordan
7 years ago
3 comments
618.
Web-scraping past bot-detection from GitHub Actions (e.g. Walmart prices) (github.com/mdmintz)
11 points
seleniumbase
8 months ago
discuss
619.
Show HN: Wring – CLI web scraping with CSS Sel, JS, XPath, written in PureScript (github.com/osener)
11 points
osener
10 years ago
discuss
620.
Show HN: Kimurai - A modern web scraping framework written in Ruby (github.com/vfreefly)
10 points
benicafe
8 years ago
discuss
621.
Best JavaScript scraping infrastructure
10 points
davidy123
11 years ago
discuss
622.
Show HN: Created Pickaxe a SQL like DSL for web scraping (github.com/bitsummation)
9 points
breeve
10 years ago
5 comments
623.
Scraping GitHub emails and spamming your project is not cool (github.com/SuperDuperDB)
9 points
denysvitali
3 years ago
discuss
624.
Quick but powerful research for AI agents with data scrapping and selenium
8 points
alexvomwald
a year ago
6 comments
625.
Scraping and Extracting the Cablegate HTML in Python (github.com/typecode)
8 points
amahon
16 years ago
discuss
626.
Tadpole the Language for Scraping 0.2.0 – Complex Control Flow, Stealth and More
7 points
zachperkitny
4 months ago
2 comments
627.
Domharvest: Semantic web scraping that survives DOM changes" (github.com/domharvest)
7 points
DomHarvest
5 months ago
2 comments
628.
Crawly – Elixir web scraping framework (github.com/elixir-crawly)
7 points
rahimnathwani
3 years ago
discuss
629.
Show HN: estela, a modern elastic web scraping cluster (github.com/bitmakerla)
7 points
breno
4 years ago
discuss
630.
Show HN: A Node.js script powered by Puppeteer for undetectable web scraping (github.com/darkotodoric)
6 points
darkotodoric
2 years ago
2 comments
More