Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
121.
Yahoo open sources Anthelion web crawler for parsing structured data (github.com/yahoo)
159 points
fangwang
10 years ago
9 comments
122.
Show HN: Voyager – write a web crawler/scraper as a state machine in Rust (github.com/mattsse)
110 points
matsche
5 years ago
11 comments
123.
Katana: A crawling and spidering framework (github.com/projectdiscovery)
99 points
feross
4 years ago
25 comments
124.
Show HN: Apify SDK – A scalable web crawling and scraping library for JavaScript (github.com/apifytech)
78 points
jancurn
8 years ago
8 comments
125.
Show HN: Nebula – A network agnostic DHT crawler (github.com/dennis-tra)
68 points
dennis-tra
2 years ago
22 comments
126.
Show HN: wxpath – Declarative web crawling in XPath (github.com/rodricios)
64 points
rodricios
5 months ago
9 comments
127.
Show HN: An open-source rhythm dungeon crawler in 16 x 9 pixels (github.com/jgalecki)
55 points
jgalecki
a year ago
11 comments
128.
Show HN: I wrote a tiny Python-based HN crawler with scrapy (github.com/mvanveen)
53 points
mvanveen
14 years ago
28 comments
129.
Gerapy: Distributed Crawler Management Framework Based for Scrapy (github.com/Gerapy)
49 points
r_singh
6 years ago
discuss
130.
Show HN: crawl a website and store it in S3 from your browser (github.com/spullara)
43 points
spullara
15 years ago
12 comments
131.
Google Play Store in Numbers. Open Source Crawler for Mobile Apps Data (github.com/MarcelloLins)
39 points
marcellolins
12 years ago
15 comments
132.
Using Node.js and JQuery to Crawl Public Tweets (github.com/bcoe)
35 points
BenjaminCoe
14 years ago
13 comments
133.
Show HN: A modular, durable web-crawler for Clojure (github.com/shriphani)
29 points
shriphani
10 years ago
1 comment
134.
PiCrawler: A distributed web crawler using PiCloud (github.com/studio-ousia)
24 points
ikuyamada
13 years ago
5 comments
135.
Show HN: Yomuco – A simple web crawling library for Node.js (github.com/andraindrops)
23 points
jtakahashi64
2 years ago
3 comments
136.
Ask HN: Are you running a web crawler off the following IPs? It's broken
22 points
latitude
13 years ago
4 comments
137.
A New Web Archival Crawler Tackling Storage+Fidelity Issues (github.com/goelayu)
22 points
systemskid
4 years ago
1 comment
138.
Show HN: SpiderSuite: Advance GUI web security crawler (github.com/3nock)
19 points
3nock
3 years ago
2 comments
139.
Show HN: EndzinSrc – Wikipedia web crawler and PageRank algorithm implementation (github.com/ciganche)
17 points
lsr_ssri
8 years ago
discuss
140.
Show HN: (1 day project) I crawled +50k subreddits and made an interactive graph (github.com/ghgr)
14 points
ghgr
8 years ago
discuss
141.
Harvestman - Quick and dirty web crawling (github.com/mion)
10 points
mion
13 years ago
2 comments
142.
Show HN: A Links Crawler for News (github.com/egcodes)
10 points
egcodes
6 years ago
discuss
143.
Show HN: A web crawler that builds word frequency lists for websites (github.com/calebwin)
9 points
calebhwinston
8 years ago
discuss
144.
Show HN: Craigslist web crawler example in python3 and docker-compose (github.com/estin)
8 points
etatarkin
10 years ago
3 comments
145.
Dungeon-mode: a dungeon crawler game for Emacs (github.com/dungeon-mode)
8 points
dustfinger
6 months ago
discuss
146.
Show HN: I have written a cloud native dark web crawler in Go (github.com/creekorful)
7 points
creekorful
5 years ago
7 comments
147.
Tech-News Web-Crawler, Built on Node.js and jQuery (github.com/bcoe)
7 points
BenjaminCoe
13 years ago
discuss
148.
A spider crawl all room info of airbnb ,include reservation of the room (github.com/plantpark)
7 points
plantpark
10 years ago
discuss
149.
Show HN: (Sukhoi) Minimalist and Powerful Web Crawler in Python (github.com/iogf)
6 points
iogf
9 years ago
5 comments
150.
Anubis: Weighs the soul of HTTP requests using proof-of-work to stop AI crawlers (github.com/TecharoHQ)
6 points
pabs3
a year ago
discuss
More