Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
361.
▲
The Museum of Modern Art Research Dataset
(github.com/MuseumofModernArt)
61 points
danso
11 years ago
15 comments
362.
▲
Mozilla shuts project Iodide: Datascience documents in browsers
(github.com/iodide-project)
46 points
ritwiksaikia
6 years ago
6 comments
363.
▲
Chicago Crime Trends. Analyzing 3GB Dataset from Data.gov with SQL and Graphs
(github.com/axibase)
44 points
rodionos
9 years ago
3 comments
364.
▲
Dataset of Linus Torvalds' rants ranked by hate
(github.com/corollari)
42 points
fctorial
5 years ago
17 comments
365.
▲
ClickHouse Obfuscator – A tool for dataset anonymization
(github.com/ClickHouse)
39 points
rrampage
3 years ago
3 comments
366.
▲
DeepMind's machine-reading question/answer dataset
(github.com/deepmind)
37 points
andrewtbham
11 years ago
3 comments
367.
▲
Madlad-400: A Multilingual and Document-Level Large Audited Dataset
(github.com/google-research)
37 points
the_bookmaker
3 years ago
1 comment
368.
▲
A dataset of crimes committed in Buenos Aires
(github.com/ramadis)
34 points
ramadis
8 years ago
4 comments
369.
▲
Show HN: I used streaming to skip downloading my 45GB dataset
(github.com/DagsHub)
31 points
npRandom
4 years ago
discuss
370.
▲
Toxicity Dataset
(github.com/surge-ai)
25 points
CarrieLab
4 years ago
32 comments
371.
▲
Structured Etymology Dataset
(github.com/droher)
24 points
downboots
a year ago
3 comments
372.
▲
Washington Post publishes dataset of 52,000 criminal homicides
(github.com/washingtonpost)
24 points
danso
8 years ago
2 comments
373.
▲
I have trained StyleGAN2 from scratch with a dataset of female portraits
(github.com/l4rz)
20 points
EvgeniyZh
5 years ago
20 comments
374.
▲
VoxelCNN: Order-Aware Generative Modeling Using the 3D-Craft Dataset
(github.com/facebookresearch)
20 points
ingve
6 years ago
discuss
375.
▲
Show HN: I made this tool for navigating pandas datasets
(github.com/man-group)
20 points
leehcksource
6 years ago
discuss
376.
▲
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
(github.com/MinishLab)
19 points
Pringled
a year ago
6 comments
377.
▲
Show HN: Version code, models, & datasets together in GitHub
19 points
skadamat
3 years ago
6 comments
378.
▲
NLP: A new datasets and metrics library from Hugging Face
(github.com/huggingface)
19 points
julien_c
6 years ago
discuss
379.
▲
Show HN: Dataset of Linus Torvalds' rants sorted by hate
(github.com/corollari)
17 points
corollari
7 years ago
4 comments
380.
▲
GitHub: Awesome-reasoning, a curated list of datasets for reasoning AIs
(github.com/neurallambda)
17 points
neurallambda
2 years ago
discuss
381.
▲
A datastore library on Google App Engine for Clojure
(github.com/making)
16 points
va_coder
16 years ago
discuss
382.
▲
Datastax ripped us off
(github.com/managedfusion)
15 points
Throwadev
13 years ago
4 comments
383.
▲
ICLR 2026 – Institutional Affiliations Dataset and Analysis
(github.com/DmytroLopushanskyy)
15 points
stared
22 days ago
2 comments
384.
▲
Show HN: HTTP-nu – Nushell-scriptable HTTP server with SSE / Datastar
(github.com/cablehead)
14 points
ndyg
4 months ago
2 comments
385.
▲
Easy way to load, create, version, query and visualize computer vision datasets
13 points
morpheusme
4 years ago
discuss
386.
▲
Show HN: Standalone Implementation of WebRTC DataChannels in C++17
(github.com/paullouisageneau)
13 points
chapelierfou
7 years ago
discuss
387.
▲
Show HN: Dataset of 125k Medium Blog Post Titles and Subtitles (With Categories)
(github.com/turbo)
13 points
minxomat
7 years ago
discuss
388.
▲
Show HN: Create datasets more simply and improve AI model with unstructured data
(github.com/adansons)
12 points
KenichiHiguchi
4 years ago
3 comments
389.
▲
Fast and scalable dataset preparation and curation tool from Nvidia
(github.com/NVIDIA)
12 points
shcheklein
2 years ago
discuss
390.
▲
Show HN: ParcelKit integrates Core Data with Dropbox Datastore API
(github.com/overcommitted)
11 points
daniel_levine
13 years ago
7 comments
More