Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
361.
The Museum of Modern Art Research Dataset (github.com/MuseumofModernArt)
61 points
danso
11 years ago
15 comments
362.
Mozilla shuts project Iodide: Datascience documents in browsers (github.com/iodide-project)
46 points
ritwiksaikia
6 years ago
6 comments
363.
Chicago Crime Trends. Analyzing 3GB Dataset from Data.gov with SQL and Graphs (github.com/axibase)
44 points
rodionos
9 years ago
3 comments
364.
Dataset of Linus Torvalds' rants ranked by hate (github.com/corollari)
42 points
fctorial
5 years ago
17 comments
365.
ClickHouse Obfuscator – A tool for dataset anonymization (github.com/ClickHouse)
39 points
rrampage
3 years ago
3 comments
366.
DeepMind's machine-reading question/answer dataset (github.com/deepmind)
37 points
andrewtbham
11 years ago
3 comments
367.
Madlad-400: A Multilingual and Document-Level Large Audited Dataset (github.com/google-research)
37 points
the_bookmaker
3 years ago
1 comment
368.
A dataset of crimes committed in Buenos Aires (github.com/ramadis)
34 points
ramadis
8 years ago
4 comments
369.
Show HN: I used streaming to skip downloading my 45GB dataset (github.com/DagsHub)
31 points
npRandom
4 years ago
discuss
370.
Toxicity Dataset (github.com/surge-ai)
25 points
CarrieLab
4 years ago
32 comments
371.
Structured Etymology Dataset (github.com/droher)
24 points
downboots
a year ago
3 comments
372.
Washington Post publishes dataset of 52,000 criminal homicides (github.com/washingtonpost)
24 points
danso
8 years ago
2 comments
373.
I have trained StyleGAN2 from scratch with a dataset of female portraits (github.com/l4rz)
20 points
EvgeniyZh
5 years ago
20 comments
374.
VoxelCNN: Order-Aware Generative Modeling Using the 3D-Craft Dataset (github.com/facebookresearch)
20 points
ingve
6 years ago
discuss
375.
Show HN: I made this tool for navigating pandas datasets (github.com/man-group)
20 points
leehcksource
6 years ago
discuss
376.
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets (github.com/MinishLab)
19 points
Pringled
a year ago
6 comments
377.
Show HN: Version code, models, & datasets together in GitHub
19 points
skadamat
3 years ago
6 comments
378.
NLP: A new datasets and metrics library from Hugging Face (github.com/huggingface)
19 points
julien_c
6 years ago
discuss
379.
Show HN: Dataset of Linus Torvalds' rants sorted by hate (github.com/corollari)
17 points
corollari
7 years ago
4 comments
380.
GitHub: Awesome-reasoning, a curated list of datasets for reasoning AIs (github.com/neurallambda)
17 points
neurallambda
2 years ago
discuss
381.
A datastore library on Google App Engine for Clojure (github.com/making)
16 points
va_coder
16 years ago
discuss
382.
Datastax ripped us off (github.com/managedfusion)
15 points
Throwadev
13 years ago
4 comments
383.
ICLR 2026 – Institutional Affiliations Dataset and Analysis (github.com/DmytroLopushanskyy)
15 points
stared
22 days ago
2 comments
384.
Show HN: HTTP-nu – Nushell-scriptable HTTP server with SSE / Datastar (github.com/cablehead)
14 points
ndyg
4 months ago
2 comments
385.
Easy way to load, create, version, query and visualize computer vision datasets
13 points
morpheusme
4 years ago
discuss
386.
Show HN: Standalone Implementation of WebRTC DataChannels in C++17 (github.com/paullouisageneau)
13 points
chapelierfou
7 years ago
discuss
387.
Show HN: Dataset of 125k Medium Blog Post Titles and Subtitles (With Categories) (github.com/turbo)
13 points
minxomat
7 years ago
discuss
388.
Show HN: Create datasets more simply and improve AI model with unstructured data (github.com/adansons)
12 points
KenichiHiguchi
4 years ago
3 comments
389.
Fast and scalable dataset preparation and curation tool from Nvidia (github.com/NVIDIA)
12 points
shcheklein
2 years ago
discuss
390.
Show HN: ParcelKit integrates Core Data with Dropbox Datastore API (github.com/overcommitted)
11 points
daniel_levine
13 years ago
7 comments
More