Heykuki News
Top
New
Best
Ask
Show
Jobs
451.
▲
GABRIEL – turn messy qualitative corpora into analysis-ready datasets
(github.com/openai)
1 point
michaelsbradley
4 months ago
discuss
452.
▲
Show HN: Vietnam Elections (open, source-linked datasets and site)
(bamboo-filing-cabinet.github.io)
1 point
vietthan
4 months ago
discuss
453.
▲
The Guardian Headline Entailment Training Dataset
(github.com/daoudclarke)
1 point
daoudc
14 years ago
discuss
454.
▲
Fasttfidf: High-performance TF-IDF vectorization for large-scale text datasets
(github.com/purijs)
1 point
jspuri
5 months ago
discuss
455.
▲
Show HN: AI tool that walks citation graph and extracts data to create datasets
(github.com/eamag)
1 point
eamag
6 months ago
discuss
456.
▲
Training YOLO vision models on Kaggle datasets
(github.com/mfranzon)
1 point
walterbell
7 months ago
discuss
457.
▲
Show HN: Gaggle – A DuckDB extension for working with Kaggle datasets
1 point
habedi0
7 months ago
discuss
458.
▲
Show HN: I built a tool to sort a Northern Lights dataset for a CV model
(picsort.coolapso.sh)
1 point
coolapso
7 months ago
discuss
459.
▲
Show HN: Django PostgreSQL Anonymizer – prod → safe dev datasets (beta)
(github.com/CuriousLearner)
1 point
sanyam-khurana
8 months ago
discuss
460.
▲
A toolkit for improving the quality of your LeRobot datasets
(github.com/RoboticsData)
1 point
machinelearning
8 months ago
discuss
461.
▲
A new RAG algorithm to self-heal damaged datasets and query them on a graph
(github.com/iblameandrew)
1 point
scraper02
8 months ago
discuss
462.
▲
Show HN: Tensorpack, a CLI tool for semantic discovery across datasets
1 point
AyodeleFikayomi
8 months ago
discuss
463.
▲
Procedural Reasoning Datasets
(github.com/open-thought)
1 point
t55
10 months ago
discuss
464.
▲
Reasoning Gym – Procedural RL reasoning datasets
(github.com/open-thought)
1 point
t55
10 months ago
discuss
465.
▲
Mochi Programming Language v0.6.0 – LINQ syntax for querying datasets
(github.com/mochilang)
1 point
scapbi
a year ago
discuss
466.
▲
Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning
(github.com/open-thought)
1 point
starzmustdie
a year ago
discuss
467.
▲
Datasets Are All You Need (LLM Learns to Prompt from Data)
(github.com/intellectronica)
1 point
intellectronica
a year ago
discuss
468.
▲
A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps
(github.com/eceo-epfl)
1 point
moatmoat
a year ago
discuss
469.
▲
RKaggle: Bring Kaggle Datasets Straight into the R console
(github.com/benyamindsmith)
1 point
SuperMint
a year ago
discuss
470.
▲
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
(github.com/Unakar)
1 point
limoce
a year ago
discuss
471.
▲
Drawdata: Draw Datasets from Within Jupyter
(github.com/koaning)
1 point
yamrzou
a year ago
discuss
472.
▲
Facebook Uncommon Objects in 3D Dataset
(github.com/facebookresearch)
1 point
taikon
a year ago
discuss
473.
▲
LENS: A LEO Satellite Network Measurement Dataset
(github.com/clarkzjw)
1 point
teleforce
2 years ago
discuss
474.
▲
Transform and optimize datasets for fast AI model training
(github.com/Lightning-AI)
1 point
shcheklein
2 years ago
discuss
475.
▲
Synthesizers 1896 – 2024: A Dataset and Exploratory Insights
(github.com/iftah-og)
1 point
Tomte
2 years ago
discuss
476.
▲
Synthesizer 1896-2024: a dataset and exploratory insights
(github.com/iftah-og)
1 point
anigbrowl
2 years ago
discuss
477.
▲
Tool to prepare, curate, version datasets for AI/ML
(github.com/iterative)
1 point
shcheklein
2 years ago
discuss
478.
▲
Transform and Optimize Datasets at Scale
(github.com/Lightning-AI)
1 point
shcheklein
2 years ago
discuss
479.
▲
Show HN: Dataset for South Asian Road Scene Understanding in Autonomous Driving
(github.com/hasibzunair)
1 point
hasibzunair
2 years ago
discuss
480.
▲
I made a viewer for the SWE-Bench dataset
(github.com/mwufi)
1 point
randomcatuser
2 years ago
discuss