Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
721.
▲
Show HN: AI tool that walks citation graph and extracts data to create datasets
(github.com/eamag)
1 point
eamag
6 months ago
discuss
722.
▲
Training YOLO vision models on Kaggle datasets
(github.com/mfranzon)
1 point
walterbell
7 months ago
discuss
723.
▲
Show HN: Gaggle – A DuckDB extension for working with Kaggle datasets
1 point
habedi0
7 months ago
discuss
724.
▲
Show HN: I built a tool to sort a Northern Lights dataset for a CV model
(picsort.coolapso.sh)
1 point
coolapso
7 months ago
discuss
725.
▲
Show HN: Django PostgreSQL Anonymizer – prod → safe dev datasets (beta)
(github.com/CuriousLearner)
1 point
sanyam-khurana
8 months ago
discuss
726.
▲
A toolkit for improving the quality of your LeRobot datasets
(github.com/RoboticsData)
1 point
machinelearning
8 months ago
discuss
727.
▲
A new RAG algorithm to self-heal damaged datasets and query them on a graph
(github.com/iblameandrew)
1 point
scraper02
9 months ago
discuss
728.
▲
Show HN: Tensorpack a CLI tool for semantic discovery across datasets
1 point
AyodeleFikayomi
9 months ago
discuss
729.
▲
Procedural Reasoning Datasets
(github.com/open-thought)
1 point
t55
10 months ago
discuss
730.
▲
Reasoning Gym – Procedural RL reasoning datasets
(github.com/open-thought)
1 point
t55
10 months ago
discuss
731.
▲
multi_db: repo that uses Datastar and has a multi db setup, one for each user
(github.com/asmorris)
1 point
thunderbong
a year ago
discuss
732.
▲
Mochi Programming Language v0.6.0 – LINQ syntax for querying datasets
(github.com/mochilang)
1 point
scapbi
a year ago
discuss
733.
▲
Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning
(github.com/open-thought)
1 point
starzmustdie
a year ago
discuss
734.
▲
Datasets Are All You Need (LLM Learns to Prompt from Data)
(github.com/intellectronica)
1 point
intellectronica
a year ago
discuss
735.
▲
A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps
(github.com/eceo-epfl)
1 point
moatmoat
a year ago
discuss
736.
▲
RKaggle: Bring Kaggle Datasets Straight into the R console
(github.com/benyamindsmith)
1 point
SuperMint
a year ago
discuss
737.
▲
Show HN: I built a Graph Datastore that faster, simpler and cheaper
(github.com/jakobap)
1 point
jpoersc
a year ago
discuss
738.
▲
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
(github.com/Unakar)
1 point
limoce
a year ago
discuss
739.
▲
Drawdata: Draw Datasets from Within Jupyter
(github.com/koaning)
1 point
yamrzou
a year ago
discuss
740.
▲
Facebook Uncommon Objects in 3D Dataset
(github.com/facebookresearch)
1 point
taikon
a year ago
discuss
741.
▲
LENS: A Leo Satellite Network Measurement Dataset
(github.com/clarkzjw)
1 point
teleforce
2 years ago
discuss
742.
▲
Transform and optimize datasets for fast AI model training
(github.com/Lightning-AI)
1 point
shcheklein
2 years ago
discuss
743.
▲
Synthesizers 1896 – 2024: A Dataset and Exploratory Insights
(github.com/iftah-og)
1 point
Tomte
2 years ago
discuss
744.
▲
Synthesizer 1896-2024: a dataset and exploratory insights
(github.com/iftah-og)
1 point
anigbrowl
2 years ago
discuss
745.
▲
Transform and Optimize Datasets at Scale
(github.com/Lightning-AI)
1 point
shcheklein
2 years ago
discuss
746.
▲
Show HN: Dataset for South Asian Road Scene Understanding in Autonomous Driving
(github.com/hasibzunair)
1 point
hasibzunair
2 years ago
discuss
747.
▲
I made a viewer for the SWE-Bench dataset
(github.com/mwufi)
1 point
randomcatuser
2 years ago
discuss
748.
▲
Valkey: A Versatile Distributed Key-Value Datastore for Caching and Beyond
(github.com/valkey-io)
1 point
LaunchpadHacker
2 years ago
discuss
749.
▲
Densely Captioned Images (DCI) Dataset
(github.com/facebookresearch)
1 point
zerojames
2 years ago
discuss
750.
▲
Sakuga-42M Dataset: Scaling Up Cartoon Research
(github.com/zhenglinpan)
1 point
lnyan
2 years ago
discuss
More