Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
211.
▲
OpenAI crowd sources LLM benchmarking datasets by offering advanced GPT-4 access
(github.com/openai)
1 point
teaearlgraycold
3 years ago
2 comments
212.
▲
Show HN: Self-hosted DCF workspace using Damodaran datasets, LLM narratives
1 point
softcane
3 months ago
1 comment
213.
▲
Show HN: RAG-corpus-profiler – A linter for RAG datasets (dedup, PII, quality)
(github.com/aashirpersonal)
1 point
aashirpersonal
5 months ago
1 comment
214.
▲
Show HN: React-obj-view – A virtualized object inspector for large datasets
(github.com/vothanhdat)
1 point
datvo
6 months ago
1 comment
215.
▲
Show HN: Wrote a small tool that turns PDFs and docs into fine-tuning datasets
(github.com/Datalore-ai)
1 point
FineTuner42
10 months ago
1 comment
216.
▲
Show HN: DataChain – Tool to create, curate, version AI datasets
(github.com/iterative)
1 point
shcheklein
2 years ago
1 comment
217.
▲
National Park Service Data Is Now Available on Big Query Public Datasets
(github.com/tonymet)
1 point
tonymet
2 years ago
1 comment
218.
▲
Face Alignment API: Simple API to align faces when creating datasets/scraping
(github.com/botoxparty)
1 point
botoxparty
3 years ago
1 comment
219.
▲
GeoCOCO: Transform GIS annotations into COCO datasets for use in deep learning
(github.com/jaspersiebring)
1 point
qtieb
3 years ago
1 comment
220.
▲
Collection of datasets to train your own multi-modal GPT-4/LLMs
(github.com/yaodongC)
1 point
yaodong_lukas
3 years ago
1 comment
221.
▲
PLS GIVE UR FEEDBACK: DPIPE Library to easily create TensorFlow datasets
(github.com/aiporre)
1 point
arielin1
6 years ago
1 comment
222.
▲
Not_notMNIST: Generate your own datasets
1 point
RafazZ
9 years ago
1 comment
223.
▲
Show HN: Cohort Visualizer - A handy tool for browsing cohort datasets
(bslatkin.github.com)
1 point
bslatkin
14 years ago
discuss
224.
▲
Synth-dataset-kit: Generate and audit synthetic datasets from seed data
(github.com/KazKozDev)
1 point
kazkozdev
2 months ago
discuss
225.
▲
GABRIEL – turn messy qualitative corpora into analysis-ready datasets
(github.com/openai)
1 point
michaelsbradley
4 months ago
discuss
226.
▲
Show HN: Vietnam Elections (open, source-linked datasets and site)
(bamboo-filing-cabinet.github.io)
1 point
vietthan
4 months ago
discuss
227.
▲
Fasttfidf: High-performance TF-IDF vectorization for large-scale text datasets
(github.com/purijs)
1 point
jspuri
5 months ago
discuss
228.
▲
Show HN: AI tool that walks citation graph and extracts data to create datasets
(github.com/eamag)
1 point
eamag
5 months ago
discuss
229.
▲
Training YOLO vision models on Kaggle datasets
(github.com/mfranzon)
1 point
walterbell
7 months ago
discuss
230.
▲
Show HN: Gaggle – A DuckDB extension for working with Kaggle datasets
1 point
habedi0
7 months ago
discuss
231.
▲
Show HN: Django PostgreSQL Anonymizer – prod → safe dev datasets (beta)
(github.com/CuriousLearner)
1 point
sanyam-khurana
8 months ago
discuss
232.
▲
A toolkit for improving the quality of your LeRobot datasets
(github.com/RoboticsData)
1 point
machinelearning
8 months ago
discuss
233.
▲
A new RAG algorithm to self-heal damaged datasets and query them on a graph
(github.com/iblameandrew)
1 point
scraper02
8 months ago
discuss
234.
▲
Show HN: Tensorpack a CLI tool for semantic discovery across datasets
1 point
AyodeleFikayomi
8 months ago
discuss
235.
▲
Procedural Reasoning Datasets
(github.com/open-thought)
1 point
t55
10 months ago
discuss
236.
▲
Reasoning Gym – Procedural RL reasoning datasets
(github.com/open-thought)
1 point
t55
10 months ago
discuss
237.
▲
Mochi Programming Language v0.6.0 – LINQ syntax for querying datasets
(github.com/mochilang)
1 point
scapbi
a year ago
discuss
238.
▲
Datasets Are All You Need (LLM Learns to Prompt from Data)
(github.com/intellectronica)
1 point
intellectronica
a year ago
discuss
239.
▲
RKaggle: Bring Kaggle Datasets Straight into the R console
(github.com/benyamindsmith)
1 point
SuperMint
a year ago
discuss
240.
▲
Drawdata: Draw Datasets from Within Jupyter
(github.com/koaning)
1 point
yamrzou
a year ago
discuss
More