Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
211.
OpenAI crowd sources LLM benchmarking datasets by offering advanced GPT-4 access (github.com/openai)
1 point
teaearlgraycold
3 years ago
2 comments
212.
Show HN: Self-hosted DCF workspace using Damodaran datasets, LLM narratives
1 point
softcane
3 months ago
1 comment
213.
Show HN: RAG-corpus-profiler – A linter for RAG datasets (dedup, PII, quality) (github.com/aashirpersonal)
1 point
aashirpersonal
5 months ago
1 comment
214.
Show HN: React-obj-view – A virtualized object inspector for large datasets (github.com/vothanhdat)
1 point
datvo
6 months ago
1 comment
215.
Show HN: Wrote a small tool that turns PDFs and docs into fine-tuning datasets (github.com/Datalore-ai)
1 point
FineTuner42
10 months ago
1 comment
216.
Show HN: DataChain – Tool to create, curate, version AI datasets (github.com/iterative)
1 point
shcheklein
2 years ago
1 comment
217.
National Park Service Data Is Now Available on Big Query Public Datasets (github.com/tonymet)
1 point
tonymet
2 years ago
1 comment
218.
Face Alignment API: Simple API to align faces when creating datasets/scraping (github.com/botoxparty)
1 point
botoxparty
3 years ago
1 comment
219.
GeoCOCO: Transform GIS annotations into COCO datasets for use in deep learning (github.com/jaspersiebring)
1 point
qtieb
3 years ago
1 comment
220.
Collection of datasets to train your own multi-modal GPT-4/LLMs (github.com/yaodongC)
1 point
yaodong_lukas
3 years ago
1 comment
221.
PLS GIVE UR FEEDBACK: DPIPE Library to easily create TensorFlow datasets (github.com/aiporre)
1 point
arielin1
6 years ago
1 comment
222.
Not_notMNIST: Generate your own datasets
1 point
RafazZ
9 years ago
1 comment
223.
Show HN: Cohort Visualizer - A handy tool for browsing cohort datasets (bslatkin.github.com)
1 point
bslatkin
14 years ago
discuss
224.
Synth-dataset-kit: Generate and audit synthetic datasets from seed data (github.com/KazKozDev)
1 point
kazkozdev
2 months ago
discuss
225.
GABRIEL – turn messy qualitative corpora into analysis-ready datasets (github.com/openai)
1 point
michaelsbradley
4 months ago
discuss
226.
Show HN: Vietnam Elections (open, source-linked datasets and site) (bamboo-filing-cabinet.github.io)
1 point
vietthan
4 months ago
discuss
227.
Fasttfidf: High-performance TF-IDF vectorization for large-scale text datasets (github.com/purijs)
1 point
jspuri
5 months ago
discuss
228.
Show HN: AI tool that walks citation graph and extracts data to create datasets (github.com/eamag)
1 point
eamag
5 months ago
discuss
229.
Training YOLO vision models on Kaggle datasets (github.com/mfranzon)
1 point
walterbell
7 months ago
discuss
230.
Show HN: Gaggle – A DuckDB extension for working with Kaggle datasets
1 point
habedi0
7 months ago
discuss
231.
Show HN: Django PostgreSQL Anonymizer – prod → safe dev datasets (beta) (github.com/CuriousLearner)
1 point
sanyam-khurana
8 months ago
discuss
232.
A toolkit for improving the quality of your LeRobot datasets (github.com/RoboticsData)
1 point
machinelearning
8 months ago
discuss
233.
A new RAG algorithm to self-heal damaged datasets and query them on a graph (github.com/iblameandrew)
1 point
scraper02
8 months ago
discuss
234.
Show HN: Tensorpack a CLI tool for semantic discovery across datasets
1 point
AyodeleFikayomi
8 months ago
discuss
235.
Procedural Reasoning Datasets (github.com/open-thought)
1 point
t55
10 months ago
discuss
236.
Reasoning Gym – Procedural RL reasoning datasets (github.com/open-thought)
1 point
t55
10 months ago
discuss
237.
Mochi Programming Language v0.6.0 – LINQ syntax for querying datasets (github.com/mochilang)
1 point
scapbi
a year ago
discuss
238.
Datasets Are All You Need (LLM Learns to Prompt from Data) (github.com/intellectronica)
1 point
intellectronica
a year ago
discuss
239.
RKaggle: Bring Kaggle Datasets Straight into the R console (github.com/benyamindsmith)
1 point
SuperMint
a year ago
discuss
240.
Drawdata: Draw Datasets from Within Jupyter (github.com/koaning)
1 point
yamrzou
a year ago
discuss
More