Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
421.
▲
OpenAI crowd sources LLM benchmarking datasets by offering advanced GPT-4 access
(github.com/openai)
1 point
teaearlgraycold
3 years ago
2 comments
422.
▲
Show HN: Critical Role Dungeons and Dragons dialogue sumamrization dataset
(github.com/RevanthRameshkumar)
1 point
cosmicskewl
6 years ago
2 comments
423.
▲
LogHub: A large dataset of real-world logs to benchmark your tools
(github.com/logpai)
1 point
kvaranasi_
2 months ago
1 comment
424.
▲
Show HN: Self-hosted DCF workspace using Damodaran datasets, LLM narratives
1 point
softcane
3 months ago
1 comment
425.
▲
Show HN: JSON dataset of 1,100 trending AI image prompts from X
(github.com/jau123)
1 point
jaujaujau
4 months ago
1 comment
426.
▲
Show HN: RAG-corpus-profiler – A linter for RAG datasets (dedup, PII, quality)
(github.com/aashirpersonal)
1 point
aashirpersonal
5 months ago
1 comment
427.
▲
Realistic enterprise security dataset with 23-day APT campaign
(github.com/gregdiy)
1 point
PhantomArmor
6 months ago
1 comment
428.
▲
Show HN: React-obj-view – A virtualized object inspector for large datasets
(github.com/vothanhdat)
1 point
datvo
6 months ago
1 comment
429.
▲
Claude Container 1.3.0 – Dockerized Claude Code with API Proxy and Datasette
(github.com/nezhar)
1 point
nezhar
8 months ago
1 comment
430.
▲
Show HN: Wrote a small tool that turns PDFs and docs into fine-tuning datasets
(github.com/Datalore-ai)
1 point
FineTuner42
10 months ago
1 comment
431.
▲
Show HN: DataChain – Tool to create, curate, version AI datasets
(github.com/iterative)
1 point
shcheklein
2 years ago
1 comment
432.
▲
National Park Service Data Is Now Available on Big Query Public Datasets
(github.com/tonymet)
1 point
tonymet
2 years ago
1 comment
433.
▲
Face Alignment API: Simple API to align faces when creating datasets/scraping
(github.com/botoxparty)
1 point
botoxparty
3 years ago
1 comment
434.
▲
GeoCOCO: Transform GIS annotations into COCO datasets for use in deep learning
(github.com/jaspersiebring)
1 point
qtieb
3 years ago
1 comment
435.
▲
A 30K-utterance dataset by making GPT-4 prompt two ChatGPT instances to converse
(github.com/radi-cho)
1 point
radicho123
3 years ago
1 comment
436.
▲
Dataset of ACM CCS'22 Paper “Understanding Security Issues in the NFT Ecosystem”
(github.com/ucsb-seclab)
1 point
holmessherl0ck
3 years ago
1 comment
437.
▲
Show HN: Proof of Concept – ExpressJS Web Application Firewall and Dataset
(github.com/fwd)
1 point
usernamebias
5 years ago
1 comment
438.
▲
COVIDx-US, largest curated open-access ultrasound imaging dataset for Covid-19
(github.com/nrc-cnrc)
1 point
ziptron
5 years ago
1 comment
439.
▲
Racism and the “Load_boston” Dataset
(github.com/BCG-Gamma)
1 point
amrrs
5 years ago
1 comment
440.
▲
PLS GIVE UR FEEDBACK: DPIPE Library to easily create TensorFlow datasets
(github.com/aiporre)
1 point
arielin1
6 years ago
1 comment
441.
▲
Show HN: Easily-configurable machine learning dataset pipelines
(github.com/jake-mason)
1 point
mason_jake
7 years ago
1 comment
442.
▲
Not_notMNIST: Generate your own datasets
1 point
RafazZ
9 years ago
1 comment
443.
▲
Show HN: Automatic Validation, Correction and Generation of Dataset Metadata
(github.com/ahmadassaf)
1 point
ahmadassaf
11 years ago
discuss
444.
▲
Small ground truth labeled dataset for swedish parking signs
(github.com/klintan)
1 point
klintcho
12 years ago
discuss
445.
▲
Dataset for 22 years of arXiv citation links
(github.com/paperscape)
1 point
robjk
12 years ago
discuss
446.
▲
A full-stack Last.fm 1k dataset insights page using Go/ClickHouse/React
(github.com/el10savio)
1 point
ugabuga
4 days ago
discuss
447.
▲
Show HN: Cohort Visualizer - A handy tool for browsing cohort datasets
(bslatkin.github.com)
1 point
bslatkin
14 years ago
discuss
448.
▲
Swedish Construction FAQ: 503 bilingual Q&A dataset, CC BY 4.0
(github.com/zaragoza-ab)
1 point
DecDEPO
2 months ago
discuss
449.
▲
Show HN: Fastdedup – Rust dataset deduplication (2:55 vs. 7:55 688MB vs. 22GB)
(wapplewhite4.github.io)
1 point
wapplewhite4
3 months ago
discuss
450.
▲
Show HN: Talpa – Datasette-powered reading stats dashboards for Kobo and Kindle
(github.com/gildo)
1 point
fyskij
3 months ago
discuss
More