Heykuki News
Top
New
Best
Ask
Show
Jobs
121.
Which other AI search engines should we keep an eye on?
1 point
james_chu
2 years ago
discuss
122.
Ask HN: What are some public real-time data sources?
1 point
amath
3 years ago
discuss
123.
An analysis of 7,020,950 NFT transactions on the Ethereum blockchain [pdf]
(github.com/bugout-dev)
4 points
zomglings
5 years ago
2 comments
124.
Public Real-Time Datasets and Sources
(github.com/bytewax)
4 points
skadamat
3 years ago
discuss
125.
Tech.ml.dataset – A Clojure high performance data processing system
(github.com/techascent)
4 points
simonpure
5 years ago
discuss
126.
tech.ml.dataset: A Clojure high performance data processing system
(github.com/techascent)
3 points
wlkr
2 years ago
discuss
127.
Aave V2 Health Factor Dataset
(github.com/credprotocol)
3 points
willwolf
4 years ago
discuss
128.
Show HN: UK Government Datasets
(github.com/i-dot-ai)
2 points
crimsoneer
a year ago
discuss
129.
tech.ml.dataset: A Clojure high performance data processing system
(github.com/techascent)
1 point
tosh
2 months ago
discuss
130.
100K Fake US People Profiles Dataset
(github.com/marko-simic)
1 point
qa-guy
4 years ago
discuss
131.
An analysis of 7M NFT transactions on the Ethereum blockchain [pdf]
(github.com/bugout-dev)
1 point
mpaepper
5 years ago
discuss
132.
Launch HN: Activeloop (YC S18) – Data lake for deep learning
64 points
davidbuniat
4 years ago
24 comments
133.
Ask HN: How are you extracting the best performance out of your RAG pipeline?
5 points
imaravind
2 years ago
4 comments
134.
Lip2Wav: Synthesize Speech Only from the Lip Movements
4 points
prajwalkr
6 years ago
discuss
135.
Show HN: SJT- A lightweight structured JSON table format for APIs
3 points
yukiakai
9 months ago
1 comment
136.
InfoSeek: The First Open-Source Framework for Deep Research Data Synthesis
2 points
BAAIBeijing
9 months ago
1 comment
137.
Show HN: RandomForestGenerator – CSV to ML in the browser, but local
(jonaraphael.github.io)
2 points
jonaraphael
5 months ago
discuss
138.
Measuring Compositional Generalization in ML Architectures
1 point
esdee
6 years ago
discuss
139.
Free/Open Source Datasets
(github.com/rasbt)
2 points
rouma7
11 years ago
discuss
140.
Satellite Image Time Series Datasets
(github.com/corentin-dfg)
2 points
sebg
3 years ago
discuss
141.
Show HN: Simple Python script to split (DL)training data (CNNs mainly)
(github.com/chinmayshah99)
2 points
chinmays
7 years ago
discuss
142.
Chinese Language Corpora for Sentiment Analysis
(github.com/Lab41)
1 point
ghosthamlet
8 years ago
discuss
143.
Show HN: Open Prompts – dataset of 10M Stable Diffusion generations
(github.com/krea-ai)
279 points
vipermu
4 years ago
71 comments
144.
Tell HN: Full Hacker News dataset now available on BigQuery
238 points
minimaxir
11 years ago
43 comments
145.
Dat – Distributed Dataset Synchronization and Versioning
(github.com/datproject)
229 points
ColinWright
9 years ago
39 comments
146.
A multimodal dataset with one trillion tokens
(github.com/mlfoundations)
224 points
kulikalov
2 years ago
52 comments
147.
An MNIST-like fashion product dataset
(github.com/zalandoresearch)
220 points
kashifr
9 years ago
21 comments
148.
Qri: A global dataset version control system built on the distributed web
(github.com/qri-io)
204 points
anewhnaccount2
7 years ago
42 comments
149.
Visualizations for machine learning datasets
(github.com/PAIR-code)
178 points
happy-go-lucky
9 years ago
7 comments
150.
Finetuning of Falcon-7B LLM Using QLoRA on Mental Health Conversational Dataset
(github.com/iamarunbrahma)
160 points
iamarunbrahma
3 years ago
108 comments