Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
121.
▲
An analysis of 7M NFT transactions on the Ethereum blockchain [pdf]
(github.com/bugout-dev)
1 point
mpaepper
5 years ago
discuss
122.
▲
Launch HN: Activeloop (YC S18) – Data lake for deep learning
64 points
davidbuniat
4 years ago
24 comments
123.
▲
Ask HN: How are you extracting the best performance out of your RAG pipeline?
5 points
imaravind
2 years ago
4 comments
124.
▲
Lip2Wav: Synthesize Speech Only from the Lip Movements
4 points
prajwalkr
6 years ago
discuss
125.
▲
Show HN: SJT- A lightweight structured JSON table format for APIs
3 points
yukiakai
9 months ago
1 comment
126.
▲
InfoSeek: The First Open-Source Framework for Deep Research Data Synthesis
2 points
BAAIBeijing
9 months ago
1 comment
127.
▲
Show HN: RandomForestGenerator – CSV to ML in the browser, but local
(jonaraphael.github.io)
2 points
jonaraphael
5 months ago
discuss
128.
▲
Measuring Compositional Generalization in ML Architectures
1 point
esdee
6 years ago
discuss
129.
▲
Free/Open Source Datasets
(github.com/rasbt)
2 points
rouma7
11 years ago
discuss
130.
▲
Satellite Image Time Series Datasets
(github.com/corentin-dfg)
2 points
sebg
3 years ago
discuss
131.
▲
Show HN: Simple Python script to split (DL)training data (CNNs mainly)
(github.com/chinmayshah99)
2 points
chinmays
7 years ago
discuss
132.
▲
Chinese Language Corpora for Sentiment Analysis
(github.com/Lab41)
1 point
ghosthamlet
8 years ago
discuss
133.
▲
Show HN: Open Prompts – dataset of 10M Stable Diffusion generations
(github.com/krea-ai)
279 points
vipermu
4 years ago
71 comments
134.
▲
Tell HN: Full Hacker News dataset now available on BigQuery
238 points
minimaxir
11 years ago
43 comments
135.
▲
Dat – Distributed Dataset Synchronization and Versioning
(github.com/datproject)
229 points
ColinWright
9 years ago
39 comments
136.
▲
A multimodal dataset with one trillion tokens
(github.com/mlfoundations)
224 points
kulikalov
2 years ago
52 comments
137.
▲
An MNIST-like fashion product dataset
(github.com/zalandoresearch)
220 points
kashifr
9 years ago
21 comments
138.
▲
Qri: A global dataset version control system built on the distributed web
(github.com/qri-io)
204 points
anewhnaccount2
7 years ago
42 comments
139.
▲
Visualizations for machine learning datasets
(github.com/PAIR-code)
178 points
happy-go-lucky
9 years ago
7 comments
140.
▲
Finetuning of Falcon-7B LLM Using QLoRA on Mental Health Conversational Dataset
(github.com/iamarunbrahma)
160 points
iamarunbrahma
3 years ago
108 comments
141.
▲
Hypersim, Photorealistic Synthetic Dataset for Indoor Scene Understanding
(github.com/apple)
122 points
homarp
5 years ago
20 comments
142.
▲
Show HN: Dlt – Python library to automate the creation of datasets
(colab.research.google.com)
114 points
MatthausK
3 years ago
54 comments
143.
▲
Driving dataset for car autopilot AI training
(github.com/commaai)
100 points
EvgeniyZh
10 years ago
44 comments
144.
▲
Boston housing price dataset was removed from scikit-learn 1.2
(github.com/scikit-learn)
81 points
ok123456
3 years ago
84 comments
145.
▲
RipTable – multi-threaded Python data analytics tools for numpy arrays/datasets
(github.com/rtosholdings)
79 points
aldanor
6 years ago
14 comments
146.
▲
Show HN: Hyperparam: OSS tools for exploring datasets locally in the browser
(hyperparam.app)
77 points
platypii
a year ago
21 comments
147.
▲
Comma2k19 – A dataset of over 33 hours of commute in California's 280 highway
(github.com/commaai)
70 points
pd0wm
7 years ago
35 comments
148.
▲
How to query data.gov json datasets with SQL: a case study
(github.com/axibase)
68 points
rodionos
9 years ago
1 comment
149.
▲
The Museum of Modern Art Research Dataset
(github.com/MuseumofModernArt)
61 points
danso
11 years ago
15 comments
150.
▲
Chicago Crime Trends. Analyzing 3GB Dataset from Data.gov with SQL and Graphs
(github.com/axibase)
44 points
rodionos
9 years ago
3 comments
More