Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
211.
▲
Show HN: Qri, a free and open source distributed dataset versioning tool
5 points
rgardaphe
7 years ago
discuss
212.
▲
Show HN: MNIST-Sequence – Generate dataset for sequences of handwritten digits
(github.com/ankitaggarwal011)
5 points
aaggarwal
9 years ago
discuss
213.
▲
VisualNexus – Training Pipeline for Visual Dataset Segmentation and Labeling
(github.com/kyegomez)
4 points
Reclaimer
3 years ago
3 comments
214.
▲
Addressing for PHP: Postal address management powered by Google's dataset
(github.com/commerceguys)
4 points
robertDouglass
12 years ago
1 comment
215.
▲
Show HN: Transform Unstructured Data into Usable Datasets
(github.com/wizenheimer)
4 points
wizenheimer
2 years ago
1 comment
216.
▲
Show HN: Cerebras-GPT-2.7B finetuned on Stanford Alpaca dataset
(github.com/lxe)
4 points
lxe
3 years ago
1 comment
217.
▲
Lichess Combined Puzzle-Game Dataset
(github.com/mcognetta)
4 points
mcyc
4 years ago
1 comment
218.
▲
Show HN: Drone Deploy Dataset – Segmentation with Pytorch
(github.com/s3nh)
4 points
s3nhxx
6 years ago
1 comment
219.
▲
TMAP: Visualizing High-Dimensional Data Sets as MSTs
(github.com/reymond-group)
4 points
daenuprobst
7 years ago
1 comment
220.
▲
Warp: convert and analyze data sets at light speed on Mac (just open sourced)
(github.com/pixelspark)
4 points
misterdata
10 years ago
1 comment
221.
▲
Show HN: pqry – A fast, lightweight CLI tool to diagnose Parquet datasets
(github.com/symblic)
4 points
setzeno
4 months ago
discuss
222.
▲
Show HN: Lance – Open lakehouse format for multimodal AI datasets
(github.com/lance-format)
4 points
criexe
5 months ago
discuss
223.
▲
A curated list of global electrical grid maps, datasets and resources
(github.com/open-energy-transition)
4 points
protontypes
7 months ago
discuss
224.
▲
The Well: A 15TB Collection of Physics Simulation Datasets
(github.com/PolymathicAI)
4 points
Anon84
9 months ago
discuss
225.
▲
Show HN: I built an offline VIN decoder using the NHTSA vPIC dataset
(github.com/cardog-ai)
4 points
samsullivan
a year ago
discuss
226.
▲
Show HN: Mount remote repositories and datasets managed by Git LFS locally
(github.com/git-lfs-fuse)
4 points
rueian
a year ago
discuss
227.
▲
Show HN: New AI Dataset Based on LibGen and Sci-Hub
(github.com/soskek)
4 points
superpirate
3 years ago
discuss
228.
▲
Alpaca dataset from Stanford, cleaned and curated
(github.com/gururise)
4 points
freediver
3 years ago
discuss
229.
▲
HaGRID is a large image dataset for hand gesture recognition systems
(github.com/hukenovs)
4 points
taubek
4 years ago
discuss
230.
▲
CO3D (Dataset for Image to 3D Reconstruction, by FB)
(github.com/facebookresearch)
4 points
schleck8
5 years ago
discuss
231.
▲
Texthero: A Python toolkit to work with text-based dataset effortlessly
(github.com/jbesomi)
4 points
nlpword
6 years ago
discuss
232.
▲
Show HN: Texthero, a Pandas-like API to work with text-dataset only
(github.com/jbesomi)
4 points
jonathanbesomi
6 years ago
discuss
233.
▲
Russian Open Speech to Text (STT/ASR) Dataset
(github.com/snakers4)
4 points
isqad
7 years ago
discuss
234.
▲
Awesome-Twitter-data: A list of Twitter datasets and related resources
(github.com/shaypal5)
4 points
shaypalachy
8 years ago
discuss
235.
▲
PeerRead: A Dataset of Scientific Peer Reviews
(github.com/allenai)
4 points
indescions_2018
8 years ago
discuss
236.
▲
Pypixgrid: generate vector tiles for the exploration of spatio-temporal datasets
(translate.googleusercontent.com)
4 points
based2
9 years ago
discuss
237.
▲
Dat – Distributed Dataset Synchronization and Versioning [pdf]
(github.com/datproject)
4 points
potomak
9 years ago
discuss
238.
▲
Show HN: DataBrewer – A CLI-tool to search and discover datasets
(github.com/rolando)
4 points
darkrho
9 years ago
discuss
239.
▲
Udacity adds 183gb of data to its driving dataset
(github.com/udacity)
4 points
EvgeniyZh
10 years ago
discuss
240.
▲
Show HN: Create simulated datasets in Python with Simulacrum
(github.com/jbrambleDC)
4 points
jbrambleDC
10 years ago
discuss
More