Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
211.
DataDM – Search and analyze datasets with LLMs (github.com/approximatelabs)
5 points
cle
3 years ago
discuss
212.
DataDM: Open-source local-LLM code-interpreter with dataset search (github.com/approximatelabs)
5 points
bluecoconut
3 years ago
discuss
213.
Show HN: Multiobjective Large-Scale Fashion Dataset with Distributional Shifts (github.com/st-tech)
5 points
nanikano
5 years ago
discuss
214.
Show HN: H5records – simple large dataset for pytorch training (github.com/theblackcat102)
5 points
polymorph1sm
5 years ago
discuss
215.
Show HN: Create APIs for static datasets without writing a single line of code (github.com/roapi)
5 points
houqp
5 years ago
discuss
216.
Show HN: We made a dataset differ! (Free, Open source) (github.com/qri-io)
5 points
rgardaphe
7 years ago
discuss
217.
Show HN: Qri, a free and open source distributed dataset versioning tool
5 points
rgardaphe
7 years ago
discuss
218.
Show HN: MNIST-Sequence – Generate dataset for sequences of handwritten digits (github.com/ankitaggarwal011)
5 points
aaggarwal
9 years ago
discuss
219.
VisualNexus – Training Pipeline for Visual Dataset Segmentation and Labeling (github.com/kyegomez)
4 points
Reclaimer
3 years ago
3 comments
220.
Addressing for PHP: Postal address management powered by Google's dataset (github.com/commerceguys)
4 points
robertDouglass
12 years ago
1 comment
221.
Show HN: Transform Unstructured Data into Usable Datasets (github.com/wizenheimer)
4 points
wizenheimer
2 years ago
1 comment
222.
Show HN: Cerebras-GPT-2.7B finetuned on Stanford Alpaca dataset (github.com/lxe)
4 points
lxe
3 years ago
1 comment
223.
Lichess Combined Puzzle-Game Dataset (github.com/mcognetta)
4 points
mcyc
4 years ago
1 comment
224.
Show HN: Drone Deploy Dataset – Segmentation with Pytorch (github.com/s3nh)
4 points
s3nhxx
6 years ago
1 comment
225.
Show HN: pqry – A fast, lightweight CLI tool to diagnose Parquet datasets (github.com/symblic)
4 points
setzeno
4 months ago
discuss
226.
Show HN: Lance – Open lakehouse format for multimodal AI datasets (github.com/lance-format)
4 points
criexe
5 months ago
discuss
227.
A curated list of global electrical grid maps, datasets and resources (github.com/open-energy-transition)
4 points
protontypes
7 months ago
discuss
228.
The Well: A 15TB Collection of Physics Simulation Datasets (github.com/PolymathicAI)
4 points
Anon84
9 months ago
discuss
229.
Show HN: I built an offline VIN decoder using the NHTSA vPIC dataset (github.com/cardog-ai)
4 points
samsullivan
a year ago
discuss
230.
Show HN: Mount remote repositories and datasets managed by Git LFS locally (github.com/git-lfs-fuse)
4 points
rueian
a year ago
discuss
231.
Show HN: New AI Dataset Based on LibGen and Sci-Hub (github.com/soskek)
4 points
superpirate
3 years ago
discuss
232.
Alpaca dataset from Stanford, cleaned and curated (github.com/gururise)
4 points
freediver
3 years ago
discuss
233.
HaGRID is a large image dataset for hand gesture recognition systems (github.com/hukenovs)
4 points
taubek
4 years ago
discuss
234.
CO3D (Dataset for Image to 3D Reconstruction, by FB) (github.com/facebookresearch)
4 points
schleck8
5 years ago
discuss
235.
Texthero: A Python toolkit to work with text-based dataset effortlessly (github.com/jbesomi)
4 points
nlpword
6 years ago
discuss
236.
Show HN: Texthero, a Pandas-like API to work with text-dataset only (github.com/jbesomi)
4 points
jonathanbesomi
6 years ago
discuss
237.
Russian Open Speech to Text (STT/ASR) Dataset (github.com/snakers4)
4 points
isqad
7 years ago
discuss
238.
Awesome-Twitter-data: A list of Twitter datasets and related resources (github.com/shaypal5)
4 points
shaypalachy
8 years ago
discuss
239.
PeerRead: A Dataset of Scientific Peer Reviews (github.com/allenai)
4 points
indescions_2018
8 years ago
discuss
240.
Pypixgrid: generate vector tiles for the exploration of spatio-temporal datasets (translate.googleusercontent.com)
4 points
based2
9 years ago
discuss
More