Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
481.
DataChain: Enrich, transform and curate datasets for ML (github.com/iterative)
1 point
shcheklein
2 years ago
discuss
482.
Densely Captioned Images (DCI) Dataset (github.com/facebookresearch)
1 point
zerojames
2 years ago
discuss
483.
Sakuga-42M Dataset: Scaling Up Cartoon Research (github.com/zhenglinpan)
1 point
lnyan
2 years ago
discuss
484.
TorchGeo: Datasets and pre-trained models for geospatial data (github.com/microsoft)
1 point
zerojames
2 years ago
discuss
485.
Renumics/spotlight: Interactively explore unstructured datasets from dataframes (github.com/Renumics)
1 point
rbanffy
2 years ago
discuss
486.
Show HN: Data Contract CLI – Test your datasets (github.com/datacontract)
1 point
aiobe
2 years ago
discuss
487.
ClimateSet – A Large-Scale Climate Model Dataset for Machine Learning (github.com/RolnickLab)
1 point
Brajeshwar
2 years ago
discuss
488.
Show HN: VQASynth – pipelines to synthesize VQA datasets (github.com/remyxai)
1 point
backflippinbozo
2 years ago
discuss
489.
Dataflux Dataset for PyTorch (github.com/GoogleCloudPlatform)
1 point
mattirv
2 years ago
discuss
490.
Multi-bitrate JPEG compression perceptual evaluation dataset 2023 (github.com/google-research)
1 point
ksec
2 years ago
discuss
491.
Access to public agricultural datasets for agricultural deep learning tasks (github.com/Project-AgML)
1 point
protontypes
3 years ago
discuss
492.
Full-fledged APIs for slowly moving datasets without writing code (github.com/roapi)
1 point
raider
3 years ago
discuss
493.
Anaconda's 2023 State of Data Science Dataset (github.com/anaconda)
1 point
amath
3 years ago
discuss
494.
Dataset on GitHub Users and Repositories (54M users, 220M repositories) (github.com/trickest)
1 point
zaric
3 years ago
discuss
495.
Tokenmonster: Determine tokens to optimally represents a dataset (github.com/alasdairforsythe)
1 point
anotherpaulg
3 years ago
discuss
496.
VoxelGPT: Open-source AI assistant for curating computer vision datasets (github.com/voxel51)
1 point
sickeythecat
3 years ago
discuss
497.
Auncel: Fast Approximate Vector Queries on Large Unstructured Datasets (github.com/pkusys)
1 point
teleforce
3 years ago
discuss
498.
Visualize your dataset using DINOv2 embedding
1 point
dnth
3 years ago
discuss
499.
Glami-1M: A Multilingual Image-Text Fashion Dataset (github.com/glami)
1 point
vackosar
4 years ago
discuss
500.
Learn to split any dataset with one line of code to break model’s generalization (github.com/YujiaBao)
1 point
Boga2510
4 years ago
discuss
501.
Show HN: MMAP_ninja: a library fo storing ML datasets in memory mapped format (github.com/hristo-vrigazov)
1 point
hvrigazov
4 years ago
discuss
502.
A Dataset and Explorer for 3D Signed Distance Functions (SDFs) (github.com/tovacinni)
1 point
speps
4 years ago
discuss
503.
A collection of mathematical art: a 3D signed distance function dataset (github.com/tovacinni)
1 point
tovacinni
4 years ago
discuss
504.
Strategic Transport Planning Dataset for Deep Graph Neural Networks (github.com/nikita68)
1 point
nikita68
4 years ago
discuss
505.
IPFS PyTorch Dataset (github.com/JakeKalstad)
1 point
boredumb
4 years ago
discuss
506.
IKEA 3D Assembly Dataset (github.com/IKEA)
1 point
robin_reala
5 years ago
discuss
507.
Pass: A large-scale image dataset that doesn't include any humans (github.com/yukimasano)
1 point
optimalsolver
5 years ago
discuss
508.
Reconnaissance Using Rapid7 Open Datasets (github.com/tg12)
1 point
MikeAshley178
5 years ago
discuss
509.
Latrend – Framework for clustering longitudinal datasets in a standardized way (github.com/philips-software)
1 point
JeroenKnoops1
5 years ago
discuss
510.
Booksum – A Collection of Datasets for Long-Form Narrative Summarization (github.com/salesforce)
1 point
simonpure
5 years ago
discuss
More