Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
121.
A curated list of global electrical grid maps, datasets and resources (github.com/open-energy-transition)
4 points
protontypes
7 months ago
discuss
122.
The Well: A 15TB Collection of Physics Simulation Datasets (github.com/PolymathicAI)
4 points
Anon84
9 months ago
discuss
123.
Show HN: Mount remote repositories and datasets managed by Git LFS locally (github.com/git-lfs-fuse)
4 points
rueian
a year ago
discuss
124.
Awesome-Twitter-data: A list of Twitter datasets and related resources (github.com/shaypal5)
4 points
shaypalachy
8 years ago
discuss
125.
Pypixgrid: generate vector tiles for the exploration of spatio-temporal datasets (translate.googleusercontent.com)
4 points
based2
9 years ago
discuss
126.
Show HN: DataBrewer – A CLI-tool to search and discover datasets (github.com/rolando)
4 points
darkrho
9 years ago
discuss
127.
Show HN: Create simulated datasets in Python with Simulacrum (github.com/jbrambleDC)
4 points
jbrambleDC
10 years ago
discuss
128.
hfsearch: a fast cli tool to discover models and datasets on HuggingFace (github.com/HenokB)
3 points
henok_ademtew
6 months ago
1 comment
129.
Show HN: Torque – A declarative, typesafe DSL for LLM training datasets (MIT) (github.com/qforge-dev)
3 points
michalwarda
7 months ago
1 comment
130.
Hugging Face AI Sheets, open-source tool to vibe test models on your datasets (github.com/huggingface)
3 points
dvilasuero
10 months ago
1 comment
131.
Promptwright: Generate large synthetic datasets using a local LLM (github.com/StacklokLabs)
3 points
trickleup
2 years ago
1 comment
132.
Easily convert YouTube, Torrent and Enterprise videos into LLM datasets (github.com/qet-lab)
3 points
m_2018
2 years ago
1 comment
133.
UpliftML: An uplift modeling library that handles web scale datasets (github.com/bookingcom)
3 points
TaXxEr
5 years ago
1 comment
134.
A tool for creating deep learning datasets (github.com/dicroce)
3 points
dicroce
5 years ago
1 comment
135.
Crossfader: Autoencoders to find structure in arbitrary datasets (github.com/bettermg)
3 points
vierja
11 years ago
discuss
136.
Open Data Hub Data Browser – Explore and Query Open Datasets (github.com/noi-techpark)
3 points
KadambariSuresh
3 months ago
discuss
137.
WebZFS Modern Web Management for ZFS Pools/Datasets/Snapshots/Smart Monitoring (github.com/webzfs)
3 points
vermaden
5 months ago
discuss
138.
DataChain: Prepare and curate datasets for AI/ML (github.com/iterative)
3 points
shcheklein
2 years ago
discuss
139.
Reladiff: High-performance diffing of large datasets across databases (github.com/erezsh)
3 points
PaulHoule
2 years ago
discuss
140.
RNNoise 0.2 – now trained using only publicly available CC-licensed datasets (github.com/xiph)
3 points
pabs3
2 years ago
discuss
141.
Show HN: How simple (but clever) algorithms can find label issues in datasets (playground.cleanlab.ai)
3 points
calebchiam
4 years ago
discuss
142.
Show HN: Free Datasets for Spatial Engineers and Location Analysts
3 points
floriankuwala
4 years ago
discuss
143.
Trustfall: A new, datasource-agnostic way to connect and query datasets (github.com/obi1kenobi)
3 points
tosh
4 years ago
discuss
144.
Covid-19 datasets by Our World in Data updated daily (github.com/owid)
3 points
escot
4 years ago
discuss
145.
The most comprehensive benchmark datasets for federated learning to date
3 points
xfzhu
5 years ago
discuss
146.
xarray: N-Dimensional labeled arrays and datasets in Python (github.com/pydata)
3 points
teleforce
5 years ago
discuss
147.
List of tools and datasets for anomaly detection on time-series data (github.com/rob-med)
3 points
mooreds
5 years ago
discuss
148.
Datasets.io: An open source multi-tool for exploring and publishing data (github.com/simonw)
3 points
gilad
5 years ago
discuss
149.
Datasets and Evaluation Metrics for NLP (github.com/huggingface)
3 points
dragonsh
6 years ago
discuss
150.
Chatito: Generate NLP datasets using a simple DSL (github.com/rodrigopivi)
3 points
nickswalker
6 years ago
discuss
More