Heykuki News

Top | New | Best | Ask | Show | Jobs
241.
Pypixgrid: generate vector tiles for the exploration of spatio-temporal datasets (translate.googleusercontent.com)
4 points
based2
9 years ago
discuss
242.
Dat – Distributed Dataset Synchronization and Versioning [pdf] (github.com/datproject)
4 points
potomak
9 years ago
discuss
243.
Show HN: DataBrewer – A CLI-tool to search and discover datasets (github.com/rolando)
4 points
darkrho
9 years ago
discuss
244.
Udacity adds 183 GB of data to its driving dataset (github.com/udacity)
4 points
EvgeniyZh
10 years ago
discuss
245.
Show HN: Create simulated datasets in Python with Simulacrum (github.com/jbrambleDC)
4 points
jbrambleDC
10 years ago
discuss
246.
Show HN: Kiln - Interactive LLM fine-tuning, dataset collab & synthetic data gen (github.com/Kiln-AI)
3 points
scosman
a year ago
2 comments
247.
Large New Dataset 220k AI Art Text to Image Prompts (github.com/lee101)
3 points
wrdsmsh321
2 years ago
2 comments
248.
hfsearch: a fast cli tool to discover models and datasets on HuggingFace (github.com/HenokB)
3 points
henok_ademtew
6 months ago
1 comment
249.
Show HN: Torque – A declarative, typesafe DSL for LLM training datasets (MIT) (github.com/qforge-dev)
3 points
michalwarda
7 months ago
1 comment
250.
Hugging Face AI Sheets, open-source tool to vibe test models on your datasets (github.com/huggingface)
3 points
dvilasuero
10 months ago
1 comment
251.
Promptwright: Generate large synthetic datasets using a local LLM (github.com/StacklokLabs)
3 points
trickleup
2 years ago
1 comment
252.
Easily convert YouTube, Torrent and Enterprise videos into LLM datasets (github.com/qet-lab)
3 points
m_2018
2 years ago
1 comment
253.
CodeCapybara: Code Writing LLaMa Finetuned on Deepmind Dataset (github.com/AI4Code-Research)
3 points
brucethemoose2
3 years ago
1 comment
254.
UpliftML: An uplift modeling library that handles web scale datasets (github.com/bookingcom)
3 points
TaXxEr
5 years ago
1 comment
255.
A tool for creating deep learning datasets (github.com/dicroce)
3 points
dicroce
5 years ago
1 comment
256.
Show HN: A dataset of 40k professionally-written summaries of news articles (github.com/curationcorp)
3 points
CurationCorp
6 years ago
1 comment
257.
Crossfader: Autoencoders to find structure in arbitrary datasets (github.com/bettermg)
3 points
vierja
11 years ago
discuss
258.
Machine Learning: Access Tiny Images Dataset with Python (github.com/cioc)
3 points
cioc
13 years ago
discuss
259.
Open Data Hub Data Browser – Explore and Query Open Datasets (github.com/noi-techpark)
3 points
KadambariSuresh
3 months ago
discuss
260.
JQuery dataset() Plugin (github.com/realchaseadams)
3 points
nwienert
14 years ago
discuss
261.
WebZFS Modern Web Management for ZFS Pools/Datasets/Snapshots/Smart Monitoring (github.com/webzfs)
3 points
vermaden
5 months ago
discuss
262.
Data-morph: Morph a dataset into select shapes, while preserving the statistics (github.com/stefmolin)
3 points
ZeljkoS
9 months ago
discuss
263.
Show HN: Synthetic dataset generator for NLP and tabular data (github.com/VoxDroid)
3 points
voxdroid
a year ago
discuss
264.
DataChain: Prepare and curate datasets for AI/ML (github.com/iterative)
3 points
shcheklein
2 years ago
discuss
265.
Reladiff: High-performance diffing of large datasets across databases (github.com/erezsh)
3 points
PaulHoule
2 years ago
discuss
266.
RNNoise 0.2 – now trained using only publicly available CC-licensed datasets (github.com/xiph)
3 points
pabs3
2 years ago
discuss
267.
ClickHouse-Obfuscator – a tool for dataset anonymization (github.com/ClickHouse)
3 points
aeontech
3 years ago
discuss
268.
CommaVQ: Dataset of 100k Driving Videos (github.com/commaai)
3 points
kklisura
3 years ago
discuss
269.
Img2dataset: Turns large sets of image URLs to an image dataset (github.com/rom1504)
3 points
wildpeaks
3 years ago
discuss
270.
Dataset with Vulgar and Offensive California Vanity License Plates (github.com/veltman)
3 points
RamblingCTO
3 years ago
discuss