Heykuki News
Top
New
Best
Ask
Show
Jobs
121.
▲
Easily convert YouTube, Torrent and Enterprise videos into LLM datasets
(github.com/qet-lab)
3 points
m_2018
2 years ago
1 comment
122.
▲
UpliftML: An uplift modeling library that handles web scale datasets
(github.com/bookingcom)
3 points
TaXxEr
5 years ago
1 comment
123.
▲
A tool for creating deep learning datasets
(github.com/dicroce)
3 points
dicroce
5 years ago
1 comment
124.
▲
Crossfader: Autoencoders to find structure in arbitrary datasets
(github.com/bettermg)
3 points
vierja
11 years ago
discuss
125.
▲
Open Data Hub Data Browser – Explore and Query Open Datasets
(github.com/noi-techpark)
3 points
KadambariSuresh
3 months ago
discuss
126.
▲
WebZFS Modern Web Management for ZFS Pools/Datasets/Snapshots/Smart Monitoring
(github.com/webzfs)
3 points
vermaden
5 months ago
discuss
127.
▲
DataChain: Prepare and curate datasets for AI/ML
(github.com/iterative)
3 points
shcheklein
2 years ago
discuss
128.
▲
Reladiff: High-performance diffing of large datasets across databases
(github.com/erezsh)
3 points
PaulHoule
2 years ago
discuss
129.
▲
RNNoise 0.2 – now trained using only publicly available CC-licensed datasets
(github.com/xiph)
3 points
pabs3
2 years ago
discuss
130.
▲
Show HN: How simple (but clever) algorithms can find label issues in datasets
(playground.cleanlab.ai)
3 points
calebchiam
4 years ago
discuss
131.
▲
Show HN: Free Datasets for Spatial Engineers and Location Analysts
3 points
floriankuwala
4 years ago
discuss
132.
▲
Trustfall: A new, datasource-agnostic way to connect and query datasets
(github.com/obi1kenobi)
3 points
tosh
4 years ago
discuss
133.
▲
Covid-19 datasets by Our World in Data updated daily
(github.com/owid)
3 points
escot
4 years ago
discuss
134.
▲
The most comprehensive benchmark datasets for federated learning to date
3 points
xfzhu
5 years ago
discuss
135.
▲
xarray: N-Dimensional labeled arrays and datasets in Python
(github.com/pydata)
3 points
teleforce
5 years ago
discuss
136.
▲
List of tools and datasets for anomaly detection on time-series data
(github.com/rob-med)
3 points
mooreds
5 years ago
discuss
137.
▲
Datasets.io: An open source multi-tool for exploring and publishing data
(github.com/simonw)
3 points
gilad
5 years ago
discuss
138.
▲
Datasets and Evaluation Metrics for NLP
(github.com/huggingface)
3 points
dragonsh
6 years ago
discuss
139.
▲
Chatito: Generate NLP datasets using a simple DSL
(github.com/rodrigopivi)
3 points
nickswalker
6 years ago
discuss
140.
▲
Show HN: ETL and EDA on the Covid-19 global datasets using pandas and matplotlib
(github.com/PhantomInsights)
3 points
Agent_Phantom
6 years ago
discuss
141.
▲
Google released new datasets from Borg
(github.com/google)
3 points
zekrioca
6 years ago
discuss
142.
▲
Show HN: Flexible data exploration for mid-size datasets
(github.com/stefanhoelzl)
3 points
stefanhoelzl
7 years ago
discuss
143.
▲
Python script to accelerate the creation of custom computer vision datasets
(github.com/ahmedbesbes)
3 points
ahmedbesbes
7 years ago
discuss
144.
▲
Show HN: Hub – Serverless Scalable Numpy Array for Managing Datasets
(github.com/snarkai)
3 points
davidbuniat
7 years ago
discuss
145.
▲
An Excel plugin to access academic research datasets, backed by CovenantSQL
(github.com/melancholiaforever)
3 points
Dowwie
7 years ago
discuss
146.
▲
Show HN: Masquerade: A Postgres proxy that masks sensitive datasets in real time
(github.com/TonicAI)
3 points
akamor
7 years ago
discuss
147.
▲
Dendrite: Querying large datasets on a single host at near-interactive speeds
(github.com/jwhitbeck)
3 points
tosh
7 years ago
discuss
148.
▲
Maptable: Converts datasets to heat map, filters and table [JS]
(github.com/Packet-Clearing-House)
3 points
pjf
7 years ago
discuss
149.
▲
Short jokes datasets
(github.com/amoudgl)
3 points
sytelus
8 years ago
discuss
150.
▲
Show HN: Convenient Data Loader for many common NLP datasets
(github.com/chakki-works)
3 points
hironsan
9 years ago
discuss