Heykuki News

Top New Best Ask Show Jobs
391.
Show HN: A CLI tool for maintaining datasets in a centralized repository (github.com/ezhou7)
2 points
nightrunner11
7 years ago
discuss
392.
Library to scrape and clean web pages to create datasets (github.com/chiphuyen)
2 points
khartig
7 years ago
discuss
393.
Venmo Transaction Dataset (github.com/sa7mon)
2 points
_salmon
7 years ago
discuss
394.
Real numbers, data science and chaos: Fit any dataset with a single parameter (github.com/Ranlot)
2 points
Ranlot
7 years ago
discuss
395.
Lazynlp: Library to scrape and clean web pages to create datasets (github.com/chiphuyen)
2 points
Osiris30
7 years ago
discuss
396.
OpenWebText: Open Clone of OpenAI's GPT-2 WebText Dataset (github.com/jcpeterson)
2 points
joshuacpeterson
7 years ago
discuss
397.
Lazynlp: A library to scrape, clean, de-duplicate webpages to create datasets (github.com/chiphuyen)
2 points
korym
7 years ago
discuss
398.
DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning (github.com/AlexOlsen)
2 points
lainon
7 years ago
discuss
399.
Show HN: Python Script to Generate Fake Datasets for Testing ML/DL Workflows (github.com/minimaxir)
2 points
minimaxir
7 years ago
discuss
400.
Open source tool for merging datasets (github.com/funkeinteraktiv)
2 points
chrtze
7 years ago
discuss
401.
Analyzing League of Legends Dataset with Pandas and Python3 (gist.github.com)
2 points
kiyanwang
8 years ago
discuss
402.
Tracking progress in NLP tasks and datasets (github.com/sebastianruder)
2 points
neuhaus
8 years ago
discuss
403.
Show HN: Simple Recommender System for MovieLens Dataset built with JavaScript (github.com/javascript-machine-learning)
2 points
rwieruch
8 years ago
discuss
404.
Chatito – Generate training datasets for slot filling chatbots in a breeze (github.com/rodrigopivi)
2 points
prodrod
9 years ago
discuss
405.
Starcraft AI Research Dataset (github.com/TorchCraft)
2 points
jonbaer
9 years ago
discuss
406.
StarData: A StarCraft AI Research Dataset (github.com/TorchCraft)
2 points
indescions_2017
9 years ago
discuss
407.
Using the Dataset API for TensorFlow Input Pipelines (github.com/tensorflow)
2 points
mrry
9 years ago
discuss
408.
FMA: A Dataset for Music Analysis (github.com/mdeff)
2 points
sndean
9 years ago
discuss
409.
FMA dataset: 106k songs, 1TB, 343 days of audio (github.com/mdeff)
2 points
mdeff
9 years ago
discuss
410.
Rambler&Co Released Benchmark of XGBoost, VW and Spark ML on 1TB Criteo Dataset (github.com/rambler-digital-solutions)
2 points
pklemenkov
9 years ago
discuss
411.
OpenRefine – assess the quality of datasets (github.com/OpenRefine)
2 points
chirau
9 years ago
discuss
412.
Show HN: A hacking challenge based on the MNIST dataset (github.com/scvalencia)
2 points
scvalencia
9 years ago
discuss
413.
Golang: In memory dataset filtering (github.com/mattevans)
2 points
mattevansnz
9 years ago
discuss
414.
Collectible Card Game to Code Dataset (github.com/deepmind)
2 points
aaronyy
10 years ago
discuss
415.
Notes for “WikiQA: A challenge dataset for open-domain question answering” paper (gist.github.com)
2 points
shagunsodhani
10 years ago
discuss
416.
Datasets on fee-based open access publishing (github.com/OpenAPC)
2 points
Erikun
10 years ago
discuss
417.
A dataset of foosball sounds and a simple CNN with TensorFlow (github.com/dk1027)
2 points
dk1027
10 years ago
discuss
418.
Show HN: Download UCI datasets with python (gist.github.com)
2 points
thewhitetulip
10 years ago
discuss
419.
Ask HN: How to handle large datasets in front end of data apps? (github.com/Kanaries)
1 point
loa_observer
3 years ago
3 comments
420.
Gen-Selective Pseudo Labeling, Based on Datasets and Serverless Inference API (github.com/louisbrulenaudet)
1 point
brulenaudet
2 years ago
2 comments