Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
391.
▲
Datasets and Evaluation Metrics for NLP (True Open Source GPT Alternative)
(github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
392.
▲
Datasets and evaluation metrics for natural language processing(NLP)
(github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
393.
▲
Datasets and Evaluation Metrics for Natural Language Processing (NLP)
(github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
394.
▲
Show HN: Covidify – coronavirus dataset and visualization generator
2 points
AaronWard
6 years ago
discuss
395.
▲
Show HN: A CLI tool for maintaining datasets in a centralized repository
(github.com/ezhou7)
2 points
nightrunner11
7 years ago
discuss
396.
▲
Library to scrape and clean web pages to create datasets
(github.com/chiphuyen)
2 points
khartig
7 years ago
discuss
397.
▲
Venmo Transaction Dataset
(github.com/sa7mon)
2 points
_salmon
7 years ago
discuss
398.
▲
Real numbers, data science and chaos: Fit any dataset with a single parameter
(github.com/Ranlot)
2 points
Ranlot
7 years ago
discuss
399.
▲
Lazynlp: Library to scrape and clean web pages to create datasets
(github.com/chiphuyen)
2 points
Osiris30
7 years ago
discuss
400.
▲
OpenWebText: Open Clone of OpenAI's GPT-2 WebText Dataset
(github.com/jcpeterson)
2 points
joshuacpeterson
7 years ago
discuss
401.
▲
Lazynlp: A library to scrape, clean, de-duplicate webpages to create datasets
(github.com/chiphuyen)
2 points
korym
7 years ago
discuss
402.
▲
DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning
(github.com/AlexOlsen)
2 points
lainon
7 years ago
discuss
403.
▲
Show HN: Python Script to Generate Fake Datasets for Testing ML/DL Workflows
(github.com/minimaxir)
2 points
minimaxir
7 years ago
discuss
404.
▲
Open source tool for merging datasets
(github.com/funkeinteraktiv)
2 points
chrtze
7 years ago
discuss
405.
▲
Scrape multiple crypto currency data sets-write to single .csv
(github.com/rootVIII)
2 points
rootVIII
8 years ago
discuss
406.
▲
Analyzing League of Legends Dataset with Pandas and Python3
(gist.github.com)
2 points
kiyanwang
8 years ago
discuss
407.
▲
Tracking progress in NLP tasks and datasets
(github.com/sebastianruder)
2 points
neuhaus
8 years ago
discuss
408.
▲
He Data Linter: Lightweight, Automated Sanity Checking for ML Data Sets
(github.com/brain-research)
2 points
blopeur
8 years ago
discuss
409.
▲
Show HN: Simple Recommender System for MovieLens Dataset built with JavaScript
(github.com/javascript-machine-learning)
2 points
rwieruch
8 years ago
discuss
410.
▲
Chatito – Generate training datasets for slot filling chatbots in a breeze
(github.com/rodrigopivi)
2 points
prodrod
9 years ago
discuss
411.
▲
Starcraft AI Research Dataset
(github.com/TorchCraft)
2 points
jonbaer
9 years ago
discuss
412.
▲
StarData: A StarCraft AI Research Dataset
(github.com/TorchCraft)
2 points
indescions_2017
9 years ago
discuss
413.
▲
Using the Dataset API for TensorFlow Input Pipelines
(github.com/tensorflow)
2 points
mrry
9 years ago
discuss
414.
▲
FMA: A Dataset for Music Analysis
(github.com/mdeff)
2 points
sndean
9 years ago
discuss
415.
▲
FMA dataset: 106k songs, 1TB, 343 days of audio
(github.com/mdeff)
2 points
mdeff
9 years ago
discuss
416.
▲
Rambler&Co Released Benchmark of XGBoost, VW and Spark ML on 1TB Criteo Dataset
(github.com/rambler-digital-solutions)
2 points
pklemenkov
9 years ago
discuss
417.
▲
OpenRefine – assess the quality of datasets
(github.com/OpenRefine)
2 points
chirau
9 years ago
discuss
418.
▲
Show HN: A hacking challenge based on the MNIST dataset
(github.com/scvalencia)
2 points
scvalencia
9 years ago
discuss
419.
▲
Golang: In memory dataset filtering
(github.com/mattevans)
2 points
mattevansnz
9 years ago
discuss
420.
▲
Collectible Card Game to Code Dataset
(github.com/deepmind)
2 points
aaronyy
10 years ago
discuss
More