Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
181.
▲
Reladiff: High-performance diffing of large datasets across databases
(github.com/erezsh)
2 points
todsacerdoti
2 years ago
discuss
182.
▲
OpenForest – A catalogue of open access forest datasets
(github.com/RolnickLab)
2 points
Brajeshwar
2 years ago
discuss
183.
▲
Fabricator – OSS framework to generate datasets with LLMs
(github.com/flairNLP)
2 points
aantti
3 years ago
discuss
184.
▲
Show HN: A Python toolkit for working with parquet datasets on AWS
(github.com/marwan116)
2 points
ortamina
3 years ago
discuss
185.
▲
Processing large JSON datasets by streaming
(github.com/kashifrazzaqui)
2 points
kashif
3 years ago
discuss
186.
▲
RedPajama-Data: Code for preparing large datasets
(github.com/togethercomputer)
2 points
harrisonpowers
3 years ago
discuss
187.
▲
Show HN: DescribeML is a VSCode language plugin to describe ML datasets
(github.com/SOM-Research)
2 points
softmodeling
4 years ago
discuss
188.
▲
HuggingFace/evaluate: A library for easily evaluating ML models and datasets
(github.com/huggingface)
2 points
occamschainsaw
4 years ago
discuss
189.
▲
Open-source motion datasets collected by Bandai Namco Research
(github.com/BandaiNamcoResearchInc)
2 points
nikolay
4 years ago
discuss
190.
▲
Ivis: Dimensionality Reduction In Large Datasets Using Siamese Networks
(github.com/beringresearch)
2 points
optimalsolver
4 years ago
discuss
191.
▲
Gretel-synthetics: open-source library to create synthetic datasets
(github.com/gretelai)
2 points
meowterspace42
5 years ago
discuss
192.
▲
Witch-Trials: Datasets and Code for “Witch Trials” (Leeson and Russ 2018)
(github.com/JakeRuss)
2 points
DyslexicAtheist
6 years ago
discuss
193.
▲
Sweetviz: Visualize and compare datasets, target values and associations
(github.com/fbdesignpro)
2 points
polm23
6 years ago
discuss
194.
▲
Datasets and Evaluation Metrics for NLP (True Open Source GPT Alternative)
(github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
195.
▲
Datasets and evaluation metrics for natural language processing(NLP)
(github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
196.
▲
Datasets and Evaluation Metrics for Natural Language Processing (NLP)
(github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
197.
▲
Show HN: A CLI tool for maintaining datasets in a centralized repository
(github.com/ezhou7)
2 points
nightrunner11
7 years ago
discuss
198.
▲
Library to scrape and clean web pages to create datasets
(github.com/chiphuyen)
2 points
khartig
7 years ago
discuss
199.
▲
Lazynlp: Library to scrape and clean web pages to create datasets
(github.com/chiphuyen)
2 points
Osiris30
7 years ago
discuss
200.
▲
Lazynlp: A library to scrape, clean, de-duplicate webpages to create datasets
(github.com/chiphuyen)
2 points
korym
7 years ago
discuss
201.
▲
Show HN: Python Script to Generate Fake Datasets for Testing ML/DL Workflows
(github.com/minimaxir)
2 points
minimaxir
7 years ago
discuss
202.
▲
Open source tool for merging datasets
(github.com/funkeinteraktiv)
2 points
chrtze
7 years ago
discuss
203.
▲
Tracking progress in NLP tasks and datasets
(github.com/sebastianruder)
2 points
neuhaus
8 years ago
discuss
204.
▲
Chatito – Generate training datasets for slot filling chatbots in a breeze
(github.com/rodrigopivi)
2 points
prodrod
9 years ago
discuss
205.
▲
Working with datasets in Clojure: select,where,aggregate,join,order,crosstab,etc
(github.com/emiruz)
2 points
usgroup
9 years ago
discuss
206.
▲
OpenRefine – assess the quality of datasets
(github.com/OpenRefine)
2 points
chirau
9 years ago
discuss
207.
▲
Datasets on fee-based open access publishing
(github.com/OpenAPC)
2 points
Erikun
10 years ago
discuss
208.
▲
Show HN: Download UCI datasets with python
(gist.github.com)
2 points
thewhitetulip
10 years ago
discuss
209.
▲
Ask HN: How to handle large datasets in front end of data apps?
(github.com/Kanaries)
1 point
loa_observer
3 years ago
3 comments
210.
▲
Gen-Selective Pseudo Labeling, Based on Datasets and Serverless Inference API
(github.com/louisbrulenaudet)
1 point
brulenaudet
2 years ago
2 comments
More