Heykuki News

Top | New | Best | Ask | Show | Jobs
181.
Reladiff: High-performance diffing of large datasets across databases (github.com/erezsh)
2 points
todsacerdoti
2 years ago
discuss
182.
OpenForest – A catalogue of open access forest datasets (github.com/RolnickLab)
2 points
Brajeshwar
2 years ago
discuss
183.
Fabricator – OSS framework to generate datasets with LLMs (github.com/flairNLP)
2 points
aantti
3 years ago
discuss
184.
Show HN: A Python toolkit for working with parquet datasets on AWS (github.com/marwan116)
2 points
ortamina
3 years ago
discuss
185.
Processing large JSON datasets by streaming (github.com/kashifrazzaqui)
2 points
kashif
3 years ago
discuss
186.
RedPajama-Data: Code for preparing large datasets (github.com/togethercomputer)
2 points
harrisonpowers
3 years ago
discuss
187.
Show HN: DescribeML is a VSCode language plugin to describe ML datasets (github.com/SOM-Research)
2 points
softmodeling
4 years ago
discuss
188.
HuggingFace/evaluate: A library for easily evaluating ML models and datasets (github.com/huggingface)
2 points
occamschainsaw
4 years ago
discuss
189.
Open-source motion datasets collected by Bandai Namco Research (github.com/BandaiNamcoResearchInc)
2 points
nikolay
4 years ago
discuss
190.
Ivis: Dimensionality Reduction In Large Datasets Using Siamese Networks (github.com/beringresearch)
2 points
optimalsolver
4 years ago
discuss
191.
Gretel-synthetics: open-source library to create synthetic datasets (github.com/gretelai)
2 points
meowterspace42
5 years ago
discuss
192.
Witch-Trials: Datasets and Code for “Witch Trials” (Leeson and Russ 2018) (github.com/JakeRuss)
2 points
DyslexicAtheist
6 years ago
discuss
193.
Sweetviz: Visualize and compare datasets, target values and associations (github.com/fbdesignpro)
2 points
polm23
6 years ago
discuss
194.
Datasets and Evaluation Metrics for NLP (True Open Source GPT Alternative) (github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
195.
Datasets and evaluation metrics for natural language processing (NLP) (github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
196.
Datasets and Evaluation Metrics for Natural Language Processing (NLP) (github.com/huggingface)
2 points
dragonsh
6 years ago
discuss
197.
Show HN: A CLI tool for maintaining datasets in a centralized repository (github.com/ezhou7)
2 points
nightrunner11
7 years ago
discuss
198.
Library to scrape and clean web pages to create datasets (github.com/chiphuyen)
2 points
khartig
7 years ago
discuss
199.
Lazynlp: Library to scrape and clean web pages to create datasets (github.com/chiphuyen)
2 points
Osiris30
7 years ago
discuss
200.
Lazynlp: A library to scrape, clean, de-duplicate webpages to create datasets (github.com/chiphuyen)
2 points
korym
7 years ago
discuss
201.
Show HN: Python Script to Generate Fake Datasets for Testing ML/DL Workflows (github.com/minimaxir)
2 points
minimaxir
7 years ago
discuss
202.
Open source tool for merging datasets (github.com/funkeinteraktiv)
2 points
chrtze
7 years ago
discuss
203.
Tracking progress in NLP tasks and datasets (github.com/sebastianruder)
2 points
neuhaus
8 years ago
discuss
204.
Chatito – Generate training datasets for slot filling chatbots in a breeze (github.com/rodrigopivi)
2 points
prodrod
9 years ago
discuss
205.
Working with datasets in Clojure: select,where,aggregate,join,order,crosstab,etc (github.com/emiruz)
2 points
usgroup
9 years ago
discuss
206.
OpenRefine – assess the quality of datasets (github.com/OpenRefine)
2 points
chirau
9 years ago
discuss
207.
Datasets on fee-based open access publishing (github.com/OpenAPC)
2 points
Erikun
10 years ago
discuss
208.
Show HN: Download UCI datasets with python (gist.github.com)
2 points
thewhitetulip
10 years ago
discuss
209.
Ask HN: How to handle large datasets in front end of data apps? (github.com/Kanaries)
1 point
loa_observer
3 years ago
3 comments
210.
Gen-Selective Pseudo Labeling, Based on Datasets and Serverless Inference API (github.com/louisbrulenaudet)
1 point
brulenaudet
2 years ago
2 comments