Heykuki News
Top
New
Best
Ask
Show
Jobs
601.
▲
Roapi: Create APIs for slow moving datasets without writing code
(github.com/roapi)
2 points
sea-gold
2 years ago
discuss
602.
▲
Reladiff: High-performance diffing of large datasets across databases
(github.com/erezsh)
2 points
todsacerdoti
2 years ago
discuss
603.
▲
The largest dataset of LLM jailbreak prompts
(github.com/verazuo)
2 points
titaniumrain
2 years ago
discuss
604.
▲
Microsoft/MS-MARCO-Web-Search: A large-scale information-rich web dataset
(github.com/microsoft)
2 points
alexmolas
2 years ago
discuss
605.
▲
OpenForest – A catalogue of open access forest datasets
(github.com/RolnickLab)
2 points
Brajeshwar
2 years ago
discuss
606.
▲
Dataset to extract stock tickers from NL
(github.com/rohanmahen)
2 points
rohanmahen
2 years ago
discuss
607.
▲
Show HN: Lightly Insights – open-source dataset analysis
(github.com/lightly-ai)
2 points
isusmelj
3 years ago
discuss
608.
▲
Fabricator – OSS framework to generate datasets with LLMs
(github.com/flairNLP)
2 points
aantti
3 years ago
discuss
609.
▲
Framework to easily create LLM powered bots over any dataset
(github.com/embedchain)
2 points
ensocode
3 years ago
discuss
610.
▲
Show HN: A Python toolkit for working with parquet datasets on AWS
(github.com/marwan116)
2 points
ortamina
3 years ago
discuss
611.
▲
Just in Time Datastructures
(github.com/UBOdin)
2 points
danny00
3 years ago
discuss
612.
▲
Processing large JSON datasets by streaming
(github.com/kashifrazzaqui)
2 points
kashif
3 years ago
discuss
613.
▲
RedPajama-Data: Code for preparing large datasets
(github.com/togethercomputer)
2 points
harrisonpowers
3 years ago
discuss
614.
▲
OpenFEMA Samples – Code, dataset, and analysis samples that utilize OpenFEMA API
(github.com/FEMA)
2 points
mindcrime
3 years ago
discuss
615.
▲
Benchmark of simple operations against common KV datastores with Python clients
(github.com/alisaifee)
2 points
indydevs
3 years ago
discuss
616.
▲
Open Source AI Image Classifier with Automatic Dataset Creator
(github.com/serpapi)
2 points
thefoolofdaath
3 years ago
discuss
617.
▲
Show HN: DescribeML is a VSCode language plugin to describe ML datasets
(github.com/SOM-Research)
2 points
softmodeling
4 years ago
discuss
618.
▲
Darmok and Jalad at Tanagra: Dataset and Model for English-Tamarian Translation
(github.com/cognitiveailab)
2 points
darwinwhy
4 years ago
discuss
619.
▲
SimilarVerbBank: Dataset of similar verbs formed with the Apriori algorithm
(github.com/nlptechbook)
2 points
jxireal
4 years ago
discuss
620.
▲
HuggingFace/evaluate: A library for easily evaluating ML models and datasets
(github.com/huggingface)
2 points
occamschainsaw
4 years ago
discuss
621.
▲
Open-source motion datasets collected by Bandai Namco Research
(github.com/BandaiNamcoResearchInc)
2 points
nikolay
4 years ago
discuss
622.
▲
Show HN: Bollywood Lyrics Dataset
(github.com/hbdeshmukh)
2 points
hdesh
4 years ago
discuss
623.
▲
Ivis: Dimensionality Reduction In Large Datasets Using Siamese Networks
(github.com/beringresearch)
2 points
optimalsolver
4 years ago
discuss
624.
▲
Show HN: H5records – large dataset format for deep learning
(github.com/theblackcat102)
2 points
polymorph1sm
5 years ago
discuss
625.
▲
PythonProgrammingPuzzles: A Dataset of Python Challenges for AI Research
(github.com/microsoft)
2 points
lnyan
5 years ago
discuss
626.
▲
Okdb can be the primitive datastructure for building many datastructures
(github.com/pre-srfi)
2 points
bryanrasmussen
5 years ago
discuss
627.
▲
Gretel-synthetics: open-source library to create synthetic datasets
(github.com/gretelai)
2 points
meowterspace42
5 years ago
discuss
628.
▲
AutoViz: Automatically visualize any dataset, any size with one line of code
(github.com/AutoViML)
2 points
optimalsolver
5 years ago
discuss
629.
▲
World Mortality Dataset – 2020 vs. past
(github.com/akarlinsky)
2 points
puttycat
5 years ago
discuss
630.
▲
Apache Pinot: A Real-Time Distributed OLAP Datastore
(github.com/apache)
2 points
caetris1
5 years ago
discuss