Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
361.
Rill transforms data sets into opinionated dashboards using SQL. BI-as-code (github.com/rilldata)
2 points
nateb2022
2 years ago
discuss
362.
Roapi: Create APIs for slow moving datasets without writing code (github.com/roapi)
2 points
sea-gold
2 years ago
discuss
363.
Reladiff: High-performance diffing of large datasets across databases (github.com/erezsh)
2 points
todsacerdoti
2 years ago
discuss
364.
The largest dataset of LLM jailbreak prompts (github.com/verazuo)
2 points
titaniumrain
2 years ago
discuss
365.
Microsoft/MS-MARCO-Web-Search: A large-scale information-rich web dataset (github.com/microsoft)
2 points
alexmolas
2 years ago
discuss
366.
OpenForest – A catalogue of open access forest datasets (github.com/RolnickLab)
2 points
Brajeshwar
2 years ago
discuss
367.
Dataset to extract stock tickers from NL (github.com/rohanmahen)
2 points
rohanmahen
2 years ago
discuss
368.
Show HN: Lightly Insights – open-source dataset analysis (github.com/lightly-ai)
2 points
isusmelj
3 years ago
discuss
369.
Fabricator – OSS framework to generate datasets with LLMs (github.com/flairNLP)
2 points
aantti
3 years ago
discuss
370.
Framework to easily create LLM powered bots over any dataset (github.com/embedchain)
2 points
ensocode
3 years ago
discuss
371.
Show HN: A Python toolkit for working with parquet datasets on AWS (github.com/marwan116)
2 points
ortamina
3 years ago
discuss
372.
Processing large JSON datasets by streaming (github.com/kashifrazzaqui)
2 points
kashif
3 years ago
discuss
373.
RedPajama-Data: Code for preparing large datasets (github.com/togethercomputer)
2 points
harrisonpowers
3 years ago
discuss
374.
OpenFEMA Samples – Code, dataset, and analysis samples that utilize OpenFEMA API (github.com/FEMA)
2 points
mindcrime
3 years ago
discuss
375.
Open Source AI Image Classifier with Automatic Dataset Creator (github.com/serpapi)
2 points
thefoolofdaath
3 years ago
discuss
376.
Show HN: DescribeML is a VSCode language plugin to describe ML datasets (github.com/SOM-Research)
2 points
softmodeling
4 years ago
discuss
377.
Darmok and Jalad at Tanagra: Dataset and Model for English-Tamarian Translation (github.com/cognitiveailab)
2 points
darwinwhy
4 years ago
discuss
378.
SimilarVerbBank: Dataset of similar verbs formed with the Apriori algorithm (github.com/nlptechbook)
2 points
jxireal
4 years ago
discuss
379.
HuggingFace/evaluate: A library for easily evaluating ML models and datasets (github.com/huggingface)
2 points
occamschainsaw
4 years ago
discuss
380.
Open-source motion datasets collected by Bandai Namco Research (github.com/BandaiNamcoResearchInc)
2 points
nikolay
4 years ago
discuss
381.
Show HN: Bollywood Lyrics Dataset (github.com/hbdeshmukh)
2 points
hdesh
4 years ago
discuss
382.
Ivis: Dimensionality Reduction In Large Datasets Using Siamese Networks (github.com/beringresearch)
2 points
optimalsolver
4 years ago
discuss
383.
Show HN: H5records – large dataset format for deep learning (github.com/theblackcat102)
2 points
polymorph1sm
5 years ago
discuss
384.
PythonProgrammingPuzzles: A Dataset of Python Challenges for AI Research (github.com/microsoft)
2 points
lnyan
5 years ago
discuss
385.
Gretel-synthetics: open-source library to create synthetic datasets (github.com/gretelai)
2 points
meowterspace42
5 years ago
discuss
386.
AutoViz: Automatically visualize any dataset, any size with one line of code (github.com/AutoViML)
2 points
optimalsolver
5 years ago
discuss
387.
World Mortality Dataset – 2020 vs. past (github.com/akarlinsky)
2 points
puttycat
5 years ago
discuss
388.
Hypersim: A Photorealistic Synthetic Dataset for Indoor Scene Understanding (github.com/apple)
2 points
Anon84
5 years ago
discuss
389.
Witch-Trials: Datasets and Code for “Witch Trials” (Leeson and Russ 2018) (github.com/JakeRuss)
2 points
DyslexicAtheist
6 years ago
discuss
390.
Sweetviz: Visualize and compare datasets, target values and associations (github.com/fbdesignpro)
2 points
polm23
6 years ago
discuss
More