Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
61.
An analysis of 7,020,950 NFT transactions on the Ethereum blockchain [pdf] (github.com/bugout-dev)
4 points
zomglings
5 years ago
2 comments
62.
Public Real-Time Datasets and Sources (github.com/bytewax)
4 points
skadamat
3 years ago
discuss
63.
Show HN: UK Government Datasets (github.com/i-dot-ai)
2 points
crimsoneer
a year ago
discuss
64.
An analysis of 7M NFT transactions on the Ethereum blockchain [pdf] (github.com/bugout-dev)
1 point
mpaepper
5 years ago
discuss
65.
Launch HN: Activeloop (YC S18) – Data lake for deep learning
64 points
davidbuniat
4 years ago
24 comments
66.
InfoSeek: The First Open-Source Framework for Deep Research Data Synthesis
2 points
BAAIBeijing
9 months ago
1 comment
67.
Satellite Image Time Series Datasets (github.com/corentin-dfg)
2 points
sebg
3 years ago
discuss
68.
Chinese Language Corpora for Sentiment Analysis (github.com/Lab41)
1 point
ghosthamlet
8 years ago
discuss
69.
Visualizations for machine learning datasets (github.com/PAIR-code)
178 points
happy-go-lucky
9 years ago
7 comments
70.
Show HN: Dlt – Python library to automate the creation of datasets (colab.research.google.com)
114 points
MatthausK
3 years ago
54 comments
71.
RipTable – multi-threaded Python data analytics tools for numpy arrays/datasets (github.com/rtosholdings)
79 points
aldanor
6 years ago
14 comments
72.
Show HN: Hyperparam: OSS tools for exploring datasets locally in the browser (hyperparam.app)
77 points
platypii
a year ago
21 comments
73.
How to query data.gov json datasets with SQL: a case study (github.com/axibase)
68 points
rodionos
9 years ago
1 comment
74.
Datasets for Reconstructing Visual Perception from Brain Data (github.com/seelikat)
62 points
katsee
3 months ago
16 comments
75.
Show HN: I made this tool for navigating pandas datasets (github.com/man-group)
20 points
leehcksource
6 years ago
discuss
76.
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets (github.com/MinishLab)
19 points
Pringled
a year ago
6 comments
77.
Show HN: Version code, models, & datasets together in GitHub
19 points
skadamat
3 years ago
6 comments
78.
NLP: A new datasets and metrics library from Hugging Face (github.com/huggingface)
19 points
julien_c
6 years ago
discuss
79.
GitHub: Awesome-reasoning, a curated list of datasets for reasoning AIs (github.com/neurallambda)
17 points
neurallambda
2 years ago
discuss
80.
Datasetq: jq for Datasets; Polars-powered Parquet/JSON/CSV query lang/cli (github.com/datasetq)
15 points
djb-at-durable
6 months ago
2 comments
81.
Easy way to load, create, version, query and visualize computer vision datasets
13 points
morpheusme
4 years ago
discuss
82.
Show HN: Create datasets more simply and improve AI model with unstructured data (github.com/adansons)
12 points
KenichiHiguchi
4 years ago
3 comments
83.
Show HN: Download HuggingFace Models/Datasets easily and super fast (github.com/bodaay)
10 points
qqqbodaayqqq
3 years ago
2 comments
84.
Show HN: Training synthetic models on highly complex datasets (github.com/gretelai)
10 points
repeat_or
4 years ago
2 comments
85.
Show HN: React-like Declarative DSL for building synthetic LLM datasets (github.com/qforge-dev)
10 points
arturwala
7 months ago
discuss
86.
Kangas: Explore Multimedia Datasets at Scale (github.com/comet-ml)
9 points
dmoura
4 years ago
2 comments
87.
Nvidia open sources the synthetic data framework used to build Nemotron datasets
8 points
alexwatson405
6 months ago
1 comment
88.
Open Thoughts: Curating the best reasoning datasets (github.com/open-thoughts)
8 points
madiator
a year ago
discuss
89.
Show HN: Automate Variable Selection for Research on Big Datasets (Open-Source) (github.com/MalikHarrisAhm)
8 points
mha23
2 years ago
discuss
90.
Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets (github.com/LinearBoost)
6 points
hamid9
2 years ago
5 comments
More