Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
361.
▲
UC Berkeley's open-source Vicuna LLM chatbot released new improved model weights
(huggingface.co)
26 points
covi
3 years ago
3 comments
362.
▲
Llama can now see and run on your device – welcome Llama 3.2
(huggingface.co)
26 points
nitramm
2 years ago
1 comment
363.
▲
DeepSeek-v3.1
(huggingface.co)
26 points
meetpateltech
10 months ago
discuss
364.
▲
DeepSeek-v3.1-Base
(huggingface.co)
25 points
meetpateltech
10 months ago
7 comments
365.
▲
Llama 1.3B Trained on 200B Tokens for Commercial Use
(huggingface.co)
25 points
vsroy
3 years ago
7 comments
366.
▲
New Phi-3.5 Models from Microsoft, including new MoE
(huggingface.co)
25 points
thecal
2 years ago
3 comments
367.
▲
LLM: Transformer Is Linear
(huggingface.co)
25 points
frednoodle
2 years ago
1 comment
368.
▲
NousResearch/Nous-Hermes-2-Yi-34B
(huggingface.co)
24 points
simonpure
2 years ago
discuss
369.
▲
Accelerating Stable Diffusion XL Inference with Jax on Cloud TPU v5e
(huggingface.co)
23 points
rayshan
3 years ago
6 comments
370.
▲
Show HN: DALL·E mini – Generate images from text
(huggingface.co)
23 points
rg111
5 years ago
4 comments
371.
▲
HuggingFace - Tencent launches Hunyuan Large which outperforms Llama 3.1 405B
(huggingface.co)
23 points
janik-io
2 years ago
1 comment
372.
▲
DeepSeek-v3.1
(huggingface.co)
23 points
bparsons
10 months ago
discuss
373.
▲
Mistral Small 3.2 (24B-Instruct-2506)
(huggingface.co)
23 points
georgehill
a year ago
discuss
374.
▲
Open source speech foundation model that runs locally on CPU in real-time
(huggingface.co)
22 points
neuphonic
8 months ago
10 comments
375.
▲
Lineage Explorer for open source models – Hugging Face Space
(huggingface.co)
22 points
pauldowman
2 years ago
1 comment
376.
▲
Llama 22B: 13B V2 with 33B attention heads frankensteined on
(huggingface.co)
22 points
brucethemoose2
3 years ago
1 comment
377.
▲
Show HN: Fineweb-Edu-Fortified dataset: Fineweb-Edu deduped, embeddings included
(huggingface.co)
22 points
neutralino1
2 years ago
discuss
378.
▲
Mistral-7B-OpenOrca. First 7B model to beat all other models <30B
(huggingface.co)
21 points
guybedo
3 years ago
12 comments
379.
▲
Qwen3 235B beats Claude on some code benchmarks
(huggingface.co)
21 points
willahmad
a year ago
2 comments
380.
▲
Würstchen: Fast Diffusion for Image Generation
(huggingface.co)
21 points
cmitsakis
3 years ago
2 comments
381.
▲
Kyutai 1.6B Streaming TTS
(huggingface.co)
21 points
robotswantdata
a year ago
discuss
382.
▲
Llama 3.2
(huggingface.co)
21 points
typpo
2 years ago
discuss
383.
▲
Selene Mini: Open-sourced SOTA small language-model-as-a-judge
(huggingface.co)
20 points
kaikaidai
a year ago
4 comments
384.
▲
DeepSeek-v3.2-Speciale
(huggingface.co)
20 points
b16m
6 months ago
discuss
385.
▲
Code Generation with HuggingFace
(huggingface.co)
20 points
dsr12
4 years ago
discuss
386.
▲
Ernie-ViLG better anime quality than Stable Diffusion
(huggingface.co)
19 points
avocado2
4 years ago
4 comments
387.
▲
The smallest VLM ever: 250M parameters
(huggingface.co)
19 points
pixel_art
a year ago
1 comment
388.
▲
Fine-tune and deploy open LLMs as containers using AIKit - Part 1
(huggingface.co)
19 points
sozercan
2 years ago
1 comment
389.
▲
makeMoE: Implement a Sparse Mixture of Experts LLM from Scratch
(huggingface.co)
19 points
avi1x
2 years ago
1 comment
390.
▲
AMD and: Large Language Models Out-of-the-Box Acceleration with AMD GPU
(huggingface.co)
19 points
kristianp
2 years ago
1 comment
More