Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
361.
UC Berkeley's open-source Vicuna LLM chatbot released new improved model weights (huggingface.co)
26 points
covi
3 years ago
3 comments
362.
Llama can now see and run on your device – welcome Llama 3.2 (huggingface.co)
26 points
nitramm
2 years ago
1 comment
363.
DeepSeek-v3.1 (huggingface.co)
26 points
meetpateltech
10 months ago
discuss
364.
DeepSeek-v3.1-Base (huggingface.co)
25 points
meetpateltech
10 months ago
7 comments
365.
Llama 1.3B Trained on 200B Tokens for Commercial Use (huggingface.co)
25 points
vsroy
3 years ago
7 comments
366.
New Phi-3.5 Models from Microsoft, including new MoE (huggingface.co)
25 points
thecal
2 years ago
3 comments
367.
LLM: Transformer Is Linear (huggingface.co)
25 points
frednoodle
2 years ago
1 comment
368.
NousResearch/Nous-Hermes-2-Yi-34B (huggingface.co)
24 points
simonpure
2 years ago
discuss
369.
Accelerating Stable Diffusion XL Inference with Jax on Cloud TPU v5e (huggingface.co)
23 points
rayshan
3 years ago
6 comments
370.
Show HN: DALL·E mini – Generate images from text (huggingface.co)
23 points
rg111
5 years ago
4 comments
371.
HuggingFace - Tencent launches Hunyuan Large which outperforms Llama 3.1 405B (huggingface.co)
23 points
janik-io
2 years ago
1 comment
372.
DeepSeek-v3.1 (huggingface.co)
23 points
bparsons
10 months ago
discuss
373.
Mistral Small 3.2 (24B-Instruct-2506) (huggingface.co)
23 points
georgehill
a year ago
discuss
374.
Open source speech foundation model that runs locally on CPU in real-time (huggingface.co)
22 points
neuphonic
8 months ago
10 comments
375.
Lineage Explorer for open source models – Hugging Face Space (huggingface.co)
22 points
pauldowman
2 years ago
1 comment
376.
Llama 22B: 13B V2 with 33B attention heads frankensteined on (huggingface.co)
22 points
brucethemoose2
3 years ago
1 comment
377.
Show HN: Fineweb-Edu-Fortified dataset: Fineweb-Edu deduped, embeddings included (huggingface.co)
22 points
neutralino1
2 years ago
discuss
378.
Mistral-7B-OpenOrca. First 7B model to beat all other models <30B (huggingface.co)
21 points
guybedo
3 years ago
12 comments
379.
Qwen3 235B beats Claude on some code benchmarks (huggingface.co)
21 points
willahmad
a year ago
2 comments
380.
Würstchen: Fast Diffusion for Image Generation (huggingface.co)
21 points
cmitsakis
3 years ago
2 comments
381.
Kyutai 1.6B Streaming TTS (huggingface.co)
21 points
robotswantdata
a year ago
discuss
382.
Llama 3.2 (huggingface.co)
21 points
typpo
2 years ago
discuss
383.
Selene Mini: Open-sourced SOTA small language-model-as-a-judge (huggingface.co)
20 points
kaikaidai
a year ago
4 comments
384.
DeepSeek-v3.2-Speciale (huggingface.co)
20 points
b16m
6 months ago
discuss
385.
Code Generation with HuggingFace (huggingface.co)
20 points
dsr12
4 years ago
discuss
386.
Ernie-ViLG better anime quality than Stable Diffusion (huggingface.co)
19 points
avocado2
4 years ago
4 comments
387.
The smallest VLM ever: 250M parameters (huggingface.co)
19 points
pixel_art
a year ago
1 comment
388.
Fine-tune and deploy open LLMs as containers using AIKit - Part 1 (huggingface.co)
19 points
sozercan
2 years ago
1 comment
389.
makeMoE: Implement a Sparse Mixture of Experts LLM from Scratch (huggingface.co)
19 points
avi1x
2 years ago
1 comment
390.
AMD and: Large Language Models Out-of-the-Box Acceleration with AMD GPU (huggingface.co)
19 points
kristianp
2 years ago
1 comment
More