Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
841.
▲
LLM quantization severely damages model quality and perplexity
(github.com/ggerganov)
2 points
behnamoh
3 years ago
3 comments
842.
▲
Llama.cpp with CUDA Support on Original Jetson Nano (4GB)
(github.com/kreier)
2 points
Abishek_Muthian
2 months ago
2 comments
843.
▲
LLaMA Terminal Completion, a local virtual assistant for the terminal
(github.com/adammpkins)
2 points
adammpkins
3 years ago
2 comments
844.
▲
Show HN: Liteparse, an OSS universal fast document parser by LlamaParse team
(github.com/run-llama)
2 points
pierre
3 months ago
1 comment
845.
▲
How to verify that a snippet of Python code doesn't access protected members
(github.com/run-llama)
2 points
tslmy
2 years ago
1 comment
846.
▲
Finetune LLaMa2 for Any Language
(github.com/UnderstandLingBV)
2 points
UnderstandLing
2 years ago
1 comment
847.
▲
Llama2.mojo - outperforms Karpathy’s llama2.c by 30% in multi-threaded inference
(github.com/tairov)
2 points
swyx
3 years ago
1 comment
848.
▲
Show HN: Llama2.f90 – Toy LLaMA2 model inference in Fortran
(github.com/rbitr)
2 points
andy99
3 years ago
1 comment
849.
▲
Python bindings (and OpenAI API compatible server) for llama.cpp
(github.com/abetlen)
2 points
tosh
3 years ago
1 comment
850.
▲
Llama.swift
(github.com/alexrozanski)
2 points
alexrozanski
3 years ago
1 comment
851.
▲
Show HN: Llamactl – Self-hosted LLM manager with OpenAI-compatible routing
(github.com/lordmathis)
2 points
lordmathis
3 months ago
discuss
852.
▲
Llama-swap: Reliable model swapping
(github.com/mostlygeek)
2 points
hbcondo714
3 months ago
discuss
853.
▲
Show HN: Llamada – minimalist toolkit to define functions with prompts
(github.com/blaesus)
2 points
blaesus
7 months ago
discuss
854.
▲
WebAssembly binding for llama.cpp – Enabling on-browser LLM inference
(github.com/ngxson)
2 points
selvan
a year ago
discuss
855.
▲
Llama2.c on the Commodore C64
(github.com/ytmytm)
2 points
radeeyate
a year ago
discuss
856.
▲
llama2psp: llama2.c running on the PSP
(github.com/sixf0ur)
2 points
pizza
2 years ago
discuss
857.
▲
Llama.vim: Plugin for Neovim
(github.com/ggerganov)
2 points
mariuz
2 years ago
discuss
858.
▲
Llama.vim: Plugin for Neovim
(github.com/ggerganov)
2 points
ibobev
2 years ago
discuss
859.
▲
Meta-Llama/Llama-stack: Model components of the Llama Stack APIs
(github.com/meta-llama)
2 points
swyx
2 years ago
discuss
860.
▲
Model Components of the Llama Stack APIs
(github.com/meta-llama)
2 points
mmq
2 years ago
discuss
861.
▲
Character Spacing Bypass in Prompt-Guard-86M Classifier
(github.com/meta-llama)
2 points
edent
2 years ago
discuss
862.
▲
Meta's Agentic System: Multistep reasoning,websearch,code interpreter for Llama3
(github.com/meta-llama)
2 points
ignoramous
2 years ago
discuss
863.
▲
Llama-Factory: A WebUI for Efficient Fine-Tuning of 100 LLMs
(github.com/hiyouga)
2 points
ulrischa
2 years ago
discuss
864.
▲
Llama_cpp provides Ruby bindings for llama.cpp
(github.com/yoshoku)
2 points
thunderbong
2 years ago
discuss
865.
▲
Attention and final logit soft-capping, update scaling factor to Gemma2
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
866.
▲
file-organizer
(github.com/run-llama)
2 points
tosh
2 years ago
discuss
867.
▲
Wllama: WebAssembly Binding for Llama.cpp
(github.com/ngxson)
2 points
tosh
2 years ago
discuss
868.
▲
Llama3 Inference in Pure Java
(github.com/mukel)
2 points
mikepapadim
2 years ago
discuss
869.
▲
ggml: Add Flash Attention
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
870.
▲
llama.cpp bfloat16 support
(github.com/ggerganov)
2 points
indigodaddy
2 years ago
discuss
More