Search: github.com/ggerganov | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

61.

Show HN: GPT-J inference on the CPU using C/C++ (github.com/ggerganov)

5 points

4 years ago

62.

Llama.cpp: SOTA 2-bit quants (github.com/ggerganov)

5 points

2 years ago

63.

Falcon 40B Working on Ggml (github.com/ggerganov)

5 points

3 years ago

64.

Whisper.cpp: Port of OpenAI's Whisper model in C/C++ (github.com/ggerganov)

5 points

4 years ago

65.

HNTERM – Browse Hacker News interactively in your terminal (github.com/ggerganov)

5 points

4 years ago

66.

ImTui: Immediate Mode Text-Based User Interface C++ Library (github.com/ggerganov)

5 points

4 years ago

67.

Real-Time Capturing Exact Keystrokes Using Sound (github.com/ggerganov)

5 points

8 years ago

68.

Llama.cpp speculative sampling: 2x faster inference for large models (github.com/ggerganov)

4 points

3 years ago

69.

Show HN: GGWave – Data over Sound for Microcontrollers (github.com/ggerganov)

4 points

4 years ago

70.

ImTui: Immediate Mode Text-Based User Interface C++ Library (github.com/ggerganov)

4 points

5 years ago

71.

Llama.cpp PR with 99% of code written by DeepSeek-R1 (github.com/ggerganov)

4 points

a year ago

72.

Wchess (github.com/ggerganov)

4 points

2 years ago

73.

Whisper.wasm (github.com/ggerganov)

4 points

3 years ago

74.

AMD ROCm Support Added to Llama.cpp (github.com/ggerganov)

4 points

3 years ago

75.

Full GPU Inference of LLaMA on Apple Silicon Using Metal (github.com/ggerganov)

4 points

3 years ago

76.

Inference at the Edge (github.com/ggerganov)

4 points

3 years ago

77.

Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++ (github.com/ggerganov)

4 points

4 years ago

78.

ImTui: Immediate Mode Text-Based User Interface Library for C++ (github.com/ggerganov)

4 points

6 years ago

79.

ImTui: Immediate mode text-based user interface library (github.com/ggerganov)

4 points

6 years ago

80.

Llama.cpp now supports tool calling (OpenAI-compatible) (github.com/ggerganov)

3 points

a year ago

81.

GGML Flash Attention support merged into llama.cpp (github.com/ggerganov)

3 points

2 years ago

82.

Show HN: GPT-2 inference on the CPU using C/C++ (github.com/ggerganov)

3 points

4 years ago

83.

Show HN: Tweet2Doom – A Twitter bot that plays Doom (github.com/ggerganov)

3 points

5 years ago

84.

Show HN: ggwave – tiny data-over-sound library (github.com/ggerganov)

3 points

5 years ago

85.

Whisper.cpp: Looking for Maintainers (github.com/ggerganov)

3 points

a year ago

86.

Distributed LLM Inference with Llama.cpp (github.com/ggerganov)

3 points

2 years ago

87.

Control Vectors have been added to llama.cpp (github.com/ggerganov)

3 points

2 years ago

88.

Llama.cpp supports distributed inference across machines on a local network (github.com/ggerganov)

3 points

2 years ago

89.

CUDA: Faster Mixtral Prompt Processing (github.com/ggerganov)

3 points

2 years ago

90.

Llama.cpp: Support for Phi-2 (github.com/ggerganov)

3 points

2 years ago