Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
31.
▲
Show HN: LLM Thematic Generalization Benchmark
(github.com/lechmazur)
6 points
zone411
a year ago
discuss
32.
▲
LLM Confabulation (Hallucination) Leaderboard
(github.com/lechmazur)
6 points
zone411
2 years ago
discuss
33.
▲
Elimination Game: Multi-Agent LLM Social Reasoning, Strategy, and Deception
(github.com/lechmazur)
5 points
zone411
a year ago
discuss
34.
▲
Show HN: LLM Creative Story-Writing Benchmark
(github.com/lechmazur)
5 points
zone411
a year ago
discuss
35.
▲
RootAsRole: A secure alternative to sudo/su using principle of least privilege
(github.com/LeChatP)
5 points
sbt567
3 years ago
discuss
36.
▲
Show HN: LLM Sycophancy Benchmark: Opposite-Narrator Contradictions
(github.com/lechmazur)
3 points
zone411
3 months ago
discuss
37.
▲
Freebind: An IPv6 address rate limiting evasion tool (that also supports IPv4)
(github.com/blechschmidt)
3 points
ocean_moist
2 years ago
discuss
38.
▲
Pix2tex – LaTeX OCR
(github.com/lukas-blecher)
3 points
marcodiego
4 years ago
discuss
39.
▲
Minimal and beautiful theme for the GNOME Desktop Environment
(github.com/hdni)
2 points
grigio
12 years ago
2 comments
40.
▲
RootAsRole – A better alternative to sudo(-rs)/su
(github.com/LeChatP)
2 points
p4bl0
a month ago
discuss
41.
▲
Show HN: A vibe-coded low-level PKCS#11 Terraform provider
(github.com/blechschmidt)
2 points
blechschmidt
4 months ago
discuss
42.
▲
Elimination Game Benchmark: Social Reasoning, Strategy, and Deception in LLMs
(github.com/lechmazur)
2 points
amichail
a year ago
discuss
43.
▲
Step-Game: Assessing LLM Collaboration and Deception Under Pressure
(github.com/lechmazur)
2 points
amichail
a year ago
discuss
44.
▲
OpenUDID - Opensourced UDID replacement
(github.com/ylechelle)
2 points
BenSS
15 years ago
discuss
45.
▲
Accurately calculating the number of legal chess positions
(github.com/lechmazur)
2 points
slyall
5 years ago
discuss
46.
▲
LLM Position Bias Benchmark: Swapped-Order Pairwise Judging
(github.com/lechmazur)
1 point
zone411
a month ago
discuss
47.
▲
Benchmark that evaluates LLMs using 759 NYT Connections puzzles
(github.com/lechmazur)
1 point
ShrugLife
6 months ago
discuss
48.
▲
NYT Connections LLM Benchmark
(github.com/lechmazur)
1 point
cainxinth
6 months ago
discuss
49.
▲
Show HN: LLM UI Challenge
(github.com/alechewitt)
1 point
alechewitt
6 months ago
discuss
50.
▲
Mathias Lechner
(github.com)
1 point
rolph
2 years ago
discuss
51.
▲
FizzBuzzLang – the programming language no one asked for
(github.com/lechien73)
1 point
shever
6 years ago
discuss
52.
▲
Logo comment – generate logos for source code comments
(github.com/alechewitt)
1 point
alechewitt1
11 years ago
discuss
53.
▲
Clifm 1.17 (Lechuck) is out!
(github.com/leo-arch)
1 point
archcrack
2 years ago
1 comment
54.
▲
Show HN: Open-source Canadian COVID-19 bot
6 points
lecha
6 years ago
1 comment
55.
▲
PINCE – A GDB front-end/reverse engineering tool focused on games
(github.com/korcankaraokcu)
93 points
blechschmidt
10 years ago
3 comments
56.
▲
Replay server responses from a HAR file
(github.com/Stuk)
7 points
lechevalierd3on
11 years ago
discuss
57.
▲
Show HN: Rpmrepo-update – Incremental RPM repo updates for S3 (no full sync)
(github.com/e2llm)
2 points
Alechko
5 months ago
1 comment
58.
▲
Curated LLM prompts for debugging with runtime DOM snapshots
(github.com/e2llm)
2 points
Alechko
9 months ago
1 comment
59.
▲
Show HN: MedSynth – Multi-lingual synthetic healthcare data with OCR artifacts
(github.com/e2llm)
1 point
Alechko
4 months ago
1 comment
60.
▲
Yelp love
(github.com/yelp)
1 point
lechevalierd3on
9 years ago
discuss
More