Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
511.
▲
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL
(github.com/Danau5tin)
125 points
Danau5tin
10 months ago
12 comments
512.
▲
Show HN: Reinforcement Learning from Scratch with TypeScript
(github.com/desi-ivanov)
7 points
evolveyourmind
3 years ago
discuss
513.
▲
TallMountain – Stoic Virtue Ethics for an LLM Agent
(github.com/seamus-brady)
3 points
s_brady
8 months ago
6 comments
514.
▲
Show HN: RL from Scratch
(github.com/desi-ivanov)
3 points
evolveyourmind
3 years ago
discuss
515.
▲
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
(github.com/Danau5tin)
2 points
Danau5tin
7 months ago
1 comment
516.
▲
MFEK GPT-3 Policy – on GPT-3 aided computer code appearing in our FOSS libraries
(github.com/MFEK)
2 points
kopipe
3 years ago
1 comment
517.
▲
Show HN: Recursive Language Model for Querying Human Action by Ludwig von Mises
(github.com/mateolafalce)
2 points
lafalce
5 months ago
discuss
518.
▲
Sparse Predictive Hierarchies, an alternative to deep learning [pdf]
(github.com/ogmacorp)
2 points
craigjb
7 years ago
discuss
519.
▲
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
(github.com/ChenRocks)
2 points
blopeur
8 years ago
discuss
520.
▲
Open source remake of Lode Runner for Roku box/tv
(github.com/lvcabral)
2 points
lvcabral
10 years ago
discuss
521.
▲
Ask HN: Why is this Racket code so fast?
30 points
exdsq
4 years ago
27 comments
522.
▲
Show HN: CLI to Test Supabase RLS Policies
(github.com/Rodrigotari1)
4 points
rodrigotarca
8 months ago
4 comments
523.
▲
Show HN: Tape/Z – a toolkit for analysing z/OS assembler (HLASM) code
(github.com/avishek-sen-gupta)
2 points
armorer
a year ago
discuss
524.
▲
Deep Reinforcement Learning in Depth in 60 Days
(github.com/andri27-ts)
189 points
andri27
8 years ago
19 comments
525.
▲
Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL
(github.com/avishek-sen-gupta)
91 points
armorer
2 years ago
49 comments
526.
▲
Car Reinforcement Learning Training
(github.com/leesweqq)
4 points
kyleliiii
10 months ago
1 comment
527.
▲
Master Deep Reinforcement Learning – Week 3
(github.com/andri27-ts)
4 points
andri27
8 years ago
discuss
528.
▲
Prince of Persia Port for Roku Box and TVs
(github.com/lvcabral)
3 points
lvcabral
10 years ago
discuss
529.
▲
In-Context Reinforcement Learning
(github.com/dunnolab)
2 points
vokneruk
2 years ago
discuss
530.
▲
Typesetting.js
(rlemon.github.com)
2 points
jrgifford
14 years ago
discuss
531.
▲
Show HN: Retro 3000: 80s-style CLI API but with modern capabilities and easy API
(github.com/sdegutis)
2 points
sdegutis
7 years ago
discuss
532.
▲
Deep Reinforcement Learning in Depth Week 5 – TRPO and PPO
(github.com/andri27-ts)
1 point
andri27
8 years ago
discuss
533.
▲
Show HN: Minimalist self-hosted CI server written on Raku
6 points
melezhik
2 years ago
1 comment
534.
▲
Show HN: TextPolicy – reinforcement learning for text generation on a MacBook
(github.com/teilomillet)
4 points
teilom
9 months ago
discuss
535.
▲
Show HN: Infinate –O(k)constant-time spatial attention for unlimited LLM context
(github.com/ch1pu)
1 point
ch1pu
5 months ago
discuss
536.
▲
Show HN: RLM-Toolkit – Secure LangChain
1 point
Chgdz
5 months ago
discuss
537.
▲
Show HN: Smart glasses that tell me when to stop pouring
(github.com/RealComputer)
5 points
tash_2s
3 months ago
7 comments
538.
▲
Show HN: Real-world speedrun timer that auto-ticks via vision on smart glasses
(github.com/RealComputer)
4 points
tash_2s
4 months ago
3 comments
539.
▲
RL Unplugged: Benchmarks for Offline Reinforcement Learning
(github.com/deepmind)
1 point
MindGods
6 years ago
discuss
540.
▲
Nuvix – open-source BaaS with a query DSL more expressive than PostgREST
2 points
ravikantsaini
3 months ago
discuss
More