Heykuki News

TopNewBestAskShowJobs
511.
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com/Danau5tin)
125 points
Danau5tin
10 months ago
12 comments
512.
Show HN: Reinforcement Learning from Scratch with TypeScript (github.com/desi-ivanov)
7 points
evolveyourmind
3 years ago
discuss
513.
TallMountain – Stoic Virtue Ethics for an LLM Agent (github.com/seamus-brady)
3 points
s_brady
8 months ago
6 comments
514.
Show HN: RL from Scratch (github.com/desi-ivanov)
3 points
evolveyourmind
3 years ago
discuss
515.
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench (github.com/Danau5tin)
2 points
Danau5tin
7 months ago
1 comment
516.
MFEK GPT-3 Policy – on GPT-3 aided computer code appearing in our FOSS libraries (github.com/MFEK)
2 points
kopipe
3 years ago
1 comment
517.
Show HN: Recursive Language Model for Querying Human Action by Ludwig von Mises (github.com/mateolafalce)
2 points
lafalce
5 months ago
discuss
518.
Sparse Predictive Hierarchies, an alternative to deep learning [pdf] (github.com/ogmacorp)
2 points
craigjb
7 years ago
discuss
519.
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting (github.com/ChenRocks)
2 points
blopeur
8 years ago
discuss
520.
Open source remake of Lode Runner for Roku box/tv (github.com/lvcabral)
2 points
lvcabral
10 years ago
discuss
521.
Ask HN: Why is this Racket code so fast?
30 points
exdsq
4 years ago
27 comments
522.
Show HN: CLI to Test Supabase RLS Policies (github.com/Rodrigotari1)
4 points
rodrigotarca
8 months ago
4 comments
523.
Show HN: Tape/Z – a toolkit for analysing z/OS assembler (HLASM) code (github.com/avishek-sen-gupta)
2 points
armorer
a year ago
discuss
524.
Deep Reinforcement Learning in Depth in 60 Days (github.com/andri27-ts)
189 points
andri27
8 years ago
19 comments
525.
Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL (github.com/avishek-sen-gupta)
91 points
armorer
2 years ago
49 comments
526.
Car Reinforcement Learning Training (github.com/leesweqq)
4 points
kyleliiii
10 months ago
1 comment
527.
Master Deep Reinforcement Learning – Week 3 (github.com/andri27-ts)
4 points
andri27
8 years ago
discuss
528.
Prince of Persia Port for Roku Box and TVs (github.com/lvcabral)
3 points
lvcabral
10 years ago
discuss
529.
In-Context Reinforcement Learning (github.com/dunnolab)
2 points
vokneruk
2 years ago
discuss
530.
Typesetting.js (rlemon.github.com)
2 points
jrgifford
14 years ago
discuss
531.
Show HN: Retro 3000: 80s-style CLI API but with modern capabilities and easy API (github.com/sdegutis)
2 points
sdegutis
7 years ago
discuss
532.
Deep Reinforcement Learning in Depth Week 5 – TRPO and PPO (github.com/andri27-ts)
1 point
andri27
8 years ago
discuss
533.
Show HN: Minimalist self-hosted CI server written on Raku
6 points
melezhik
2 years ago
1 comment
534.
Show HN: TextPolicy – reinforcement learning for text generation on a MacBook (github.com/teilomillet)
4 points
teilom
9 months ago
discuss
535.
Show HN: Infinate – O(k) constant-time spatial attention for unlimited LLM context (github.com/ch1pu)
1 point
ch1pu
5 months ago
discuss
536.
Show HN: RLM-Toolkit – Secure LangChain
1 point
Chgdz
5 months ago
discuss
537.
Show HN: Smart glasses that tell me when to stop pouring (github.com/RealComputer)
5 points
tash_2s
3 months ago
7 comments
538.
Show HN: Real-world speedrun timer that auto-ticks via vision on smart glasses (github.com/RealComputer)
4 points
tash_2s
4 months ago
3 comments
539.
RL Unplugged: Benchmarks for Offline Reinforcement Learning (github.com/deepmind)
1 point
MindGods
6 years ago
discuss
540.
Nuvix – open-source BaaS with a query DSL more expressive than PostgREST
2 points
ravikantsaini
3 months ago
discuss