Search: github.com/rlk | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

511.

Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com/Danau5tin)

125 points

10 months ago

512.

Show HN: Reinforcement Learning from Scratch with TypeScript (github.com/desi-ivanov)

7 points

3 years ago

513.

TallMountain – Stoic Virtue Ethics for an LLM Agent (github.com/seamus-brady)

3 points

8 months ago

514.

Show HN: RL from Scratch (github.com/desi-ivanov)

3 points

3 years ago

515.

Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench (github.com/Danau5tin)

2 points

7 months ago

516.

MFEK GPT-3 Policy – on GPT-3 aided computer code appearing in our FOSS libraries (github.com/MFEK)

2 points

3 years ago

517.

Show HN: Recursive Language Model for Querying Human Action by Ludwig von Mises (github.com/mateolafalce)

2 points

5 months ago

518.

Sparse Predictive Hierarchies, an alternative to deep learning [pdf] (github.com/ogmacorp)

2 points

7 years ago

519.

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting (github.com/ChenRocks)

2 points

8 years ago

520.

Open source remake of Lode Runner for Roku box/tv (github.com/lvcabral)

2 points

10 years ago

521.

Ask HN: Why is this Racket code so fast?

30 points

4 years ago

522.

Show HN: CLI to Test Supabase RLS Policies (github.com/Rodrigotari1)

4 points

8 months ago

523.

Show HN: Tape/Z – a toolkit for analysing z/OS assembler (HLASM) code (github.com/avishek-sen-gupta)

2 points

a year ago

524.

Deep Reinforcement Learning in Depth in 60 Days (github.com/andri27-ts)

189 points

8 years ago

525.

Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL (github.com/avishek-sen-gupta)

91 points

2 years ago

526.

Car Reinforcement Learning Training (github.com/leesweqq)

4 points

10 months ago

527.

Master Deep Reinforcement Learning – Week 3 (github.com/andri27-ts)

4 points

8 years ago

528.

Prince of Persia Port for Roku Box and TVs (github.com/lvcabral)

3 points

10 years ago

529.

In-Context Reinforcement Learning (github.com/dunnolab)

2 points

2 years ago

530.

Typesetting.js (rlemon.github.com)

2 points

14 years ago

531.

Show HN: Retro 3000: 80s-style CLI API but with modern capabilities and easy API (github.com/sdegutis)

2 points

7 years ago

532.

Deep Reinforcement Learning in Depth Week 5 – TRPO and PPO (github.com/andri27-ts)

1 point

8 years ago

533.

Show HN: Minimalist self-hosted CI server written in Raku

6 points

2 years ago

534.

Show HN: TextPolicy – reinforcement learning for text generation on a MacBook (github.com/teilomillet)

4 points

9 months ago

535.

Show HN: Infinate – O(k) constant-time spatial attention for unlimited LLM context (github.com/ch1pu)

1 point

5 months ago

536.

Show HN: RLM-Toolkit – Secure LangChain

1 point

5 months ago

537.

Show HN: Smart glasses that tell me when to stop pouring (github.com/RealComputer)

5 points

3 months ago

538.

Show HN: Real-world speedrun timer that auto-ticks via vision on smart glasses (github.com/RealComputer)

4 points

4 months ago

539.

RL Unplugged: Benchmarks for Offline Reinforcement Learning (github.com/deepmind)

1 point

6 years ago

540.

Nuvix – open-source BaaS with a query DSL more expressive than PostgREST

2 points

3 months ago