Heykuki News
Top
New
Best
Ask
Show
Jobs
481.
Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection
(github.com/DRawson5570)
2 points
drawson5570
7 months ago
discuss
482.
Recent cross-research on LLM and RL on ArXiv
(github.com/WindyLab)
2 points
Anon84
10 months ago
discuss
483.
Abusing Roku APIs
(github.com/RoseSecurity)
2 points
notmysql_
3 years ago
discuss
484.
Show HN: I've just ported the RWKV LLM to Fortran
(github.com/FortAI-Hub)
2 points
matteogrella
3 years ago
discuss
485.
Rkflashtool: Tools for Flashing Rockchip Devices
(github.com/linux-rockchip)
2 points
pabs3
5 years ago
discuss
486.
Show HN: Decentralized Reinforcement Learning with Societal Decision-Making
(github.com/mbchang)
2 points
bbdaph
6 years ago
discuss
487.
A GitHub course in reinforcement learning in the wild
(github.com/yandexdataschool)
2 points
justheuristic
8 years ago
discuss
488.
A git-course on reinforcement learning in the wild
(github.com/yandexdataschool)
2 points
sklearnman
9 years ago
discuss
489.
Show HN: A Node.js Implementation of the Rapid Automatic Keyword Extraction Algo
(github.com/waseem18)
2 points
wasim_thabraze
9 years ago
discuss
490.
Show HN: Hands-on course for building RL environments for LLMs
(github.com/anakin87)
1 point
anakin87
2 months ago
1 comment
491.
Show HN: Framework for Transferring AI Capabilities (Students Surpass Teachers)
(github.com/DRawson5570)
1 point
drawson5570
7 months ago
1 comment
492.
The open-source embodied intelligence simulation platform
(github.com/loongOpen)
1 point
OpenLoong
a year ago
1 comment
493.
MicroSafe-RL – Deterministic 1.18 µs safety layer for Edge AI on MCUs
(github.com/Kretski)
1 point
DREDREG
2 months ago
discuss
494.
MicroSafe-RL v1.0 – Sub-microsecond safety for Edge AI
(github.com/Kretski)
1 point
DREDREG
2 months ago
discuss
495.
Show HN: Modeled healthcare de-identification as longitudinal RL control problem
(github.com/azithteja91)
1 point
vkatganti
3 months ago
discuss
496.
Rkgk UI — A low latency Digital Art software on the browser
(github.com/michael-0acf4)
1 point
michael-0acf4
5 months ago
discuss
497.
Practical RL (Yandex Data School)
(github.com/yandexdataschool)
1 point
xianshou
a year ago
discuss
498.
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
(github.com/Unakar)
1 point
limoce
a year ago
discuss
499.
Suika Reinforcement Learning Environment
(github.com/edwhu)
1 point
edhu2017
2 years ago
discuss
500.
Awesome-Rl-for-Cybersecurity
(github.com/Limmen)
1 point
limmen
4 years ago
discuss
501.
Fixing a deadlock in a Common Lisp library for Kafka
(github.com/SahilKang)
1 point
sahil-kang
6 years ago
discuss
502.
Numerical Integration: RK4
(github.com/felipetavares)
1 point
felipetavares
7 years ago
discuss
503.
Show HN: Jeevan-rakht
(github.com/UdacityFrontEndScholarship)
1 point
skywalker212
8 years ago
discuss
504.
Controlling a unicycle with Policy Gradients
(github.com/pauli-space)
1 point
aidanrocke
8 years ago
discuss
505.
GitHub course of practical reinforcement learning
(github.com/yandexdataschool)
1 point
sshb
9 years ago
discuss
506.
Asynchronous RL in TensorFlow and Keras and OpenAI's Gym
(github.com/coreylynch)
1 point
mau
10 years ago
discuss
507.
Schelling's dynamic model of segregation simulated in Racket
(github.com/jmoy)
1 point
yomritoyj
11 years ago
discuss
508.
Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents
(github.com/JaredStewart)
81 points
jared_stewart
4 months ago
37 comments
509.
LLM generated parsers and compliance checkers for Sparrow DSL
3 points
melezhik
a month ago
discuss
510.
Show HN: Drone Swarm Control with RL in AirSim and SB3
(github.com/Lauqz)
2 points
Lauqz
a year ago
discuss