Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
481.
Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection (github.com/DRawson5570)
2 points
drawson5570
7 months ago
discuss
482.
Recent cross-research on LLM and RL on ArXiv (github.com/WindyLab)
2 points
Anon84
10 months ago
discuss
483.
Abusing Roku APIs (github.com/RoseSecurity)
2 points
notmysql_
3 years ago
discuss
484.
Show HN: I've just ported the RWKV LLM to Fortran (github.com/FortAI-Hub)
2 points
matteogrella
3 years ago
discuss
485.
Rkflashtool: Tools for Flashing Rockchip Devices (github.com/linux-rockchip)
2 points
pabs3
5 years ago
discuss
486.
Show HN: Decentralized Reinforcement Learning with Societal Decision-Making (github.com/mbchang)
2 points
bbdaph
6 years ago
discuss
487.
A GitHub course in reinforcement learning in the wild (github.com/yandexdataschool)
2 points
justheuristic
8 years ago
discuss
488.
A git-course on reinforcement learning in the wild (github.com/yandexdataschool)
2 points
sklearnman
9 years ago
discuss
489.
Show HN: A Node.js Implementation of the Rapid Automatic Keyword Extraction Algo (github.com/waseem18)
2 points
wasim_thabraze
9 years ago
discuss
490.
Show HN: Hands-on course for building RL environments for LLMs (github.com/anakin87)
1 point
anakin87
2 months ago
1 comment
491.
Show HN: Framework for Transferring AI Capabilities (Students Surpass Teachers) (github.com/DRawson5570)
1 point
drawson5570
7 months ago
1 comment
492.
The open-source embodied intelligence simulation platform (github.com/loongOpen)
1 point
OpenLoong
a year ago
1 comment
493.
MicroSafe-RL – Deterministic $1.18 \mu s$ safety layer for Edge AI on MCUs (github.com/Kretski)
1 point
DREDREG
2 months ago
discuss
494.
MicroSafe-RL v1.0 – Sub-microsecond safety for Edge AI (github.com/Kretski)
1 point
DREDREG
2 months ago
discuss
495.
Show HN: Modeled healthcare de-identification as longitudinal RL control problem (github.com/azithteja91)
1 point
vkatganti
3 months ago
discuss
496.
Rkgk UI — A low latency Digital Art software on the browser (github.com/michael-0acf4)
1 point
michael-0acf4
5 months ago
discuss
497.
Practical RL (Yandex Data School) (github.com/yandexdataschool)
1 point
xianshou
a year ago
discuss
498.
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset (github.com/Unakar)
1 point
limoce
a year ago
discuss
499.
Suika Reinforcement Learning Environment (github.com/edwhu)
1 point
edhu2017
2 years ago
discuss
500.
Awesome-Rl-for-Cybersecurity (github.com/Limmen)
1 point
limmen
4 years ago
discuss
501.
Fixing a deadlock in a Common Lisp library for Kafka (github.com/SahilKang)
1 point
sahil-kang
6 years ago
discuss
502.
Numerical Integration: RK4 (github.com/felipetavares)
1 point
felipetavares
7 years ago
discuss
503.
Show HN: Jeevan-rakht (github.com/UdacityFrontEndScholarship)
1 point
skywalker212
8 years ago
discuss
504.
Controlling a unicycle with Policy Gradients (github.com/pauli-space)
1 point
aidanrocke
8 years ago
discuss
505.
GitHub course of practical reinforcement learning (github.com/yandexdataschool)
1 point
sshb
9 years ago
discuss
506.
Asyncronous RL in Tensorflow and Keras and OpenAI's Gym (github.com/coreylynch)
1 point
mau
10 years ago
discuss
507.
Schelling's dynamic model of segregation simulated in Racket (github.com/jmoy)
1 point
yomritoyj
11 years ago
discuss
508.
Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents (github.com/JaredStewart)
81 points
jared_stewart
4 months ago
37 comments
509.
LLM generated parsers and compliance checkers for Sparrow DSL
3 points
melezhik
a month ago
discuss
510.
Show HN: Drone Swarm Control with RL in AirSim and SB3 (github.com/Lauqz)
2 points
Lauqz
a year ago
discuss
More