Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Implementing DeepSeek R1's GRPO algorithm from scratch
(github.com/policy-gradient)
192 points
xcodevn
a year ago
3 comments
2.
▲
A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE)
(github.com/zafstojano)
1 point
starzmustdie
5 months ago
discuss
3.
▲
Experimenting with policy gradient methods in Jax
(github.com/elliotvilhelm)
2 points
monadicmonad
a year ago
discuss
4.
▲
OpenAi Gym: Policy Gradient
(github.com/Mortiniera)
2 points
mortinie
7 years ago
discuss
5.
▲
Multi-Agent Deep Deterministic Policy Gradient
(github.com/openai)
2 points
stablemap
8 years ago
discuss
6.
▲
Controlling a unicycle with Policy Gradients
(github.com/pauli-space)
1 point
aidanrocke
8 years ago
discuss
7.
▲
AI and Games
3 points
shehabyasser
9 months ago
discuss
8.
▲
Show HN: Qantify – GPU-Accelerated Trading Library with Advanced Math and AutoML
(github.com/Alradyin)
1 point
Alradyin
7 months ago
discuss