Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Implementing DeepSeek R1's GRPO algorithm from scratch (github.com/policy-gradient)
192 points
xcodevn
a year ago
3 comments
2.
A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE) (github.com/zafstojano)
1 point
starzmustdie
5 months ago
discuss
3.
Experimenting with policy gradient methods in Jax (github.com/elliotvilhelm)
2 points
monadicmonad
a year ago
discuss
4.
OpenAi Gym: Policy Gradient (github.com/Mortiniera)
2 points
mortinie
7 years ago
discuss
5.
Multi-Agent Deep Deterministic Policy Gradient (github.com/openai)
2 points
stablemap
8 years ago
discuss
6.
Controlling a unicycle with Policy Gradients (github.com/pauli-space)
1 point
aidanrocke
8 years ago
discuss
7.
AI and Games
3 points
shehabyasser
9 months ago
discuss
8.
Show HN: Qantify – GPU-Accelerated Trading Library with Advanced Math and AutoML (github.com/Alradyin)
1 point
Alradyin
7 months ago
discuss