Heykuki News
Top
New
Best
Ask
Show
Jobs
601.
▲
Show HN: AI coding assistant that helps to build things, not to ruin them
(github.com/volotat)
2 points
volotat
10 months ago
discuss
602.
▲
AReaL, Distributed Reinforcement Learning System for LLM Reasoning
(github.com/inclusionAI)
2 points
jinqueeny
a year ago
discuss
603.
▲
AReaL: Distributed Reinforcement Learning System for LLM Reasoning
(github.com/inclusionAI)
2 points
jinqueeny
a year ago
discuss
604.
▲
In-Context Reinforcement Learning
(github.com/dunnolab)
2 points
vokneruk
2 years ago
discuss
605.
▲
Tetris Gymnasium: A customizable reinforcement learning environment for Tetris
(github.com/Max-We)
2 points
mw00
2 years ago
discuss
606.
▲
Kolmogorov-Arnold Network for Reinforcement Learning, Initial Experiments
(github.com/riiswa)
2 points
riiswa
2 years ago
discuss
607.
▲
Matrix digital rain implemented in Bash
(github.com/wick3dr0se)
2 points
thunderbong
2 years ago
discuss
608.
▲
Pydantic v2 ruined the elegance of Pydantic v1
(github.com/pydantic)
2 points
behnamoh
2 years ago
discuss
609.
▲
Ask HN: Rinf copies flutter_rust_bridge, says bridge bad, claims rinf ultimate
2 points
fzyzcjy
2 years ago
discuss
610.
▲
Pearl – A Production-Ready Reinforcement Learning AI Agent Library by Meta
(github.com/facebookresearch)
2 points
jcater
2 years ago
discuss
611.
▲
Friend: An extensible authentication and authorization library for Clojure Ring
(github.com/cemerick)
2 points
nickik
14 years ago
discuss
612.
▲
Why we need Reinforcement Learning for Language Model training
(gist.github.com)
2 points
yamrzou
3 years ago
discuss
613.
▲
Melting Pot: A suite of test scenarios for multi-agent reinforcement learning
(github.com/deepmind)
2 points
lnyan
5 years ago
discuss
614.
▲
Inverse Reinforcement Learning on Acrobot-v1
(github.com/Vrroom)
2 points
matroid
5 years ago
discuss
615.
▲
DeepMimic: Motion imitation with deep reinforcement learning
(github.com/xbpeng)
2 points
homarp
5 years ago
discuss
616.
▲
Jupylet: A Jupyter extension for Reinforcement Learning experiments
(github.com/nir)
2 points
cool-RR
5 years ago
discuss
617.
▲
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading
(github.com/AI4Finance-LLC)
2 points
T-A
6 years ago
discuss
618.
▲
Carl (Car Game for Reinforcement Learning)
(github.com/MatthiasSchinzel)
2 points
MadS123
6 years ago
discuss
619.
▲
Show HN: Decentralized Reinforcement Learning with Societal Decision-Making
(github.com/mbchang)
2 points
bbdaph
6 years ago
discuss
620.
▲
Julia Reinforcement Learning Implementations
(github.com/fabio-4)
2 points
fabio-4
6 years ago
discuss
621.
▲
Making rxi's lite my main text editor
(github.com/a327ex)
2 points
dsego
6 years ago
discuss
622.
▲
Minimal implementations of Reinforcement Learning algorithms
(github.com/seungeunrho)
2 points
ag8
6 years ago
discuss
623.
▲
People's Reinforcement Learning (PRL)
(github.com/opium-sh)
2 points
jonbaer
6 years ago
discuss
624.
▲
Deep-tic-tac-toe: deep reinforcement learning to play tic-tac-toe
(github.com/ZackAkil)
2 points
sebg
6 years ago
discuss
625.
▲
Show HN: Lock Free MRMW Ring in Go
(github.com/mitghi)
2 points
mitghi
6 years ago
discuss
626.
▲
Open-Source Reinforcement Learning Toolkit for Card Games
(github.com/datamllab)
2 points
ghgr
7 years ago
discuss
627.
▲
Rainbow Color Map Still Considered Harmful (2007) [pdf]
(github.com/djoshea)
2 points
Tomte
7 years ago
discuss
628.
▲
Atomic_ring: A C++ template for a quasi-lock-free ring implementation
(github.com/naver)
2 points
clauderoux
7 years ago
discuss
629.
▲
OpenSpiel: A Framework for Reinforcement Learning in Games
(github.com/deepmind)
2 points
jonbaer
7 years ago
discuss
630.
▲
Bezos: Build your own Reinforcement Learning framework
(github.com/justinglibert)
2 points
formalsystem
7 years ago
discuss