Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
241.
Reinforcement Learning Infrastructure for LLM Agents (github.com/NVIDIA-NeMo)
2 points
bakigul
6 months ago
discuss
242.
Verifiers: Environments for LLM Reinforcement Learning (github.com/PrimeIntellect-ai)
2 points
dominik-space
8 months ago
discuss
243.
AReaL, Distributed Reinforcement Learning System for LLM Reasoning (github.com/inclusionAI)
2 points
jinqueeny
a year ago
discuss
244.
AReaL: Distributed Reinforcement Learning System for LLM Reasoning (github.com/inclusionAI)
2 points
jinqueeny
a year ago
discuss
245.
In-Context Reinforcement Learning (github.com/dunnolab)
2 points
vokneruk
2 years ago
discuss
246.
Tetris Gymnasium: A customizable reinforcement learning environment for Tetris (github.com/Max-We)
2 points
mw00
2 years ago
discuss
247.
Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments (github.com/riiswa)
2 points
riiswa
2 years ago
discuss
248.
Pearl – A Production-Ready Reinforcement Learning AI Agent Library by Meta (github.com/facebookresearch)
2 points
jcater
2 years ago
discuss
249.
Why we need Reinforcement Learning for Language Model training (gist.github.com)
2 points
yamrzou
3 years ago
discuss
250.
Melting Pot: A suite of test scenarios for multi-agent reinforcement learning (github.com/deepmind)
2 points
lnyan
5 years ago
discuss
251.
Inverse Reinforcement Learning on Acrobot-v1 (github.com/Vrroom)
2 points
matroid
5 years ago
discuss
252.
DeepMimic: Motion imitation with deep reinforcement learning (github.com/xbpeng)
2 points
homarp
5 years ago
discuss
253.
Jupylet: A Jupyter extension for Reinforcement Learning experiments (github.com/nir)
2 points
cool-RR
5 years ago
discuss
254.
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading (github.com/AI4Finance-LLC)
2 points
T-A
6 years ago
discuss
255.
Carl (Car Game for Reinforcement Learning) (github.com/MatthiasSchinzel)
2 points
MadS123
6 years ago
discuss
256.
Show HN: Decentralized Reinforcement Learning with Societal Decision-Making (github.com/mbchang)
2 points
bbdaph
6 years ago
discuss
257.
Julia Reinforcement Learning Implementations (github.com/fabio-4)
2 points
fabio-4
6 years ago
discuss
258.
Minimal implementations of Reinforcement Learning algorithms (github.com/seungeunrho)
2 points
ag8
6 years ago
discuss
259.
People's Reinforcement Learning (PRL) (github.com/opium-sh)
2 points
jonbaer
6 years ago
discuss
260.
Deep-tic-tac-toe: deep reinforcement learning to play tic-tac-toe (github.com/ZackAkil)
2 points
sebg
6 years ago
discuss
261.
Open-Source Reinforcement Learning Toolkit for Card Games (github.com/datamllab)
2 points
ghgr
7 years ago
discuss
262.
OpenSpiel: A Framework for Reinforcement Learning in Games (github.com/deepmind)
2 points
jonbaer
7 years ago
discuss
263.
Bezos: Build your own Reinforcement Learning framework (github.com/justinglibert)
2 points
formalsystem
7 years ago
discuss
264.
Show HN: Bezos: build your own (deep) reinforcement learning framework (github.com/justinglibert)
2 points
glibertio
7 years ago
discuss
265.
Microsoft/TextWorld: A sandbox for training reinforcement learning (RL) agents (github.com/Microsoft)
2 points
sansnomme
7 years ago
discuss
266.
Saltie: Rocket League Distributed Deep Reinforcement Learning Bot (github.com/SaltieRL)
2 points
adamnemecek
8 years ago
discuss
267.
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting (github.com/ChenRocks)
2 points
blopeur
8 years ago
discuss
268.
TRFL: a library of reinforcement learning building blocks (github.com/deepmind)
2 points
verdverm
8 years ago
discuss
269.
Reinforcement Learning Decoders for Fault-Tolerant Quantum Computation (github.com/R-Sweke)
2 points
lainon
8 years ago
discuss
270.
Dopamine is a research framework for fast prototyping of reinforcement learning (github.com/google)
2 points
mromanuk
8 years ago
discuss
More