Search: github.com/reinh | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

241.

Reinforcement Learning Infrastructure for LLM Agents (github.com/NVIDIA-NeMo)

2 points

6 months ago

242.

Verifiers: Environments for LLM Reinforcement Learning (github.com/PrimeIntellect-ai)

2 points

8 months ago

243.

AReaL, Distributed Reinforcement Learning System for LLM Reasoning (github.com/inclusionAI)

2 points

a year ago

244.

AReaL: Distributed Reinforcement Learning System for LLM Reasoning (github.com/inclusionAI)

2 points

a year ago

245.

In-Context Reinforcement Learning (github.com/dunnolab)

2 points

2 years ago

246.

Tetris Gymnasium: A customizable reinforcement learning environment for Tetris (github.com/Max-We)

2 points

2 years ago

247.

Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments (github.com/riiswa)

2 points

2 years ago

248.

Pearl – A Production-Ready Reinforcement Learning AI Agent Library by Meta (github.com/facebookresearch)

2 points

2 years ago

249.

Why we need Reinforcement Learning for Language Model training (gist.github.com)

2 points

3 years ago

250.

Melting Pot: A suite of test scenarios for multi-agent reinforcement learning (github.com/deepmind)

2 points

5 years ago

251.

Inverse Reinforcement Learning on Acrobot-v1 (github.com/Vrroom)

2 points

5 years ago

252.

DeepMimic: Motion imitation with deep reinforcement learning (github.com/xbpeng)

2 points

5 years ago

253.

Jupylet: A Jupyter extension for Reinforcement Learning experiments (github.com/nir)

2 points

5 years ago

254.

FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading (github.com/AI4Finance-LLC)

2 points

6 years ago

255.

Carl (Car Game for Reinforcement Learning) (github.com/MatthiasSchinzel)

2 points

6 years ago

256.

Show HN: Decentralized Reinforcement Learning with Societal Decision-Making (github.com/mbchang)

2 points

6 years ago

257.

Julia Reinforcement Learning Implementations (github.com/fabio-4)

2 points

6 years ago

258.

Minimal implementations of Reinforcement Learning algorithms (github.com/seungeunrho)

2 points

6 years ago

259.

People's Reinforcement Learning (PRL) (github.com/opium-sh)

2 points

6 years ago

260.

Deep-tic-tac-toe: deep reinforcement learning to play tic-tac-toe (github.com/ZackAkil)

2 points

6 years ago

261.

Open-Source Reinforcement Learning Toolkit for Card Games (github.com/datamllab)

2 points

7 years ago

262.

OpenSpiel: A Framework for Reinforcement Learning in Games (github.com/deepmind)

2 points

7 years ago

263.

Bezos: Build your own Reinforcement Learning framework (github.com/justinglibert)

2 points

7 years ago

264.

Show HN: Bezos: build your own (deep) reinforcement learning framework (github.com/justinglibert)

2 points

7 years ago

265.

Microsoft/TextWorld: A sandbox for training reinforcement learning (RL) agents (github.com/Microsoft)

2 points

7 years ago

266.

Saltie: Rocket League Distributed Deep Reinforcement Learning Bot (github.com/SaltieRL)

2 points

8 years ago

267.

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting (github.com/ChenRocks)

2 points

8 years ago

268.

TRFL: a library of reinforcement learning building blocks (github.com/deepmind)

2 points

8 years ago

269.

Reinforcement Learning Decoders for Fault-Tolerant Quantum Computation (github.com/R-Sweke)

2 points

8 years ago

270.

Dopamine is a research framework for fast prototyping of reinforcement learning (github.com/google)

2 points

8 years ago