Search: github.com/rlk | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

481.

Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection (github.com/DRawson5570)

2 points

7 months ago

482.

Recent cross-research on LLM and RL on ArXiv (github.com/WindyLab)

2 points

10 months ago

483.

Abusing Roku APIs (github.com/RoseSecurity)

2 points

3 years ago

484.

Show HN: I've just ported the RWKV LLM to Fortran (github.com/FortAI-Hub)

2 points

3 years ago

485.

Rkflashtool: Tools for Flashing Rockchip Devices (github.com/linux-rockchip)

2 points

5 years ago

486.

Show HN: Decentralized Reinforcement Learning with Societal Decision-Making (github.com/mbchang)

2 points

6 years ago

487.

A GitHub course in reinforcement learning in the wild (github.com/yandexdataschool)

2 points

8 years ago

488.

A git-course on reinforcement learning in the wild (github.com/yandexdataschool)

2 points

9 years ago

489.

Show HN: A Node.js Implementation of the Rapid Automatic Keyword Extraction Algo (github.com/waseem18)

2 points

9 years ago

490.

Show HN: Hands-on course for building RL environments for LLMs (github.com/anakin87)

1 point

2 months ago

491.

Show HN: Framework for Transferring AI Capabilities (Students Surpass Teachers) (github.com/DRawson5570)

1 point

7 months ago

492.

The open-source embodied intelligence simulation platform (github.com/loongOpen)

1 point

a year ago

493.

MicroSafe-RL – Deterministic $1.18 \mu s$ safety layer for Edge AI on MCUs (github.com/Kretski)

1 point

2 months ago

494.

MicroSafe-RL v1.0 – Sub-microsecond safety for Edge AI (github.com/Kretski)

1 point

2 months ago

495.

Show HN: Modeled healthcare de-identification as longitudinal RL control problem (github.com/azithteja91)

1 point

3 months ago

496.

Rkgk UI — A low latency Digital Art software on the browser (github.com/michael-0acf4)

1 point

5 months ago

497.

Practical RL (Yandex Data School) (github.com/yandexdataschool)

1 point

a year ago

498.

Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset (github.com/Unakar)

1 point

a year ago

499.

Suika Reinforcement Learning Environment (github.com/edwhu)

1 point

2 years ago

500.

Awesome-Rl-for-Cybersecurity (github.com/Limmen)

1 point

4 years ago

501.

Fixing a deadlock in a Common Lisp library for Kafka (github.com/SahilKang)

1 point

6 years ago

502.

Numerical Integration: RK4 (github.com/felipetavares)

1 point

7 years ago

503.

Show HN: Jeevan-rakht (github.com/UdacityFrontEndScholarship)

1 point

8 years ago

504.

Controlling a unicycle with Policy Gradients (github.com/pauli-space)

1 point

8 years ago

505.

GitHub course of practical reinforcement learning (github.com/yandexdataschool)

1 point

9 years ago

506.

Asyncronous RL in Tensorflow and Keras and OpenAI's Gym (github.com/coreylynch)

1 point

10 years ago

507.

Schelling's dynamic model of segregation simulated in Racket (github.com/jmoy)

1 point

11 years ago

508.

Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents (github.com/JaredStewart)

81 points

4 months ago

509.

LLM generated parsers and compliance checkers for Sparrow DSL

3 points

a month ago

510.

Show HN: Drone Swarm Control with RL in AirSim and SB3 (github.com/Lauqz)

2 points

a year ago