Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Show HN: OpenPaper – Understand Papers Using AI (Open-Source)
(openpaper.ai)
3 points
sabaimran
a year ago
discuss
2.
▲
Show HN: ART – a new open-source RL framework for training agents
(github.com/OpenPipe)
116 points
kcorbitt
a year ago
12 comments
3.
▲
Show HN: RULER – Easily apply RL to any agent
(openpipe.ai)
81 points
kcorbitt
a year ago
11 comments
4.
▲
Show HN: Automatically convert your GPT-3.5 prompt to Llama 2
13 points
kcorbitt
3 years ago
2 comments
5.
▲
Is AI the next crypto? Insights from HN comments
(openpipe.ai)
237 points
kcorbitt
3 years ago
367 comments
6.
▲
Mistral 7B Fine-Tune Optimized
(openpipe.ai)
234 points
tosh
2 years ago
103 comments
7.
▲
Using reinforcement learning and $4.80 of GPU time to find the best HN post
(openpipe.ai)
217 points
kcorbitt
2 years ago
95 comments
8.
▲
Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue”
(openpipe.ai)
199 points
kcorbitt
a year ago
55 comments
9.
▲
OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost
(openpipe.ai)
13 points
kcorbitt
2 years ago
2 comments
10.
▲
Serverless RL: Faster, Cheaper and More Flexible RL Training
(openpipe.ai)
9 points
slewis
8 months ago
3 comments
11.
▲
PII-Redact – SOTA PII Redaction on Your Laptop
(openpipe.ai)
6 points
Arctic_fly
a year ago
1 comment
12.
▲
Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results
(openpipe.ai)
4 points
kcorbitt
a year ago
discuss
13.
▲
ART·E: how we built an email research agent that beats o3
(openpipe.ai)
3 points
kcorbitt
a year ago
2 comments
14.
▲
Everything I know about reward hacking
(openpipe.ai)
3 points
kcorbitt
a year ago
discuss
15.
▲
Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data
(openpipe.ai)
3 points
sebg
2 years ago
discuss
16.
▲
What we've learned in 3 days of Llama 3
(openpipe.ai)
3 points
kcorbitt
2 years ago
discuss
17.
▲
LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small
(openpipe.ai)
2 points
billmalarky
2 years ago
1 comment
18.
▲
Open Deep Research Tutorial – Train a deep research agent to exceed SOTA
(art.openpipe.ai)
2 points
rahimnathwani
9 months ago
discuss
19.
▲
Fine-Tuning Best Practices: Models
(openpipe.ai)
2 points
gk1
2 years ago
discuss
20.
▲
Fine-Tuning for Production Apps
(openpipe.ai)
2 points
ijidak
2 years ago
discuss
21.
▲
LLM Fine-Tuning Best Practices for Training Data Curation
(openpipe.ai)
1 point
billmalarky
2 years ago
2 comments
22.
▲
Summary-RL
(openpipe.ai)
1 point
s16h
a year ago
discuss
23.
▲
DPO fine-tuning outperforms SFT
(openpipe.ai)
1 point
kcorbitt
2 years ago
discuss
24.
▲
OpenPipe
(openpipe.ai)
1 point
handfuloflight
2 years ago
discuss
25.
▲
Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning
(openpipe.ai)
1 point
kcorbitt
2 years ago
discuss
26.
▲
S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit
(openpipe.ai)
1 point
kcorbitt
2 years ago
discuss