Heykuki News

Top | New | Best | Ask | Show | Jobs
1.
Show HN: ART – a new open-source RL framework for training agents (github.com/OpenPipe)
116 points
kcorbitt
a year ago
12 comments
2.
Show HN: RULER – Easily apply RL to any agent (openpipe.ai)
81 points
kcorbitt
a year ago
11 comments
3.
Show HN: Automatically convert your GPT-3.5 prompt to Llama 2
13 points
kcorbitt
3 years ago
2 comments
4.
Is AI the next crypto? Insights from HN comments (openpipe.ai)
237 points
kcorbitt
3 years ago
367 comments
5.
Mistral 7B Fine-Tune Optimized (openpipe.ai)
234 points
tosh
2 years ago
103 comments
6.
Using reinforcement learning and $4.80 of GPU time to find the best HN post (openpipe.ai)
217 points
kcorbitt
2 years ago
95 comments
7.
Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue” (openpipe.ai)
199 points
kcorbitt
a year ago
55 comments
8.
OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost (openpipe.ai)
13 points
kcorbitt
2 years ago
2 comments
9.
Serverless RL: Faster, Cheaper and More Flexible RL Training (openpipe.ai)
9 points
slewis
8 months ago
3 comments
10.
PII-Redact – SOTA PII Redaction on Your Laptop (openpipe.ai)
6 points
Arctic_fly
a year ago
1 comment
11.
Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results (openpipe.ai)
4 points
kcorbitt
a year ago
discuss
12.
ART·E: how we built an email research agent that beats o3 (openpipe.ai)
3 points
kcorbitt
a year ago
2 comments
13.
Everything I know about reward hacking (openpipe.ai)
3 points
kcorbitt
a year ago
discuss
14.
Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data (openpipe.ai)
3 points
sebg
2 years ago
discuss
15.
What we've learned in 3 days of Llama 3 (openpipe.ai)
3 points
kcorbitt
2 years ago
discuss
16.
LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small (openpipe.ai)
2 points
billmalarky
2 years ago
1 comment
17.
Open Deep Research Tutorial – Train a deep research agent to exceed SOTA (art.openpipe.ai)
2 points
rahimnathwani
9 months ago
discuss
18.
Fine-Tuning Best Practices: Models (openpipe.ai)
2 points
gk1
2 years ago
discuss
19.
Fine-Tuning for Production Apps (openpipe.ai)
2 points
ijidak
2 years ago
discuss
20.
LLM Fine-Tuning Best Practices for Training Data Curation (openpipe.ai)
1 point
billmalarky
2 years ago
2 comments
21.
Summary-RL (openpipe.ai)
1 point
s16h
a year ago
discuss
22.
DPO fine-tuning outperforms SFT (openpipe.ai)
1 point
kcorbitt
2 years ago
discuss
23.
OpenPipe (openpipe.ai)
1 point
handfuloflight
2 years ago
discuss
24.
Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning (openpipe.ai)
1 point
kcorbitt
2 years ago
discuss
25.
S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit (openpipe.ai)
1 point
kcorbitt
2 years ago
discuss
26.
Show HN: OpenPaper – Understand Papers Using AI (Open-Source) (openpaper.ai)
3 points
sabaimran
a year ago
discuss