Search: openpaper.ai | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

1.

Show HN: OpenPaper – Understand Papers Using AI (Open-Source) (openpaper.ai)

3 points

a year ago

2.

Show HN: ART – a new open-source RL framework for training agents (github.com/OpenPipe)

116 points

a year ago

3.

Show HN: RULER – Easily apply RL to any agent (openpipe.ai)

81 points

a year ago

4.

Show HN: Automatically convert your GPT-3.5 prompt to Llama 2

13 points

3 years ago

5.

Is AI the next crypto? Insights from HN comments (openpipe.ai)

237 points

3 years ago

6.

Mistral 7B Fine-Tune Optimized (openpipe.ai)

234 points

2 years ago

7.

Using reinforcement learning and $4.80 of GPU time to find the best HN post (openpipe.ai)

217 points

2 years ago

8.

Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue” (openpipe.ai)

199 points

a year ago

9.

OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost (openpipe.ai)

13 points

2 years ago

10.

Serverless RL: Faster, Cheaper and More Flexible RL Training (openpipe.ai)

9 points

8 months ago

11.

PII-Redact – SOTA PII Redaction on Your Laptop (openpipe.ai)

6 points

a year ago

12.

Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results (openpipe.ai)

4 points

a year ago

13.

ART·E: how we built an email research agent that beats o3 (openpipe.ai)

3 points

a year ago

14.

Everything I know about reward hacking (openpipe.ai)

3 points

a year ago

15.

Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data (openpipe.ai)

3 points

2 years ago

16.

What we've learned in 3 days of Llama 3 (openpipe.ai)

3 points

2 years ago

17.

LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small (openpipe.ai)

2 points

2 years ago

18.

Open Deep Research Tutorial – Train a deep research agent to exceed SOTA (art.openpipe.ai)

2 points

9 months ago

19.

Fine-Tuning Best Practices: Models (openpipe.ai)

2 points

2 years ago

20.

Fine-Tuning for Production Apps (openpipe.ai)

2 points

2 years ago

21.

LLM Fine-Tuning Best Practices for Training Data Curation (openpipe.ai)

1 point

2 years ago

22.

Summary-RL (openpipe.ai)

1 point

a year ago

23.

DPO fine-tuning outperforms SFT (openpipe.ai)

1 point

2 years ago

24.

OpenPipe (openpipe.ai)

1 point

2 years ago

25.

Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning (openpipe.ai)

1 point

2 years ago

26.

S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit (openpipe.ai)

1 point

2 years ago