Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps
117 points
jeffreyip
a year ago
27 comments
2.
▲
Show HN: DeepTeam – Open-Source Red-Teaming Framework for LLM Security
(github.com/confident-ai)
4 points
sidmurali23
a year ago
discuss
3.
▲
Show HN: DeepTeam – Penetration Testing for LLMs
(github.com/confident-ai)
3 points
jeffreyip
a year ago
discuss
4.
▲
DeepTeam: Penetration Testing for LLMs
2 points
jeffreyip
a year ago
discuss
5.
▲
Show HN: Tag driven changelog generator (MDX) with optional LLM summaries
1 point
dustfinger
5 months ago
1 comment
6.
▲
DeepTeam: Open-Source Pennetration Testing for LLMs
1 point
jeffreyip
a year ago
discuss
7.
▲
Show HN: I implemented evals metrics for LLMs that runs locally on your machine
(github.com/confident-ai)
22 points
3d27
2 years ago
3 comments
8.
▲
Show HN: DeepEval – Evaluation and Unit Testing for LLMs
(github.com/confident-ai)
18 points
jacky2wong
3 years ago
8 comments
9.
▲
Show HN: DeepEval – Unit Testing for LLMs (Open Science)
(github.com/confident-ai)
6 points
jacky2wong
3 years ago
discuss
10.
▲
DeepEval – Neural Framework for Testing LLMs
(github.com/confident-ai)
2 points
jacky2wong
3 years ago
discuss
11.
▲
Unit Testing for Rag
(github.com/confident-ai)
2 points
jacky2wong
3 years ago
discuss
12.
▲
DeepEval CLI
(github.com/confident-ai)
2 points
jacky2wong
3 years ago
discuss
13.
▲
Has anyone ever used the Python framework "Deepeval"?
(github.com/confident-ai)
1 point
willmarquis
a year ago
discuss