Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Show HN: Ran an AI agent 100x – pass rate 70%, not 100% (github.com/alepot55)
2 points
alepot55
4 months ago
discuss
2.
Show HN: Agentrial – pytest for AI agents with statistical rigor (github.com/alepot55)
2 points
alepot55
4 months ago
discuss