Today we're launching a tool to help you evaluate and test prompts for generative AI.
We have been building various GPT-3 apps and noticed a gap in tooling that helps developers assess the quality of different prompts.
Our tool helps you template prompts and test them across different datasets and LLMs, including GPT-3, GPT-3.5, and open-source models like flan-t5-xxl.
We are just getting started and would be delighted to receive your feedback.