Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

151.

The Car Wash Problem: A variable isolation study on prompt architecture

2 points

4 months ago

152.

Show HN: Dingo 1.9.0 released: With enhanced hallucination detection (github.com/MigoXLab)

2 points

10 months ago

153.

Unbounded Context with Memory

1 point

2 years ago

154.

An Evaluation of Location Encoding Systems (2018) (github.com/google)

45 points

5 years ago

155.

An Evaluation of Location Encoding Systems (2018) (github.com/google)

41 points

7 years ago

156.

Evaluation of Location Encoding Systems (2021) (github.com/google)

32 points

3 years ago

157.

Evaluate and Track Your LLM Experiments: Introducing TruLens for LLMs (github.com/truera)

25 points

3 years ago

158.

Evaluation of Location Encoding Systems (github.com/google)

3 points

4 years ago

159.

Anthropic's Prompt Evalutions Course (github.com/anthropics)

2 points

thenameless7741

2 years ago

160.

An Evaluation of Location Encoding Systems (2018) (github.com/google)

2 points

3 years ago

161.

AKS on HCI: Azure VM Eval (github.com/Azure)

2 points

5 years ago

162.

Google: Evaluation of Location Encoding Systems (github.com/google)

2 points

9 years ago

163.

Llama2 on Replicate faster than ChatGPT? (github.com/BerriAI)

1 point

3 years ago

164.

We Built an Open-Source Text-to-Image Evaluation Library for Clip Models (github.com/encord-team)

1 point

Stephen_Oladele

2 years ago

165.

Evaluation of Location Encoding Systems (github.com/google)

1 point

8 years ago

166.

Show HN: High-performance GenAI engine now open source (github.com/arthur-ai)

22 points

a year ago

167.

Show HN: Billion Cell Spreadsheets with Incremental Computation (xls.feldera.io)

15 points

a year ago

168.

Show HN: Voicetest – open-source test harness for voice AI agents

3 points

4 months ago

169.

Sharing learnings from evaluating Million+ LLM responses

3 points

3 years ago

170.

Show HN: Adventures in UTM – Busy Beaver in under 5–10 mins

2 points

a year ago

171.

Show HN: Dingo – Automate Data Quality Checks Across Pre-Training and SFT Data (github.com/DataEval)

1 point

a year ago

172.

Ask HN: How to run Piper text-to-speech on a Mac (using Docker)?

1 point

2 years ago

173.

Which other AI search engines should we keep an eye on?

1 point

2 years ago

174.

You thought that “This should never happen was bad”? search – eval($_GET) (github.com)

23 points

10 years ago

175.

Benchmark GGUF model with ONE line of code (github.com/NexaAI)

6 points

2 years ago

176.

Medical Question-Answer AI Model Evaluation Framework (github.com/chat-data-llc)

4 points

2 years ago

177.

ClojureScript gets a new REPL (github.com/clojure)

4 points

15 years ago

178.

OpenAI cookbook: using GPT-4 as “reference-free” evaluator (github.com/openai)

3 points

3 years ago

179.

Test cases took my AI router from 82% to 98% accuracy (github.com/copycat-main)

2 points

2 months ago

180.

Benchmark GGUF models with a one line of code (github.com/NexaAI)

1 point

2 years ago