Heykuki News
151.
▲
The Car Wash Problem: A variable isolation study on prompt architecture
2 points
midmost44
4 months ago
1 comment
152.
▲
Show HN: Dingo 1.9.0 released: With enhanced hallucination detection
(github.com/MigoXLab)
2 points
e06084
10 months ago
discuss
153.
▲
Unbounded Context with Memory
1 point
codelion
2 years ago
1 comment
154.
▲
An Evaluation of Location Encoding Systems (2018)
(github.com/google)
45 points
Tomte
5 years ago
10 comments
155.
▲
An Evaluation of Location Encoding Systems (2018)
(github.com/google)
41 points
Tomte
7 years ago
discuss
156.
▲
Evaluation of Location Encoding Systems (2021)
(github.com/google)
32 points
tosh
3 years ago
14 comments
157.
▲
Evaluate and Track Your LLM Experiments: Introducing TruLens for LLMs
(github.com/truera)
25 points
shayaks
3 years ago
7 comments
158.
▲
Evaluation of Location Encoding Systems
(github.com/google)
3 points
sandebert
4 years ago
discuss
159.
▲
Anthropic's Prompt Evaluations Course
(github.com/anthropics)
2 points
thenameless7741
2 years ago
discuss
160.
▲
An Evaluation of Location Encoding Systems (2018)
(github.com/google)
2 points
Tomte
3 years ago
discuss
161.
▲
AKS on HCI: Azure VM Eval
(github.com/Azure)
2 points
mad_vill
5 years ago
discuss
162.
▲
Google: Evaluation of Location Encoding Systems
(github.com/google)
2 points
espeed
9 years ago
discuss
163.
▲
Llama2 on Replicate faster than ChatGPT?
(github.com/BerriAI)
1 point
ij23
3 years ago
2 comments
164.
▲
We Built an Open-Source Text-to-Image Evaluation Library for Clip Models
(github.com/encord-team)
1 point
Stephen_Oladele
2 years ago
1 comment
165.
▲
Evaluation of Location Encoding Systems
(github.com/google)
1 point
CharlesDodgson
8 years ago
discuss
166.
▲
Show HN: High-performance GenAI engine now open source
(github.com/arthur-ai)
22 points
fryz
a year ago
12 comments
167.
▲
Show HN: Billion Cell Spreadsheets with Incremental Computation
(xls.feldera.io)
15 points
gz09
a year ago
1 comment
168.
▲
Show HN: Voicetest – open-source test harness for voice AI agents
3 points
pldpld
4 months ago
discuss
169.
▲
Sharing learnings from evaluating Million+ LLM responses
3 points
sourabh03agr
3 years ago
discuss
170.
▲
Show HN: Adventures in UTM – Busy Beaver in under 5–10 mins
2 points
polymetron
a year ago
2 comments
171.
▲
Show HN: Dingo – Automate Data Quality Checks Across Pre-Training and SFT Data
(github.com/DataEval)
1 point
e06084
a year ago
discuss
172.
▲
Ask HN: How to run Piper text-to-speech on a Mac (using Docker)?
1 point
dv35z
2 years ago
discuss
173.
▲
Which other AI search engines should we keep an eye on?
1 point
james_chu
2 years ago
discuss
174.
▲
You thought that “This should never happen” was bad? search – eval($_GET)
(github.com)
23 points
callaars
10 years ago
15 comments
175.
▲
Benchmark GGUF model with ONE line of code
(github.com/NexaAI)
6 points
alanzhuly
2 years ago
1 comment
176.
▲
Medical Question-Answer AI Model Evaluation Framework
(github.com/chat-data-llc)
4 points
freexiaosu
2 years ago
2 comments
177.
▲
ClojureScript gets a new REPL
(github.com/clojure)
4 points
bashwort
15 years ago
discuss
178.
▲
OpenAI cookbook: using GPT-4 as “reference-free” evaluator
(github.com/openai)
3 points
zostale
3 years ago
discuss
179.
▲
Test cases took my AI router from 82% to 98% accuracy
(github.com/copycat-main)
2 points
a8hi
2 months ago
1 comment
180.
▲
Benchmark GGUF models with one line of code
(github.com/NexaAI)
1 point
mountainview
2 years ago
discuss