Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
151.
The Car Wash Problem: A variable isolation study on prompt architecture
2 points
midmost44
4 months ago
1 comment
152.
Show HN: Dingo 1.9.0 released: With enhanced hallucination detection (github.com/MigoXLab)
2 points
e06084
10 months ago
discuss
153.
Unbounded Context with Memory
1 point
codelion
2 years ago
1 comment
154.
An Evaluation of Location Encoding Systems (2018) (github.com/google)
45 points
Tomte
5 years ago
10 comments
155.
An Evaluation of Location Encoding Systems (2018) (github.com/google)
41 points
Tomte
7 years ago
discuss
156.
Evaluation of Location Encoding Systems (2021) (github.com/google)
32 points
tosh
3 years ago
14 comments
157.
Evaluate and Track Your LLM Experiments: Introducing TruLens for LLMs (github.com/truera)
25 points
shayaks
3 years ago
7 comments
158.
Evaluation of Location Encoding Systems (github.com/google)
3 points
sandebert
4 years ago
discuss
159.
Anthropic's Prompt Evalutions Course (github.com/anthropics)
2 points
thenameless7741
2 years ago
discuss
160.
An Evaluation of Location Encoding Systems (2018) (github.com/google)
2 points
Tomte
3 years ago
discuss
161.
AKS on HCI: Azure VM Eval (github.com/Azure)
2 points
mad_vill
5 years ago
discuss
162.
Google: Evaluation of Location Encoding Systems (github.com/google)
2 points
espeed
9 years ago
discuss
163.
Llama2 on Replicate faster than ChatGPT? (github.com/BerriAI)
1 point
ij23
3 years ago
2 comments
164.
We Built an Open-Source Text-to-Image Evaluation Library for Clip Models (github.com/encord-team)
1 point
Stephen_Oladele
2 years ago
1 comment
165.
Evaluation of Location Encoding Systems (github.com/google)
1 point
CharlesDodgson
8 years ago
discuss
166.
Show HN: High-performance GenAI engine now open source (github.com/arthur-ai)
22 points
fryz
a year ago
12 comments
167.
Show HN: Billion Cell Spreadsheets with Incremental Computation (xls.feldera.io)
15 points
gz09
a year ago
1 comment
168.
Show HN: Voicetest – open-source test harness for voice AI agents
3 points
pldpld
4 months ago
discuss
169.
Sharing learnings from evaluating Million+ LLM responses
3 points
sourabh03agr
3 years ago
discuss
170.
Show HN: Adventures in UTM – Busy Beaver in under 5–10 mins
2 points
polymetron
a year ago
2 comments
171.
Show HN: Dingo – Automate Data Quality Checks Across Pre-Training and SFT Data (github.com/DataEval)
1 point
e06084
a year ago
discuss
172.
Ask HN: How to run Piper text-to-speech on a Mac (using Docker)?
1 point
dv35z
2 years ago
discuss
173.
Which other AI search engines should we keep an eye on?
1 point
james_chu
2 years ago
discuss
174.
You thought that “This should never happen was bad”? search – eval($_GET) (github.com)
23 points
callaars
10 years ago
15 comments
175.
Benchmark GGUF model with ONE line of code (github.com/NexaAI)
6 points
alanzhuly
2 years ago
1 comment
176.
Medical Question-Answer AI Model Evaluation Framework (github.com/chat-data-llc)
4 points
freexiaosu
2 years ago
2 comments
177.
ClojureScript gets a new REPL (github.com/clojure)
4 points
bashwort
15 years ago
discuss
178.
OpenAI cookbook: using GPT-4 as “reference-free” evaluator (github.com/openai)
3 points
zostale
3 years ago
discuss
179.
Test cases took my AI router from 82% to 98% accuracy (github.com/copycat-main)
2 points
a8hi
2 months ago
1 comment
180.
Benchmark GGUF models with a one line of code (github.com/NexaAI)
1 point
mountainview
2 years ago
discuss
More