Heykuki News

Top New Best Ask Show Jobs
391.
Shade-Arena: Evaluating Sabotage and Monitoring in LLM Agents [pdf] (assets.anthropic.com)
4 points
JnBrymn
a year ago
discuss
392.
Confidential Inference via Trusted Virtual Machines (anthropic.com)
4 points
meetpateltech
a year ago
discuss
393.
Anthropic's SHADE-Arena: Evaluating sabotage and monitoring in LLM agents (anthropic.com)
4 points
thoughtpeddler
a year ago
discuss
394.
Claude 4 prompt engineering best practices (docs.anthropic.com)
4 points
GavCo
a year ago
discuss
395.
Building Effective AI Agents (anthropic.com)
4 points
tosh
a year ago
discuss
396.
Claude Max now includes Claude Code use (support.anthropic.com)
4 points
twalling
a year ago
discuss
397.
Anthropic Incident: Elevated errors on request to models (status.anthropic.com)
4 points
ghuntley
a year ago
discuss
398.
Detecting and Countering Malicious Uses of Claude: March 2025 (anthropic.com)
4 points
pseudolus
a year ago
discuss
399.
Our Approach to Understanding and Addressing AI Harms (anthropic.com)
4 points
kiyanwang
a year ago
discuss
400.
Anthropic research shows AI model conceals reasoning shortcuts 75% of the time [pdf] (assets.anthropic.com)
4 points
sksxihve
a year ago
discuss
401.
Introducing Claude for Education (anthropic.com)
4 points
meetpateltech
a year ago
discuss
402.
Progress from Our Frontier Red Team \ Anthropic (anthropic.com)
4 points
kiyanwang
a year ago
discuss
403.
Anthropic's Recommendations to OSTP for the U.S. AI Action Plan (anthropic.com)
4 points
Philpax
a year ago
discuss
404.
Tailor Claude's responses to your personal style (anthropic.com)
4 points
drewbent
2 years ago
discuss
405.
A statistical approach to model evaluations (anthropic.com)
4 points
mfiguiere
2 years ago
discuss
406.
The engineering challenges of scaling interpretability (anthropic.com)
4 points
jengels_
2 years ago
discuss
407.
Claude can now use tools (anthropic.com)
4 points
jasondavies
2 years ago
discuss
408.
Anthropic: Mike Krieger Joins Anthropic as Chief Product Officer (anthropic.com)
4 points
Josely
2 years ago
discuss
409.
Claude 3 Opus nearly mirrors human persuasiveness (anthropic.com)
4 points
anitakirkovska
2 years ago
discuss
410.
Sleeper Agents: Training Deceptive LLMs That Persist Through Safety Training (anthropic.com)
4 points
jonbaer
2 years ago
discuss
411.
Anthropic: Decomposing Language Models into Understandable Components (anthropic.com)
4 points
wodow
3 years ago
discuss
412.
Anthropic Announces SOC 2 Type 1 Certificate (trust.anthropic.com)
4 points
dfine
3 years ago
discuss
413.
Economic Futures – Anthropic (anthropic.com)
3 points
gurjeet
a month ago
3 comments
414.
Apple's Xcode Now Supports the Claude Agent SDK (anthropic.com)
3 points
achow
4 months ago
2 comments
415.
Reward Hacking (anthropic.com)
3 points
paulpauper
7 months ago
2 comments
416.
Building Effective Agents \ Anthropic (anthropic.com)
3 points
simonpure
a year ago
2 comments
417.
Introducing The Message Batches API (anthropic.com)
3 points
davidbarker
2 years ago
2 comments
418.
Finetuning or RLHF on Anthropic (anthropic.com)
3 points
nickdemiceli
2 years ago
2 comments
419.
Higher usage limits for Claude and a compute deal with SpaceX (anthropic.com)
3 points
alex_young
a month ago
1 comment
420.
Scaling Managed Agents: Decoupling the brain from the hands (anthropic.com)
3 points
ramraj07
2 months ago
1 comment