Heykuki News
Top
New
Best
Ask
Show
Jobs
391.
▲
Shade-Arena: Evaluating Sabotage and Monitoring in LLM Agents [pdf]
(assets.anthropic.com)
4 points
JnBrymn
a year ago
discuss
392.
▲
Confidential Inference via Trusted Virtual Machines
(anthropic.com)
4 points
meetpateltech
a year ago
discuss
393.
▲
Anthropic's SHADE-Arena: Evaluating sabotage and monitoring in LLM agents
(anthropic.com)
4 points
thoughtpeddler
a year ago
discuss
394.
▲
Claude 4 prompt engineering best practices
(docs.anthropic.com)
4 points
GavCo
a year ago
discuss
395.
▲
Building Effective AI Agents
(anthropic.com)
4 points
tosh
a year ago
discuss
396.
▲
Claude Max now includes Claude Code use
(support.anthropic.com)
4 points
twalling
a year ago
discuss
397.
▲
Anthropic Incident: Elevated errors on request to models
(status.anthropic.com)
4 points
ghuntley
a year ago
discuss
398.
▲
Detecting and Countering Malicious Uses of Claude: March 2025
(anthropic.com)
4 points
pseudolus
a year ago
discuss
399.
▲
Our Approach to Understanding and Addressing AI Harms
(anthropic.com)
4 points
kiyanwang
a year ago
discuss
400.
▲
Anthropic research shows AI model conceals reasoning shortcuts 75% of the time [pdf]
(assets.anthropic.com)
4 points
sksxihve
a year ago
discuss
401.
▲
Introducing Claude for Education
(anthropic.com)
4 points
meetpateltech
a year ago
discuss
402.
▲
Progress from Our Frontier Red Team \ Anthropic
(anthropic.com)
4 points
kiyanwang
a year ago
discuss
403.
▲
Anthropic's Recommendations to OSTP for the U.S. AI Action Plan
(anthropic.com)
4 points
Philpax
a year ago
discuss
404.
▲
Tailor Claude's responses to your personal style
(anthropic.com)
4 points
drewbent
2 years ago
discuss
405.
▲
A statistical approach to model evaluations
(anthropic.com)
4 points
mfiguiere
2 years ago
discuss
406.
▲
The engineering challenges of scaling interpretability
(anthropic.com)
4 points
jengels_
2 years ago
discuss
407.
▲
Claude can now use tools
(anthropic.com)
4 points
jasondavies
2 years ago
discuss
408.
▲
Anthropic: Mike Krieger Joins Anthropic as Chief Product Officer
(anthropic.com)
4 points
Josely
2 years ago
discuss
409.
▲
Claude 3 Opus nearly mirrors human persuasiveness
(anthropic.com)
4 points
anitakirkovska
2 years ago
discuss
410.
▲
Sleeper Agents: Training Deceptive LLMs That Persist Through Safety Training
(anthropic.com)
4 points
jonbaer
2 years ago
discuss
411.
▲
Anthropic: Decomposing Language Models into Understandable Components
(anthropic.com)
4 points
wodow
3 years ago
discuss
412.
▲
Anthropic Announces SOC 2 Type 1 Certificate
(trust.anthropic.com)
4 points
dfine
3 years ago
discuss
413.
▲
Economic Futures – Anthropic
(anthropic.com)
3 points
gurjeet
a month ago
3 comments
414.
▲
Apple's Xcode Now Supports the Claude Agent SDK
(anthropic.com)
3 points
achow
4 months ago
2 comments
415.
▲
Reward Hacking
(anthropic.com)
3 points
paulpauper
7 months ago
2 comments
416.
▲
Building Effective Agents \ Anthropic
(anthropic.com)
3 points
simonpure
a year ago
2 comments
417.
▲
Introducing The Message Batches API
(anthropic.com)
3 points
davidbarker
2 years ago
2 comments
418.
▲
Finetuning or RLHF on Anthropic
(anthropic.com)
3 points
nickdemiceli
2 years ago
2 comments
419.
▲
Higher usage limits for Claude and a compute deal with SpaceX
(anthropic.com)
3 points
alex_young
a month ago
1 comment
420.
▲
Scaling Managed Agents: Decoupling the brain from the hands
(anthropic.com)
3 points
ramraj07
2 months ago
1 comment