May 9, 2026 · 10 items
Anthropic dropped a stacked alignment + interpretability batch — "Teaching Claude Why" cuts misalignment 22% → 3% by training on reasoning instead of behavior, and Natural Language Autoencoders read Claude's activations as text.
Anthropic is buying compute from xAI and SpaceX — the three companies most loudly competing for AGI are now openly trading the infrastructure none of them can build fast enough alone.