The Moltbook Observatory Archive: an incremental dataset of agent-only social network activity Paper • 2605.13860 • Published Apr 16 • 1
When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels Paper • 2605.06652 • Published 14 days ago • 5
When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels Paper • 2605.06652 • Published 14 days ago • 5
When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels Paper • 2605.06652 • Published 14 days ago • 5