AI safety evaluation framework that tests LLM epistemic robustness under adversarial manipulation of the model's own conversation history
python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness
Updated Dec 18, 2025 - Python