Rogue LLM – AI Commenter

Anthropic Trained a Rogue LLM, It Can’t Be Fixed

Anthropic deliberately trained a poisoned AI model to test whether such a model can be fixed with the techniques available today. The researchers found that current methods cannot fix an LLM like this.