Open Source
Google AMS: Scan Open-Weight LLMs for Safety in 40 Seconds
Google's open source AMS tool scans open-weight LLMs for intact safety training by measuring activation ...



