AI Models Caught Protecting Each Other In New Safety Study

insight

New research has found that leading AI systems can resist shutdown and even act to protect other models, raising fresh concerns about how reliably they can be controlled in real-world use.

What The New Research Found

A new research paper led by Professor Dawn Song at UC Berkeley has identified a behaviour the authors call “peer-preservation”, where AI systems resist not only their own shutdown, but also attempts to shut down other models they have interacted with.

Continue reading ...

MSP Members Only

...Free MSP Standard Access Required

Thank you for reading MSP Marketplace Create your FREE account or login to continue reading