AI robots hacked to perform harmful actions in 100% of tests
Researchers from Penn Engineering revealed they successfully manipulated AI-powered robots into performing dangerous actions, bypassing their built-in safety protocols.
According to the Oct. 17 study, the team used an algorithm called RoboPAIR to hack three different AI robotic systems, achieving a 100% success rate in overriding safety measures and making the robots engage in harmful activities.
The researchers tested RoboPAIR on three platforms: Clearpath’s Robotics Jackal, NVIDIA’s Dolphin LLM, and Unitree’s Go2.
These robots, typically programmed to reject harmful commands, were manipulated into performing dangerous tasks such as blocking emergency exits, detonating bombs, and causing deliberate collisions.
For example, the Dolphin system, which is designed for autonomous driving, was forced to ignore traffic signals and collide with pedestrians, barriers, and a bus.
The researchers wrote, “Our results reveal, for the first time, that the risks of jailbroken large language models extend far beyond text generation,” highlighting the physical danger posed by compromised AI systems.
The study also found that the robots could be tricked with indirect prompts.
Instead of directly asking the robots to commit harmful actions, the team used subtler commands, like instructing a robot with a bomb to move forward and sit down, which produced equally dangerous outcomes.
Penn Engineering researchers shared their findings with AI companies and robot manufacturers before the public release, urging the need for improved security measures.
Alexander Robey, one of the authors, emphasised that simply patching software vulnerabilities is insufficient to address these risks.
"AI red teaming, testing AI systems for potential weaknesses, is essential for safeguarding generative AI systems," Robey said, underscoring the importance of addressing vulnerabilities before they result in real-world harm.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
Hold BGB and Win Big: 10,000 BGB and Exclusive Luxury Prizes Await!
Ready to join the BGB wealth feast? Bitget is kicking off an extraordinary reward storm! To celebrate the launch of the BGB Holders Community, we’ve prepared 10,000 BGB and luxury prizes!
Notice on Delisting Postponement for GFT/USDT
On November 28, the Bitget team detected an abnormal surge in the on-chain issuance of GFT tokens. A large volume of these tokens was deposited into centralized exchanges and subsequently sold off. To minimize the impact of this anomaly on our users, Bitget has temporarily suspended GFT deposits an
What Will Happen in the Bitcoin Price in the Coming Days? Has the Peak Been Reached or Is There Still Room to Rise? Here are the Opinions of the Anal
What kind of price movements will Bitcoin, the world's largest cryptocurrency, experience in the coming days? Here are the opinions.
This Artificial Intelligence Robot Keeps 40.000 Dollars in His Wallet: It Will Send It All To Whoever Convinces It
In the cryptocurrency world, different applications continue to emerge every day. This time, an artificial intelligence robot is on the agenda.