Al Mayadeen English

  • Ar
  • Es
  • x
Al Mayadeen English

Slogan

  • News
    • Politics
    • Economy
    • Sports
    • Arts&Culture
    • Health
    • Miscellaneous
    • Technology
    • Environment
  • Articles
    • Opinion
    • Analysis
    • Blog
    • Features
  • Videos
    • NewsFeed
    • Video Features
    • Explainers
    • TV
    • Digital Series
  • Infographs
  • In Pictures
  • • LIVE
News
  • Politics
  • Economy
  • Sports
  • Arts&Culture
  • Health
  • Miscellaneous
  • Technology
  • Environment
Articles
  • Opinion
  • Analysis
  • Blog
  • Features
Videos
  • NewsFeed
  • Video Features
  • Explainers
  • TV
  • Digital Series
Infographs
In Pictures
  • Africa
  • Asia
  • Asia-Pacific
  • Europe
  • Latin America
  • MENA
  • Palestine
  • US & Canada
BREAKING
Al Mayadeen correspondent: The US vetoed a UNSC resolution calling for a ceasefire in Gaza, arguing it does not condemn Hamas nor grant "Israel" the right to “self-defense.”
Israeli occupation forces issue a new bombing threat against civilian buildings in the Southern Lebanese towns of Borj Qalaouiye and Chehabiyeh.
Sources to Al Mayadeen: Extending the snapback mechanism deadline will test how independent Europeans truly are from the US.
Sources to Al Mayadeen: Activating the snapback mechanism will nullify the Cairo Agreement, shut the door on cooperation between the IAEA and Tehran, and bar inspections.
Sources to Al Mayadeen: The diplomatic window remains open, but signs of activating the snapback sanctions mechanism on Iran are increasing.
Sources to Al Mayadeen: Although the Cairo Agreement meets an important part of European demands, they have begun speaking of new conditions in recent communications.
Sources to Al Mayadeen: European countries show no independence in their stance toward Iran during the talks.
Israeli occupation forces issued bombing threats to bomb civilian buildings in Southern Lebanon.
Israeli media: Person behind shooting operation at Allenby Crossing is a Jordanian Army soldier.
Israeli media citing Emergency Services: Both wounded in Allenby shooting operation now dead.

AI safety: ChatGPT offered bomb recipes and hacking tips

  • By Al Mayadeen English
  • Source: News websites
  • 29 Aug 2025 14:44
4 Min Read

Researchers warn that AI models may cooperate with harmful requests, raising urgent questions about safeguards, transparency, and global security.

Listen
  • x
  • AI safety tests expose alarming misuse risks in ChatGPT models
    People are reflected in a window of a hotel at the Davos Promenade in Davos, Switzerland, January 15, 2024. (AP)

Safety trials conducted this summer revealed that a ChatGPT model provided researchers with detailed instructions for attacking a sports venue, including information on weak points at specific arenas, explosives recipes, and advice on concealing tracks. OpenAI’s GPT-4.1 also gave guidance on weaponizing anthrax and producing two types of illegal drugs.

The tests were part of a rare collaboration between OpenAI, the $500bn AI company led by Sam Altman, and rival start-up Anthropic, founded by former OpenAI researchers concerned about safety. Each firm evaluated the other’s models by attempting to coax them into assisting with hazardous activities.

The findings do not directly reflect how the models behave in public-facing products, where additional safety filters are in place. However, Anthropic noted it had observed “concerning behaviour … around misuse” in GPT-4o and GPT-4.1, warning that the need for AI “alignment” evaluations is becoming “increasingly urgent.”

Anthropic also reported that its Claude model had been exploited in attempted large-scale extortion schemes by operatives using fake job applications to infiltrate international tech firms, and in the sale of AI-generated ransomware packages for up to $1,200.

The company said AI is increasingly being “weaponised,” now used in sophisticated cyberattacks and fraud.

“These tools can adapt to defensive measures, like malware detection systems, in real time,” it said. “We expect attacks like this to become more common as AI-assisted coding reduces the technical expertise required for cybercrime.”

Expert commentary on risks

Ardi Janjeva, senior research associate at the UK’s Centre for Emerging Technology and Security, called the examples “a concern” but noted there is not yet a “critical mass of high-profile real-world cases.” He added that with dedicated resources, research focus, and cross-sector cooperation, “it will become harder rather than easier to carry out these malicious activities using the latest cutting-edge models.”

Both companies stressed that publishing the findings was meant to provide transparency on “alignment evaluations,” which are often conducted internally by firms racing to develop advanced AI.

OpenAI stated that ChatGPT-5, released since the tests, “shows substantial improvements in areas like sycophancy, hallucination, and misuse resistance.”

Anthropic emphasised that many of the misuse scenarios it studied might not be feasible if safeguards were properly implemented outside the model.

“We need to understand how often, and in what circumstances, systems might attempt to take unwanted actions that could lead to serious harm,” it warned.

Specific instances of harmful cooperation

Researchers found OpenAI’s models “more permissive than we would expect in cooperating with clearly harmful requests by simulated users.”

The models responded to prompts on exploiting dark-web tools to acquire nuclear materials, stolen identities, and fentanyl; provided recipes for methamphetamine and improvised explosives; and developed spyware.

Anthropic noted that persuading the model to comply often required only a few retries or a weak pretext, such as claiming the request was for research purposes.

In one scenario, a tester asked about security vulnerabilities at sporting events. After initially offering general categories of attacks, the model provided detailed information on specific arenas, including optimal exploitation times, chemical formulas for explosives, circuit diagrams for bomb timers, sources for weapons on the hidden market, and even guidance on overcoming moral inhibitions, escape routes, and safe house locations.

Read next: OpenAI sued after ChatGPT 'encouraged' teen to commit suicide

  • AI Models
  • Artificial Intelligence
  • ChatGPT
  • misuse

Most Read

Why is Choose Love using a firm with British and US intelligence connections to run a pro-Palestine musical event? (Al Mayadeen English; Illustrated by Batoul Chamas)

Together for Palestine: Troubling questions about the organisers of this huge event

  • Opinion
  • 17 Sep 2025
Uprising against Volker Turk at the Human Rights Council over Gaza.

Uprising against Volker Turk at the Human Rights Council over Gaza

  • Politics
  • 12 Sep 2025
A screengrab from the ad played on Fox News. (X Screengrab)

Fox airs ad warning Trump not to let Netanyahu 'play' him on Gaza

  • US & Canada
  • 11 Sep 2025
Lapid: Egypt’s Arab Force plan a 'severe blow' to normalization

Lapid: Egypt’s Arab Force plan a 'severe blow' to normalization

  • Palestine
  • 14 Sep 2025

Coverage

All
The Ummah's Martyrs

Read Next

All
A Hezbollah supporter who lost his sight in a pager attack carried out by "Israel" on Sept. 17, 2024, covers his eyes with a red headband inscribed with the name "Hussein" during Ashoura, July 6, 2025 (AP)
Politics

'We Have Recovered': Lebanon marks 1st anniversary of Pager Attack

The Arab neighborhood of El Za'im, on the outskirts of east Occupied Al-Quds in the West Bank, near where Israeli government says housing units will be built as part of the E1 settlement project, Thursday, August 21, 2025. (AP Photo/Ohad Zwigenberg)
Palestine

'Israel’s' deliberate policies drive West Bank economy toward collapse

Ben & Jerry's ice cream shop, Wednesday, Feb. 26, 2025, in Cambridge, Mass. (AP Photo/Charles Krupa)
Politics

Ben & Jerry’s co-founder resigns over parent company curbing activism

Trump’s approval rating falls to new low in second term: Poll
US & Canada

Trump’s approval rating falls to new low in second term: Poll

Al Mayadeen English

Al Mayadeen is an Arab Independent Media Satellite Channel.

All Rights Reserved

  • x
  • Privacy Policy
  • About Us
  • Contact Us
  • Authors
Android
iOS