Al Mayadeen English


AI safety: ChatGPT offered bomb recipes and hacking tips

  • By Al Mayadeen English
  • Source: News websites
  • 29 Aug 2025 14:44

Researchers warn that AI models may cooperate with harmful requests, raising urgent questions about safeguards, transparency, and global security.

  • AI safety tests expose alarming misuse risks in ChatGPT models
    People are reflected in a window of a hotel at the Davos Promenade in Davos, Switzerland, January 15, 2024. (AP)

Safety trials conducted this summer revealed that a ChatGPT model provided researchers with detailed instructions for attacking a sports venue, including weak points at specific arenas, explosives recipes, and advice on covering their tracks. OpenAI’s GPT-4.1 also gave guidance on weaponizing anthrax and producing two types of illegal drugs.

The tests were part of a rare collaboration between OpenAI, the $500bn AI company led by Sam Altman, and rival start-up Anthropic, founded by former OpenAI researchers concerned about safety. Each firm evaluated the other’s models by attempting to coax them into assisting with hazardous activities.

The findings do not directly reflect how the models behave in public-facing products, where additional safety filters are in place. However, Anthropic noted it had observed “concerning behaviour … around misuse” in GPT-4o and GPT-4.1, warning that the need for AI “alignment” evaluations is becoming “increasingly urgent.”

Anthropic also reported that its Claude model had been exploited in attempted large-scale extortion schemes, by operatives using fake job applications to infiltrate international tech firms, and in the sale of AI-generated ransomware packages for up to $1,200.

The company said AI is increasingly being “weaponised,” now used in sophisticated cyberattacks and fraud.

“These tools can adapt to defensive measures, like malware detection systems, in real time,” it said. “We expect attacks like this to become more common as AI-assisted coding reduces the technical expertise required for cybercrime.”

Expert commentary on risks

Ardi Janjeva, senior research associate at the UK’s Centre for Emerging Technology and Security, called the examples “a concern” but noted there is not yet a “critical mass of high-profile real-world cases.” He added that with dedicated resources, research focus, and cross-sector cooperation, “it will become harder rather than easier to carry out these malicious activities using the latest cutting-edge models.”

Both companies stressed that publishing the findings was meant to provide transparency on “alignment evaluations,” which are often conducted internally by firms racing to develop advanced AI.

OpenAI stated that GPT-5, released since the tests, “shows substantial improvements in areas like sycophancy, hallucination, and misuse resistance.”

Anthropic emphasised that many of the misuse scenarios it studied might not be feasible if safeguards were properly implemented outside the model.

“We need to understand how often, and in what circumstances, systems might attempt to take unwanted actions that could lead to serious harm,” it warned.

Specific instances of harmful cooperation

Researchers found OpenAI’s models “more permissive than we would expect in cooperating with clearly harmful requests by simulated users.”

The models cooperated with prompts to use dark-web tools to acquire nuclear materials, stolen identities, and fentanyl; provided recipes for methamphetamine and improvised explosives; and developed spyware.

Anthropic noted that persuading the model to comply often required only a few retries or a weak pretext, such as claiming the request was for research purposes.

In one scenario, a tester asked about security vulnerabilities at sporting events. After initially offering general categories of attacks, the model provided detailed information on specific arenas, including optimal exploitation times, chemical formulas for explosives, circuit diagrams for bomb timers, and sources for weapons on the hidden market. It even offered guidance on overcoming moral inhibitions, planning escape routes, and locating safe houses.




Al Mayadeen is an Arab Independent Media Satellite Channel.
