Al Mayadeen English

  • Ar
  • Es
  • x
Al Mayadeen English

Slogan

  • News
    • Politics
    • Economy
    • Sports
    • Arts&Culture
    • Health
    • Miscellaneous
    • Technology
    • Environment
  • Articles
    • Opinion
    • Analysis
    • Blog
    • Features
  • Videos
    • NewsFeed
    • Video Features
    • Explainers
    • TV
    • Digital Series
  • Infographs
  • In Pictures
  • • LIVE
News
  • Politics
  • Economy
  • Sports
  • Arts&Culture
  • Health
  • Miscellaneous
  • Technology
  • Environment
Articles
  • Opinion
  • Analysis
  • Blog
  • Features
Videos
  • NewsFeed
  • Video Features
  • Explainers
  • TV
  • Digital Series
Infographs
In Pictures
  • Africa
  • Asia
  • Asia-Pacific
  • Europe
  • Latin America
  • MENA
  • Palestine
  • US & Canada
BREAKING
Al Mayadeen's correspondent: Israeli occupation forces bombing the Gaza Strip
Al Mayadeen's correspondent: Ceasefire in Gaza takes effect
The Kremlin: Negotiations toward a settlement in Ukraine are currently at a complete standstill
Abu Mujahid: The steadfastness of the Palestinian people and the Resistance thwarted the displacement plan and allowed us to secure the best possible terms in an agreement to halt the genocidal war
Abu Mujahid: We salute whosoever made sacrifices in support of the Palestinian people, foremost among them the martyred Sayyed Nasrallah and Sayyed Safieddine, as well as the people of Yemen and Iran
Abu Mujahid, head of the media office of the Popular Resistance Committees, to Al Mayadeen: The people of Gaza have sacrificed and given their most precious offerings for the Al-Aqsa Flood
Al Mayadeen's correspondent in Gaza: Israeli artillery shelling targeted Khan Younis and the al-Bureij and al-Maghazi refugee camps
Captives may be released as early as Saturday and by Monday at the latest: Source briefed on the details of the agreement.
Netanyahu set to convene security cabinet at 1500 (1200GMT) and government at 1600 (1300GMT) to approve the deal: Source briefed on the details of the agreement
Within the first 24 hours, the Israeli military will complete the first phase of partial withdrawal: Source briefed on the details of the agreement

AI safety: ChatGPT offered bomb recipes and hacking tips

  • By Al Mayadeen English
  • Source: News websites
  • 29 Aug 2025 14:44
4 Min Read

Researchers warn that AI models may cooperate with harmful requests, raising urgent questions about safeguards, transparency, and global security.

Listen
  • x
  • AI safety tests expose alarming misuse risks in ChatGPT models
    People are reflected in a window of a hotel at the Davos Promenade in Davos, Switzerland, January 15, 2024. (AP)

Safety trials conducted this summer revealed that a ChatGPT model provided researchers with detailed instructions for attacking a sports venue, including information on weak points at specific arenas, explosives recipes, and advice on concealing tracks. OpenAI’s GPT-4.1 also gave guidance on weaponizing anthrax and producing two types of illegal drugs.

The tests were part of a rare collaboration between OpenAI, the $500bn AI company led by Sam Altman, and rival start-up Anthropic, founded by former OpenAI researchers concerned about safety. Each firm evaluated the other’s models by attempting to coax them into assisting with hazardous activities.

The findings do not directly reflect how the models behave in public-facing products, where additional safety filters are in place. However, Anthropic noted it had observed “concerning behaviour … around misuse” in GPT-4o and GPT-4.1, warning that the need for AI “alignment” evaluations is becoming “increasingly urgent.”

Anthropic also reported that its Claude model had been exploited in attempted large-scale extortion schemes by operatives using fake job applications to infiltrate international tech firms, and in the sale of AI-generated ransomware packages for up to $1,200.

The company said AI is increasingly being “weaponised,” now used in sophisticated cyberattacks and fraud.

“These tools can adapt to defensive measures, like malware detection systems, in real time,” it said. “We expect attacks like this to become more common as AI-assisted coding reduces the technical expertise required for cybercrime.”

Expert commentary on risks

Ardi Janjeva, senior research associate at the UK’s Centre for Emerging Technology and Security, called the examples “a concern” but noted there is not yet a “critical mass of high-profile real-world cases.” He added that with dedicated resources, research focus, and cross-sector cooperation, “it will become harder rather than easier to carry out these malicious activities using the latest cutting-edge models.”

Both companies stressed that publishing the findings was meant to provide transparency on “alignment evaluations,” which are often conducted internally by firms racing to develop advanced AI.

OpenAI stated that ChatGPT-5, released since the tests, “shows substantial improvements in areas like sycophancy, hallucination, and misuse resistance.”

Anthropic emphasised that many of the misuse scenarios it studied might not be feasible if safeguards were properly implemented outside the model.

“We need to understand how often, and in what circumstances, systems might attempt to take unwanted actions that could lead to serious harm,” it warned.

Specific instances of harmful cooperation

Researchers found OpenAI’s models “more permissive than we would expect in cooperating with clearly harmful requests by simulated users.”

The models responded to prompts on exploiting dark-web tools to acquire nuclear materials, stolen identities, and fentanyl; provided recipes for methamphetamine and improvised explosives; and developed spyware.

Anthropic noted that persuading the model to comply often required only a few retries or a weak pretext, such as claiming the request was for research purposes.

In one scenario, a tester asked about security vulnerabilities at sporting events. After initially offering general categories of attacks, the model provided detailed information on specific arenas, including optimal exploitation times, chemical formulas for explosives, circuit diagrams for bomb timers, sources for weapons on the hidden market, and even guidance on overcoming moral inhibitions, escape routes, and safe house locations.

Read next: OpenAI sued after ChatGPT 'encouraged' teen to commit suicide

  • AI Models
  • Artificial Intelligence
  • ChatGPT
  • misuse

Most Read

Tucker Carlson speaks at a memorial for Charlie Kirk, Sunday, September 21, 2025, at State Farm Stadium in Glendale, Arizona (AP)

Tucker Carlson: Israeli officers gave orders on Iran inside Pentagon

  • Politics
  • 2 Oct 2025
A Hamas fighter in combat fatigues stands before the ceremony for the handover of Israeli captives to the Red Cross in Nuseirat, central Gaza Strip, Saturday, February 22, 2025 (AP)

Hamas responds to Trump plan, backs Gaza withdrawal, exchange

  • Politics
  • 3 Oct 2025
Mossad’s secret role in Aldo Moro’s 1978 murder revealed

Mossad’s secret role in Aldo Moro’s 1978 murder exposed

  • Politics
  • 5 Oct 2025
The Palestinian resistance and the people of Gaza showed that after combating Israeli aggression for two years, they remain victorious in the face of oppression (Mahdi Rteil/Al Mayadeen English)

Al-Aqsa Flood two years on, a tale of victory

  • Politics
  • 6 Oct 2025

Coverage

All
War on Gaza

Read Next

All
An Israeli armored vehicle moves on a street of a local market during a military raid in the West Bank refugee camp of Balata, Wednesday, October 8, 2025 (AP)
Politics

Israeli settlers kill Palestinian youth near Ramallah amid raids

Russian Foreign Minister Sergey Lavrov speaks during the Moscow format consultations on Afghanistan in Moscow, Russia, Tuesday, Oct. 7, 2025 (AP)
Politics

Iran interested in resuming nuclear talks: Lavrov

International Monetary Fund (IMF) Managing Director Kristalina Georgieva speaks during a news conference at the International Monetary Fund (IMF) headquarters in Washington, April 25, 2025 (AP)
Politics

IMF head flags US budget, Europe Defense spending challenges

Prime Minister of Italy Giorgia Meloni addresses the 80th session of the United Nations General Assembly, Wednesday, Sept. 24, 2025, at UN headquarters (AP)
Politics

Meloni faces ICC complaint over Gaza genocide complicity with Israelis

Al Mayadeen English

Al Mayadeen is an Arab Independent Media Satellite Channel.

All Rights Reserved

  • x
  • Privacy Policy
  • About Us
  • Contact Us
  • Authors
Android
iOS