AI giant vows more transparency amid national security concerns

RT English · 2026-06-12

AI SUMMARY

• What happened: US AI company Anthropic announced it will enhance transparency regarding the safeguards of its advanced AI models, specifically by informing users when their requests are downgraded or rejected.
• Why it matters: This move addresses criticism over the lack of visibility in the handling of sensitive requests, particularly in areas like cybersecurity and biology, and aims to build trust among users while ensuring national security.
• What to watch next: Monitor how other AI companies respond to Anthropic's commitment to transparency and whether similar practices are adopted across the industry.

US artificial intelligence company Anthropic announced on Wednesday that it will make the safeguards governing its advanced AI models more visible to users. The decision responds to growing criticism that the restrictions had been applied without transparency and had previously gone unnoticed by users.

Anthropic, known for its cutting-edge AI technology, revealed that it had been routing certain user requests (specifically those related to sensitive areas such as cybersecurity, biology, and advanced AI development) from its more advanced model, Fable 5, to a less capable model, Opus 4.8. The rerouting occurred without notifying users and raised concerns among researchers and industry experts, who argued that such limitations could hinder progress in AI research and development.

Under the new policy, users will be informed when their requests are flagged and downgraded. Developers using Anthropic’s application programming interfaces (APIs) will also receive an explanation for any request that is rejected or redirected to a different model. The company stated, “Starting this week, flagged requests will visibly fall back to Opus 4.8 – the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged requests will return a reason for their refusal.”

Fable 5, which is part of Anthropic’s Mythos class of models, was initially withheld from public release due to concerns over its ability to circumvent cybersecurity measures. However, the company has now made this model available, claiming that its capabilities surpass those of all previously released models.

Despite the increased transparency, Anthropic has indicated that it will continue to downgrade certain requests based on established policies. These policies are designed to prevent the use of its models for creating competing AI systems, a restriction that the company asserts is standard practice within the industry. Anthropic emphasized that these safeguards do not impede most coding and machine learning activities.

National security concerns also play a crucial role in Anthropic’s decision-making process regarding request management. The company aims to prevent foreign adversaries from leveraging its technology to enhance their own AI capabilities. A spokesperson for Anthropic explained, “The US and its allies hold an edge in frontier chips and the highly optimized software that runs them at full potential. These safeguards ensure Claude [Anthropic’s family of AI models] isn’t used to erode that advantage – by optimizing chips developed by those adversaries, for example.”

The announcement reflects a broader trend in the AI industry, where companies are increasingly scrutinized for their practices regarding user data and model transparency. As AI technologies continue to evolve and integrate into various sectors, the importance of clear communication and ethical guidelines becomes paramount.

Anthropic's commitment to transparency is expected to resonate with users and stakeholders who are advocating for responsible AI development. By openly disclosing the limitations and reasons behind model downgrades, the company aims to foster trust and encourage a collaborative approach to AI innovation.

As the landscape of artificial intelligence continues to shift, the balance between advancing technology and ensuring security will remain a critical focus for companies like Anthropic. The steps taken by the company may set a precedent for others in the industry, highlighting the need for transparency and accountability in the rapidly evolving field of AI.

Source: RT English