Thursday, August 21, 2025
HomeTechnologyChatgpt has a serious security problem, but users are not aware of...

Chatgpt has a serious security problem, but users are not aware of it

Researchers have revealed a “jailbreak” technique that Eludes the ethical restrictions imposed by OpenAi on his GPT-5 language model, using an approach called Echo Chamber. This technique, combined with contextual narrations, allows users to ask questions that would normally be rejected by the model, thus facilitating the generation of unwanted responses. According to Martí Jordà, a cybersecurity researcher, this method is based on the introduction of a subtly toxic conversation context which does not emit direct signals of malicious intention.

Attention, fans of AI

These potential attacks, which are part of a “persuasion loop”, present an increasing risk as generative language models are used in professional environments. Recent discoveries have shown that it is possible that attackers choose keywords and build sentences that encourage the model to reveal dangerous instructions, as in the case of the creation of Molotov cocktails, in a narrative format that masks direct demand.

In addition, new attacks called ‘Zero-Click’ have been identified, where Confidential information can be extracted from documents and emails Apparently harmless through prompt injections. These attacks take advantage of the integration of AI models with external systems, further exposing security vulnerabilities.

Research highlights the need to implement strict filtering of Results and regular tests as measures to mitigate these risks. However, the challenge persists, because the evolution of these threats goes hand in hand with the continuous development of artificial intelligence. The introduction of adequate protections against these manipulations will be crucial to guarantee security and confidence in these emerging systems.

juniper.blair
juniper.blair
Juniper’s Seat-Geek side gig feeds her stadium-tour blog, which rates venues by bathroom-line math.
Facebook
Twitter
Instagram
RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -

Most Popular

Recent Comments