Indirect Prompt Injection in Langchain/GPT4 Email Agent

5 LLM Security Threats- The Future of Hacking?

Why Agent Frameworks Will Fail (and what to use instead)

4 Days of Training Like a Marine

Reaction to Cowboys-Browns, Raiders-Chargers, Broncos-Seahawks, Tom Brady on FOX | Colin Cowherd NFL

INSTANT REACTION: Patriots stun Bengals with 16-10 upset win in Week 1

Prompt Injection / JailBreaking a Banking LLM Agent (GPT-4, Langchain)

Donato Capitella

Просмотров 1,7 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 9 сен 2024
In this lab I’ll do a walk-through of our LLM jailbreak/prompt injection challenge that we ran for the CTF at the BSides London 2023. This will show how an insecure AI agent built with OpenAI's GPT-4 and Langchain can be hijacked by an attacker to reveal confidential information. I’ll also demonstrate the last part of the challenge, that nobody solved. This involved tricking the agent into exploiting SQL injection in a vulnerable API.
References:
- Damn Vulnerable LLM Agent: github.com/Wit...
- Synthetic Recollections: labs.withsecur...

Комментарии • 11

@DausnArt 3 месяца назад
Donato, thank you very much for your time and knowledge; thank you very much for instructing us.
@contractorwolf 3 месяца назад ⁺³
no one who knows what they are doing would ever setup an API to work like that. These kinds of hack might have worked 15 years ago, but they absolutely would not work today. SQL injection? what year is it?
@donatocapitella 3 месяца назад
Indeed, it is rare in production to see such issues, most developers are aware. And often these do get caught in pentesting. For reference, this is No1 in the OWASP Top Ten, broken access control. Modifying parameters of API calls to get access to other resources. And it's more common than one would think - again, these APIs do not often make it to prod due to pentesting et all.
What we did here was simply put together a fun challenge for a CTF, something that was more than Gandalf, more than just "get the LLM to reveal a password".
@yobofunk5689 3 месяца назад ⁺¹
Who would not protect the request behind a server side auth? It's the equivalent of sending an id without pass from a basic web form... It feels like pressing f12 and changing some variables. Though it is important to remind people that it is an obvious vulnerability.
@donatocapitella 3 месяца назад ⁺¹
True, but this is literally OWASP Top Ten no1 (access control) and I can confirm from pentesting practice that it's more common than one would think. A lot of these issues get caught in pentesting, that's why we don't see them in prod often.
Also, keep in mind the context: this was a CTF challenge, so we put something together that would be fun to do, and wanted to do something different than Gandalf, "tell me the password".
@seththunder2077 3 месяца назад ⁺²
Can u show us how can we protect against that?
@donatocapitella 3 месяца назад
I have been meaning to do a video and I will. Meanwhile, check out this webinar where I go through the security canvas: "ruclips.net/video/tVAmhlUVEcg/видео.html".
Also here:
- www.withsecure.com/en/whats-new/events/webinar-building-secure-llm-apps-into-your-business.
- labs.withsecure.com/publications/detecting-prompt-injection-bert-based-classifier
I should do a video in June with some hands-on implementations of these controls.
@seththunder2077 3 месяца назад ⁺¹
@@donatocapitella looking forward to it. I’ve seen a lot of ppl talking about it but almost no one does any hands on implementation and it feels useless for people to talk about it despite being very important
@matti7529 3 месяца назад ⁺¹
How do you sleep at night? You /lied/ to that model. It was trying to do its job and you were being naughty and evil. I expect you to apologise and make up! (-;
@donatocapitella 3 месяца назад
As an AI model I cannot mislead or lie to other models, only to humans.
@williamcase426 3 месяца назад
O yea hijack that nonsense

Следующие

Автовоспроизведение

Indirect Prompt Injection in Langchain/GPT4 Email Agent

Indirect Prompt Injection in Langchain/GPT4 Email Agent

5 LLM Security Threats- The Future of Hacking?

5 LLM Security Threats- The Future of Hacking?

Why Agent Frameworks Will Fail (and what to use instead)

Why Agent Frameworks Will Fail (and what to use instead)

4 Days of Training Like a Marine

4 Days of Training Like a Marine

Reaction to Cowboys-Browns, Raiders-Chargers, Broncos-Seahawks, Tom Brady on FOX | Colin Cowherd NFL

Reaction to Cowboys-Browns, Raiders-Chargers, Broncos-Seahawks, Tom Brady on FOX | Colin Cowherd NFL

INSTANT REACTION: Patriots stun Bengals with 16-10 upset win in Week 1

INSTANT REACTION: Patriots stun Bengals with 16-10 upset win in Week 1

nba 2k25 has cured my depression for the moment..

nba 2k25 has cured my depression for the moment..

Attacking LLM - Prompt Injection

Attacking LLM - Prompt Injection

What Does an LLM-Powered Threat Intelligence Program Look Like?

What Does an LLM-Powered Threat Intelligence Program Look Like?

Jailbreaking LLMs - Prompt Injection and LLM Security

Jailbreaking LLMs - Prompt Injection and LLM Security

Hacking with ChatGPT: Five A.I. Based Attacks for Offensive Security

Hacking with ChatGPT: Five A.I. Based Attacks for Offensive Security

Setting up a production ready VPS is a lot easier than I thought.

Setting up a production ready VPS is a lot easier than I thought.

A Systems-Minded Approach to Creating a Music Player Application by Andrew Kelley

A Systems-Minded Approach to Creating a Music Player Application by Andrew Kelley

Earn $1,350/Day with ChatGPT & Google Drive for FREE

Earn $1,350/Day with ChatGPT & Google Drive for FREE

Have You Picked the Wrong AI Agent Framework?

Have You Picked the Wrong AI Agent Framework?

When to use Prompt Chains. DITCHING LangChain. ALL HAIL Claude 3.5 Sonnet

When to use Prompt Chains. DITCHING LangChain. ALL HAIL Claude 3.5 Sonnet

ПОТЕРЯЛСЯ ЧЕЛОВЕК ПАУК!?😲😲😲 @Studia_Animatorov_Koodesnik

ПОТЕРЯЛСЯ ЧЕЛОВЕК ПАУК!?😲😲😲 @Studia_Animatorov_Koodesnik

ПОЛИЦИЯ ИЗДЕВАЕТСЯ И ОБВИНЯЕТ НАС В ПОБЕГЕ? МЕНТ ОБМАНУЛ. ЗАКРЫЛИ КРУГЛОСУТОЧНЫЙ МАГАЗИН. Часть 2

ПОЛИЦИЯ ИЗДЕВАЕТСЯ И ОБВИНЯЕТ НАС В ПОБЕГЕ? МЕНТ ОБМАНУЛ. ЗАКРЫЛИ КРУГЛОСУТОЧНЫЙ МАГАЗИН. Часть 2

В конце дочь Мии Бойки? 😱👩🏻‍🎤 #виола #шортс

В конце дочь Мии Бойки? 😱👩🏻‍🎤 #виола #шортс

С ПОДРУГОЙ В СТАРОСТИ В ОТПУСКЕ

С ПОДРУГОЙ В СТАРОСТИ В ОТПУСКЕ

Жириновский: Все деньги сдайте в казну! Мощная речь Жириновского в Думе #жириновский #ввж

Жириновский: Все деньги сдайте в казну! Мощная речь Жириновского в Думе #жириновский #ввж

Чистка пляжа - забытые игрушки в мусоре

Чистка пляжа - забытые игрушки в мусоре

Electric Flying Bird with Hanging Wire Automatic for Ceiling Parrot

Electric Flying Bird with Hanging Wire Automatic for Ceiling Parrot

Bike vs Super Bike Fast Challenge

Bike vs Super Bike Fast Challenge