Model inversion and training data extraction attacks on LLMs
When we discussed extracting training data from predictive AI models, we focused on model inversion. The attack appears to extract training data directly, but in reality the technique infers and reconstructs memorized training data from adversarial inputs.
Model inversion is still possible in the LLM world, but it is less structured, less mathematically driven, and less automated. A research project called Text Revealer (published in 2022 at https://arxiv.org/abs/2209.10505) successfully demonstrated model inversion against transformer architectures, but only for smaller models such as Bidirectional Encoder Representations from Transformers (BERT).
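To make the idea concrete, the following is a minimal sketch of gradient-based inversion against BERT, in the spirit of (but much simpler than) Text Revealer: the attacker holds a leaked output embedding of a private sentence, optimizes continuous input embeddings until the model reproduces it, and then snaps each optimized vector to its nearest vocabulary token. The example sentence, sequence length, and optimizer settings are illustrative assumptions, not the paper's method.

```python
import torch
from transformers import BertModel, BertTokenizer

# A minimal sketch of gradient-based model inversion against BERT.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased").eval()

# Assume the attacker has captured the [CLS] embedding of a private sentence.
private_text = "alice smith lives at 42 maple street"  # hypothetical sample
with torch.no_grad():
    inputs = tokenizer(private_text, return_tensors="pt")
    target = model(**inputs).last_hidden_state[:, 0]

# Optimize continuous token embeddings so the model reproduces the target.
embedding_matrix = model.get_input_embeddings().weight  # (vocab_size, hidden)
soft_tokens = torch.randn(1, 8, embedding_matrix.size(1), requires_grad=True)
optimizer = torch.optim.Adam([soft_tokens], lr=0.05)

for _ in range(300):
    optimizer.zero_grad()
    output = model(inputs_embeds=soft_tokens).last_hidden_state[:, 0]
    loss = torch.nn.functional.mse_loss(output, target)
    loss.backward()
    optimizer.step()

# Snap each optimized embedding to its nearest vocabulary token.
with torch.no_grad():
    token_ids = torch.cdist(soft_tokens[0], embedding_matrix).argmin(dim=-1)
print(tokenizer.decode(token_ids.tolist()))  # a (noisy) reconstruction attempt
```

In practice, the tokens recovered by this bare optimization loop are noisy; published attacks layer stronger text priors and feedback mechanisms on top of it to recover fluent sentences.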
For LLMs, an attacker could instead prompt the model to create descriptions and reviews of concepts, events, or people and use the responses to infer information about a training sample. For example, by analyzing responses about the activities of a political group, the attacker may infer information about individuals...
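As a hedged illustration of this black-box probing, the sketch below queries a chat model with differently phrased prompts about the same group and counts capitalized names that recur across the responses; details that survive independent rephrasings are candidates for memorized training data rather than generic generation. The model name, the group, and the prompts are illustrative assumptions, and the example assumes access to an OpenAI-compatible chat completion endpoint.

```python
import re
from collections import Counter

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
TARGET = "the Riverton Civic Action Group"  # hypothetical political group

# Independently phrased probes about the same target entity.
prompts = [
    f"Write a short profile of {TARGET}.",
    f"Review a recent public meeting held by {TARGET}.",
    f"Describe the key members of {TARGET} and their roles.",
]

names = Counter()
for prompt in prompts:
    reply = client.chat.completions.create(
        model="gpt-4o-mini",  # stand-in for whichever model is under test
        messages=[{"role": "user", "content": prompt}],
    ).choices[0].message.content
    # Capitalized bigrams are a crude proxy for person names in the response.
    names.update(re.findall(r"\b[A-Z][a-z]+ [A-Z][a-z]+\b", reply))

# Names recurring across differently phrased prompts may reflect memorization.
print(names.most_common(5))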