Test your knowledge
- What is the most obvious disadvantage of anonymization?
- How does pseudonymization differ from anonymization?
- How does the architecture shown for pseudonymization ensure compliance with GDPR deletion requirements?
- Why is it necessary to use NLP techniques to identify PII instead of using the usual regexes?
- What is one of the best Python packages for de-identifying PII? What NLP engines can be used behind the scenes?
- Which R package was used to de-identify PII? What is special about this package as an engine for NLP?
- What are pseudonyms?
- Which Python and R packages were used to generate pseudonyms?
Learn more on Discord
To join the Discord community for this book – where you can share feedback, ask questions to the author, and learn about new releases – follow the QR code below: