Differential privacy (DP) adds calibrated noise during training, which limits how much the model can memorize any individual data point.
Example: when a GPT-2 variant is trained on sensitive text, DP reduces exact memorization by 90%.
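
To make "calibrated noise" concrete, here is a minimal sketch of one noisy update, assuming the DP training meant above is DP-SGD style (clip each example's gradient, then add Gaussian noise scaled to the clipping bound). The names `clip_gradient`, `dp_sgd_step`, `clip_norm`, and `noise_multiplier` are illustrative and not taken from the notes. The sketch does no privacy accounting, so it does not report an epsilon.

```python
import math
import random


def clip_gradient(grad, clip_norm):
    """Scale a gradient down so its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    # Gradients already within the bound are left unchanged (scale == 1.0).
    scale = min(1.0, clip_norm / max(norm, 1e-12))
    return [g * scale for g in grad]


def dp_sgd_step(weights, per_example_grads, lr, clip_norm, noise_multiplier, rng):
    """One update: clip per-example gradients, sum, add Gaussian noise, average."""
    clipped = [clip_gradient(g, clip_norm) for g in per_example_grads]
    # The noise is "calibrated": its standard deviation is tied to the clipping
    # bound, which caps how much any single example can move the sum.
    sigma = noise_multiplier * clip_norm
    batch_size = len(per_example_grads)
    new_weights = []
    for i, w in enumerate(weights):
        total = sum(g[i] for g in clipped)
        noisy_mean = (total + rng.gauss(0.0, sigma)) / batch_size
        new_weights.append(w - lr * noisy_mean)
    return new_weights


# Toy batch: the first gradient has norm 5 and is clipped; the second is not.
rng = random.Random(0)
weights = [0.0, 0.0, 0.0]
grads = [[3.0, 4.0, 0.0], [0.1, 0.0, 0.0]]
noisy = dp_sgd_step(weights, grads, lr=0.1, clip_norm=1.0,
                    noise_multiplier=1.1, rng=rng)
```

Clipping bounds the contribution of any one example, and the noise masks whatever contribution remains. In practice a library that also tracks the privacy budget would replace this hand-rolled step.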