Publication:
Dataset Reconstruction Attack against Language Models

dc.contributor.authorPanchendrarajan, R
dc.contributor.authorBhoi, S
dc.date.accessioned2022-04-25T08:30:32Z
dc.date.available2022-04-25T08:30:32Z
dc.date.issued2021-07
dc.description.abstractWith the advances of deep learning techniques in Natural Language Processing, the last few years have witnessed releases of powerful language models such as BERT and GPT-2. However, applying these general-purpose language models to domain-specific applications requires further fine-tuning using domain-specific private data. Since private data is mostly confidential, information that can be extracted by an adversary with access to the models can lead to serious privacy risks. The majority of privacy attacks on language models infer either targeted information or a few instances from the training dataset. However, inferring the whole training dataset has not been explored in depth which poses far greater risks than disclosure of some instances or partial information of the training data. In this work, we propose a novel data reconstruction attack that also infers the informative words present in the private dataset. Experiment results show that an adversary with black-box query access to a fine-tuned language model can infer the informative words with an accuracy of about 75% and can reconstruct nearly 46.67% of the sentences in the private dataset.en_US
dc.identifier.urihttps://rda.sliit.lk/handle/123456789/2045
dc.language.isoenen_US
dc.relation.ispartofseriesCEUR Workshop Proceedings;Vol 2942 Pages 1-17
dc.subjectLanguage Modelsen_US
dc.subjectDataset Reconstruction Attacken_US
dc.subjectInformation Leakageen_US
dc.titleDataset Reconstruction Attack against Language Modelsen_US
dc.typeArticleen_US
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
DatasetReconstructionAttackagainstLanguageModels.pdf
Size:
1.39 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: