Bayes, not Naïve: Security Bounds on Website Fingerprinting Defenses

Giovanni Cherubin

Research output: Contribution to journal › Article › peer-review


Website Fingerprinting (WF) attacks raise major concerns about users’ privacy. They employ Machine Learning (ML) techniques to enable a local passive adversary to uncover the Web browsing behavior of a user, even if she browses through an encrypted tunnel (e.g., Tor, VPN). Numerous defenses have been proposed in the past; however, it is typically difficult to obtain formal guarantees on their security, which is most often evaluated empirically against state-of-the-art attacks. In this paper, we present a practical method to derive security bounds for any WF defense, where the bounds depend on a chosen feature set. This result derives from reducing WF attacks to an ML classification task, for which we can determine the smallest achievable error (the Bayes error). This error can be estimated in practice, and it lower-bounds the error of a WF adversary for any classification algorithm he may use. Our work has two main consequences: i) it allows determining the security of WF defenses, in a black-box manner, with respect to the state-of-the-art feature set, and ii) it motivates shifting the focus of future WF research to identifying optimal feature sets. The generality of this approach further suggests that the method could be used to define security bounds for other ML-based attacks.
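The reduction described in the abstract, estimating the smallest achievable classification error and reading it as a security lower bound, can be sketched with a standard estimator. The snippet below is a minimal illustration, not necessarily the estimator used in the paper: it computes the leave-one-out error of a 1-nearest-neighbour classifier on (feature vector, page label) pairs, then converts that error into a lower bound on the Bayes error via the classic Cover and Hart inequality. All function and variable names here are illustrative.

```python
import numpy as np

def nn_error_loo(X, y):
    """Leave-one-out error of the 1-nearest-neighbour classifier.

    X: (n, d) array of feature vectors (e.g., traffic features per trace).
    y: (n,) array of class labels (e.g., website identities).
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    # Pairwise squared Euclidean distances between all traces.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    # Exclude each point from being its own nearest neighbour.
    np.fill_diagonal(d2, np.inf)
    nearest = d2.argmin(axis=1)
    return float((y[nearest] != y).mean())

def bayes_lower_bound(nn_err, n_classes):
    """Cover-Hart lower bound on the Bayes error from the 1-NN error.

    Inverts R_NN <= R* (2 - c/(c-1) R*) to get the smallest Bayes
    error R* consistent with the observed 1-NN error.
    """
    c = n_classes
    inner = max(0.0, 1.0 - c / (c - 1) * nn_err)
    return (c - 1) / c * (1.0 - np.sqrt(inner))
```

No attack can achieve an error below the returned bound on the same feature set, regardless of the classifier it uses, which is what makes the estimate usable as a black-box security metric for a defense.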
Original language: English
Pages (from-to): 215-231
Number of pages: 17
Journal: Proceedings on Privacy Enhancing Technologies
Issue number: 4
Early online date: 10 Oct 2017
Publication status: Published - 2017


  • website fingerprinting
  • privacy metric
