Privacy Now or Never: Large-Scale Extraction and Analysis of Dates in Privacy Policy Text

被引:2
|
作者
Srinath, Mukund [1 ]
Matheson, Lee [2 ]
Venkit, Pranav Narayanan [1 ]
Zanfir-Fortuna, Gabriela [2 ]
Schaub, Florian [3 ]
Giles, C. Lee [1 ]
Wilson, Shomir [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Future Privacy Forum, Washington, DC USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
privacy policy; date extraction; crawling;
D O I
10.1145/3573128.3609342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The General Data Protection Regulation (GDPR) and other recent privacy laws require organizations to post their privacy policies, and place specific expectations on organisations' privacy practices. Privacy policies take the form of documents written in natural language, and one of the expectations placed upon them is that they remain up to date. To investigate legal compliance with this recency requirement at a large scale, we create a novel pipeline that includes crawling, regex-based extraction, candidate date classification and date object creation to extract updated and effective dates from privacy policies written in English. We then analyze patterns in policy dates using four web crawls and find that only about 40% of privacy policies online contain a date, thereby making it difficult to assess their regulatory compliance. We also find that updates in privacy policies are temporally concentrated around passage of laws regulating digital privacy (such as the GDPR), and that more popular domains are more likely to have policy dates as well as more likely to update their policies regularly.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Large-Scale Readability Analysis of Privacy Policies
    Fabian, Benjamin
    Ermakova, Tatiana
    Lentz, Tino
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 18 - 25
  • [2] Large-scale App privacy governance
    Zitong LI
    Zhuoya FAN
    Junxu LIU
    Lcixia WANG
    Xiaofeng MENG
    Frontiers of Engineering Management, 2022, 9 (04) : 640 - 652
  • [3] Large-scale App privacy governance
    Li, Zitong
    Fan, Zhuoya
    Liu, Junxu
    Wang, Leixia
    Meng, Xiaofeng
    FRONTIERS OF ENGINEERING MANAGEMENT, 2022, 9 (04) : 640 - 652
  • [4] Large-scale App privacy governance
    Zitong Li
    Zhuoya Fan
    Junxu Liu
    Leixia Wang
    Xiaofeng Meng
    Frontiers of Engineering Management, 2022, 9 : 640 - 652
  • [5] Large-Scale Information Extraction under Privacy-Aware Constraints
    Gupta, Rajeev
    Kondapally, Ranganath
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4792 - 4793
  • [6] Large-Scale Information Extraction under Privacy-Aware Constraints
    Gupta, Rajeev
    Kondapally, Ranganath
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4845 - 4848
  • [7] Preserving Security and Privacy in Large-Scale VANETs
    Qin, Bo
    Wu, Qianhong
    Domingo-Ferrer, Jose
    Zhang, Lei
    INFORMATION AND COMMUNICATIONS SECURITY, 2011, 7043 : 121 - +
  • [8] PRIVACY AND LARGE-SCALE PERSONAL DATA SYSTEMS
    LISTER, C
    PERSONNEL AND GUIDANCE JOURNAL, 1970, 49 (03): : 207 - 211
  • [9] Large language models: a new approach for privacy policy analysis at scale
    Rodriguez, David
    Yang, Ian
    Del Alamo, Jose M.
    Sadeh, Norman
    COMPUTING, 2024, 106 (12) : 3879 - 3903
  • [10] Privacy Analysis of User Association Logs in a Large-scale Wireless LAN
    Tan, Keren
    Yan, Guanhua
    Yeo, Jihwang
    Kotz, David
    2011 PROCEEDINGS IEEE INFOCOM, 2011, : 31 - 35