Table of Links

Abstract
1 Introduction
2 Background
3 Privacy-Relevant Methods
4 Identifying API Privacy-relevant Methods
5 Labels for Personal Data Processing
6 Process of Identifying Personal Data
7 Data-based Ranking of Privacy-relevant Methods
8 Application to Privacy Code Review
9 Related Work
Conclusion, Future Work, Acknowledgement And References

6 Process of Identifying Personal Data

At this stage of the process, it is important to distinguish between personal data and personally identifiable information (PII). Although both cover information related to an individual, PII is the category of data that directly identifies a person. Examples include account information, contact information, personal ID, and national ID. Not all 10 categories of personal data fall under the PII label. Protecting PII matters in particular because its exposure can lead to personal or psychological harm, such as identity theft.

We adopt the pattern-matching approach developed by Tang et al. [?]. This approach has successfully identified data from 10 categories, including Account, Contact, Personal Identification, Location, and National ID. We use Semgrep, a tool built around pattern matching on code, to facilitate this process. The Semgrep rules were written for the Java and JavaScript languages.

6.1 Static Analysis for Personal Data Identification

The first phase of our process uses static analysis to find code fragments that contain personal data.
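As a rough illustration of what this kind of static pattern matching can look like, the following Python sketch scans source text for identifiers whose names suggest personal data and groups them by category. It is not the Semgrep rule set used in this work: the keyword lists, the `CATEGORY_PATTERNS` table, and the `find_personal_data` function are hypothetical and cover only four of the categories named above.

```python
import re

# Hypothetical keyword patterns per category (assumption for illustration;
# the actual rules are not reproduced here).
CATEGORY_PATTERNS = {
    "Account": re.compile(r"user_?name|password|login", re.IGNORECASE),
    "Contact": re.compile(r"e_?mail|phone", re.IGNORECASE),
    "Location": re.compile(r"latitude|longitude|address", re.IGNORECASE),
    "National ID": re.compile(r"ssn|passport", re.IGNORECASE),
}

# Matches word-like tokens: identifiers, keywords, and words inside string literals.
IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def find_personal_data(source: str):
    """Return sorted (line_number, token, category) triples for suspicious tokens."""
    hits = set()
    for lineno, line in enumerate(source.splitlines(), start=1):
        for token in IDENTIFIER.findall(line):
            for category, pattern in CATEGORY_PATTERNS.items():
                if pattern.search(token):
                    hits.add((lineno, token, category))
    return sorted(hits)


# A small Java-like snippet used as input (hypothetical example code).
JAVA_SNIPPET = '''
String userEmail = request.getParameter("email");
int retries = 3;
String passportNo = form.getPassportNo();
'''

if __name__ == "__main__":
    for lineno, token, category in find_personal_data(JAVA_SNIPPET):
        print(f"line {lineno}: {token} -> {category}")
```

Unlike Semgrep, this sketch works on raw text and has no notion of syntax or data flow, so it cannot tell a variable from a method name or a comment. It only shows the name-based matching idea.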
We use Semgrep for this task because of its performance and its flexibility in scanning many codebases. We also rely on Semgrep's support for multiple languages and its ability to analyze data flow.

6.2 Defining Sources of Personal Data

In our analysis, sources are the points where personal data is obtained. We define personal data sources in two ways: 1) as literal text that appears in the source code, and 2) as variables, based on their identifiers. Our rules are written to support Java, JavaScript, and TypeScript, but they can be extended to the many other languages that Semgrep supports.

6.3 Rule Crafting for Identification

To craft the identification rules, we use regular expressions (regex). [...]

Authors: Feiyang Tang, Bjarte M. Østvold

This paper is available on arXiv under a CC BY-NC-SA 4.0 license.