for Privacy Awareness
+49 681 9325 500
+49 681 9325 599
gerhard.weikum(Replace this parenthesis with the @ sign)cispa.saarland
The goal of this project is to devise technology for comprehensively retrieving and assessing a user’s long-term privacy-critical traces, across her entire digital habitat and the Internet at large. Today, this is not possible even for the data contained within individual platforms (such as Facebook). Meeting the grand challenge entails several sub-goals: How to find and retrieve user data from the Internet’s heterogeneous search space (social networks, forums, review sites, Deep Web services, public databases, etc.), and how to continuously monitor that data, in a scalable manner? How to determine a data item’s criticality to user privacy? How to determine its visibility, as well as its provenance, i.e., how it became visible (e.g., by actions of other users)? To address these goals, we will harness and enhance entity search, focused crawling, Deep Web querying techniques, and combinations thereof. We will use statistical methods combined with background knowledge to identify privacy-critical information, such as emotional or embarrassing statements. As cues to reconstruct data provenance, we will employ copy detection and co-reference analysis in natural language statements.
Role Within the Collaborative Research Center
In the context of this project we are looking for two qualified PhD students with a track record in topics relevant to the project. If you are interested please send a letter of intent together with your short CV to the PIs of the project.