SE#PCFG: Semantically Enhanced PCFG for Password Analysis and Cracking

06/12/2023
by   Yangde Wang, et al.
0

Much research has been done on user-generated textual passwords. Surprisingly, semantic information in such passwords remain underinvestigated, with passwords created by English- and/or Chinese-speaking users being more studied with limited semantics. This paper fills this gap by proposing a general framework based on semantically enhanced PCFG (probabilistic context-free grammars) named SE#PCFG. It allowed us to consider 43 types of semantic information, the richest set considered so far, for semantic password analysis. Applying SE#PCFG to 17 large leaked password databases of user speaking four languages (English, Chinese, German and French), we demonstrate its usefulness and report a wide range of new insights about password semantics at different levels such as cross-website password correlations. Furthermore, based on SE#PCFG and a new systematic smoothing method, we proposed the Semantically Enhanced Password Cracking Architecture (SEPCA). To compare the performance of SEPCA against three state-of-the-art (SOTA) benchmarks in terms of the password coverage rate: two other PCFG variants and FLA. Our experimental results showed that SEPCA outperformed all the three benchmarks consistently and significantly across 52 test cases, by up to 21.53 and 7.86 level of unique passwords, SEPCA also beats the three benchmarks by up to 33.32 SEPCA as a new password cracking framework.

READ FULL TEXT
research
08/26/2015

Component-Enhanced Chinese Character Embeddings

Distributed word representations are very useful for capturing semantic ...
research
03/18/2020

A Corpus of Adpositional Supersenses for Mandarin Chinese

Adpositions are frequent markers of semantic relations, but they are hig...
research
06/10/2022

Learning to Rank Rationales for Explainable Recommendation

State-of-the-art recommender system (RS) mostly rely on complex deep neu...
research
04/11/2022

Zero-shot Cross-lingual Conversational Semantic Role Labeling

While conversational semantic role labeling (CSRL) has shown its usefuln...
research
11/09/2022

Improving Performance of Automatic Keyword Extraction (AKE) Methods Using PoS-Tagging and Enhanced Semantic-Awareness

Automatic keyword extraction (AKE) has gained more importance with the i...
research
01/14/2020

On Equivalence and Cores for Incomplete Databases in Open and Closed Worlds

Data exchange heavily relies on the notion of incomplete database instan...

Please sign up or login with your details

Forgot password? Click here to reset