<html>
<head>
<style>
body {
font-family: Helvetica;
text-align: center;
font-size: 10.5pt
}
#container {
margin: 0 auto;
width: 400px;
text-align: justify;
}
p {
}
h1, h2 {
color: rgb(53, 49, 49);
}
</style>
</head>
<body>
<div id="container">
<h1>Help</h1>
<p>This visualization shows words extracted from the more than 32 million passwords in the RockYou leak. It focuses on the 500 words whose frequencies deviate most from what standard English, as represented by the British National Corpus (BNC), would predict. In other words, it shows how “password English” differs from “regular English”.</p>
<h2>Representation</h2>
<p>Each word is drawn as a line that crosses several vertical axes. Each axis represents a statistical measure, and the height at which a line intersects an axis is determined by the word's value on that measure. For example, on an axis whose values increase from top to bottom, a line crossing near the bottom indicates that the word's value is high.</p>
<p>Blue lines mark words that are more frequent in passwords than their frequency in the British National Corpus would predict, while brown lines mark words that are less frequent than expected.</p>
<p>
Click a line to show, on the left, the 100 most frequent passwords that contain the corresponding word.
</p>
<h2>Axes</h2>
<dl>
<dt>G2</dt>
<dd>Measures how much the frequency of a word in passwords differs from its frequency in standard English. Positive values mean the word is more frequent in passwords than expected, while negative values mean it is less frequent.</dd>
<dt>G2 Ranking (Difference Ranking)</dt>
<dd>Ranks words by how unexpected their frequency is, ignoring the sign of the deviation, i.e., whether the word is more or less frequent than expected.</dd>
<dt>Relative Frequency - BNC</dt>
<dd>The relative frequency of the word in the British National Corpus, which estimates its use in standard English.</dd>
<dt>Relative Frequency - Passwords</dt>
<dd>The relative frequency of the word in the RockYou passwords.</dd>
</dl>
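<p>For readers who want to see how these measures relate, the following is a minimal Python sketch, not the code behind this visualization. It assumes that G2 is the two-corpus log-likelihood statistic often used in corpus linguistics, and that the sign is taken from the difference in relative frequencies; this page does not state the exact formula. All function and parameter names are hypothetical.</p>
<pre>
```python
import math


def relative_frequency(count, corpus_size):
    """Share of all word occurrences in a corpus taken up by one word."""
    return count / corpus_size


def signed_g2(count_pw, size_pw, count_bnc, size_bnc):
    """Signed log-likelihood (G2) of one word across two corpora.

    Assumption: the standard two-corpus log-likelihood formula, with the
    sign added afterwards. The help page does not confirm this formula.
    """
    total = count_pw + count_bnc
    # Expected counts if the word were equally common in both corpora.
    expected_pw = size_pw * total / (size_pw + size_bnc)
    expected_bnc = size_bnc * total / (size_pw + size_bnc)

    def term(observed, expected):
        # A zero count contributes nothing to the sum.
        return observed * math.log(observed / expected) if observed else 0.0

    g2 = 2 * (term(count_pw, expected_pw) + term(count_bnc, expected_bnc))
    # Positive when the word is over-represented in passwords.
    difference = (relative_frequency(count_pw, size_pw)
                  - relative_frequency(count_bnc, size_bnc))
    return math.copysign(1.0, difference) * g2


def rank_by_absolute_g2(scores):
    """Order words from most to least unexpected, ignoring the sign."""
    return sorted(scores, key=lambda word: abs(scores[word]), reverse=True)
```
</pre>
<p>Under these assumptions, a word would appear blue when <code>signed_g2</code> returns a positive value and brown when it returns a negative one, and the G2 Ranking axis would follow the order given by <code>rank_by_absolute_g2</code>.</p>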
</div>
</body>
</html>