Version 2 is live! Wordlists sorted by probability originally created for password generation and testing - make sure your passwords aren't popular!
Get reminded of the absolute worst passwords using the @WorstPasswords twitter account!
Release one used about 300 files. I did not filter them very well. Some "password" wordlists contained non-password lines. This time, I pulled out all the md5 and SHA1 hashes that had somehow weaseled their way in, removed files that contained nothing but lists of things like cartoon characters and made sure the lists were as pure as possible.
Release two uses over 1500 files. I attempted to clean each one, which in some cases required a judgement call on my end.
V1 Popularity was determined by appearances in as few as 2 files. Since some lists are compilations of smaller lists, this resulted in some lines that really appeared only once to make the cut. More files allowed me to raise the minimum threshold to 5 appearances.
Duplicates in Release one were caused by a mismatch of newline characters. Not this time. Every single processing step included a tr -d '\r'
step. I wasn't going to have them slip in.
For V1, I made a judgement call (based on no evidence) that I wasn't going to include lines that contained blankspace characters. Since I was already disregarding ASCII characters, I thought this wouldn't cause much of a problem.
In V2, I included lines that had non-ASCII characters. This opens up the entire non-American English speaking world of passwords! I also decided not to remove blankspace characters - on the whole. These characters were rare enough to leave in without causing a flood of duplicates.
In some files, a blankspace character was at the end of every line. In these cases, I would remove the final blankspace character from all lines. However, some files did not have consistency when it came to beginning or ending with blankspace characters. In this instance, I would leave them in place, since I had reason to believe the blankspaces were part of the data.
While I haven't visualized them yet, I included mask analysis for the files in this release. These have been ranked in order of popularity.
I have also included HashCat rulesets, also in order of popularity.
UPDATE: Probable-Wordlists needs your help! Support the project by seeding the files below, please.
Task List for 1.2 Included:
Issues Addressed in Release
Assuming the torrents prove to reliable, this will be the stable release for a while as I prepare for a complete overhaul for Rev 2 in the next month(s)
Task List for Rev 1.1 Included:
Issues Addressed in Release:
Initial Release