Application and Python script to identify, remove, and/or recode personally identifiable information (PII) in field experiment datasets.
- enum_name is now included in restricted words
- app does not freeze for the 2 last datasets you shared with me
- outputs are all created in an 'outputs' folder, in the same directory as the source file
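The output location described in the notes above can be sketched as follows. This is a minimal illustration, not the application's actual code: the function name `outputs_dir_for` and the example file name are hypothetical.

```python
from pathlib import Path


def outputs_dir_for(source_file):
    """Return an 'outputs' folder next to the source file, creating it if needed.

    Hypothetical helper: the application's real function names are not
    given in the release notes.
    """
    out_dir = Path(source_file).parent / "outputs"
    # exist_ok=True makes repeated runs on the same dataset safe.
    out_dir.mkdir(parents=True, exist_ok=True)
    return out_dir
```

For a source file such as `data/survey.csv` (an assumed path), this would place results under `data/outputs/`.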
This is an alpha release of an executable GUI application for identifying PII within a dataset. This is completely untested on IPA PII-containing datasets due to a lack of data access.
As an alpha release, this is expected to contain bugs and will not work for all users. Please report any issues or feedback from your use by filing an issue on GitHub or by email. Please do not share this application outside of IPA at this time.
This Windows 7* version does not contain many features included in releases for modern operating systems. Some features not included:
Instead, this version employs a number of methods to identify fields that may contain PII, and then it lists those fields for the user to review and take action on outside of the application. Ensuring the dataset is devoid of PII is ultimately still your responsibility.
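One plausible way to flag fields for review is to compare column names against a list of restricted words. The sketch below assumes that approach; the release notes do not specify which methods the application actually uses. The word list is hypothetical except for `enum_name`, and `flag_columns` is an illustrative name.

```python
import re

# Hypothetical restricted-word list; only "enum_name" appears in the release notes.
RESTRICTED_WORDS = {"name", "phone", "address", "gps", "enum_name"}


def flag_columns(columns, restricted=RESTRICTED_WORDS):
    """Return the column names that may contain PII, in their original order."""
    flagged = []
    for col in columns:
        lowered = col.lower()
        # Split on non-alphanumeric characters so "respondent_phone" yields "phone".
        tokens = set(re.split(r"[^a-z0-9]+", lowered))
        if lowered in restricted or tokens & restricted:
            flagged.append(col)
    return flagged


print(flag_columns(["enum_name", "hh_size", "Respondent_Phone", "village"]))
# prints ['enum_name', 'Respondent_Phone']
```

A check like this only lists candidate fields. Reviewing them and acting on them still happens outside the application, as described above.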
Plans for future development are included in the issues: https://github.com/PovertyAction/PII_detection/issues and on Asana: https://app.asana.com/0/418411014871343/543165118458083
*This is compatible with Windows 10 as well, though a separate Windows 10 release with more features is intended.
This is the initial release of an executable GUI application for identifying PII within a dataset. It is completely untested on IPA PII-containing datasets due to a lack of data access.
As the initial release, it is expected that this will contain bugs and will not work for all users. Please share any issues or feedback you have resulting from use by filing an issue on GitHub or replying to this post on Chatter. Please do not share this application outside of IPA at this time.
This Windows 7* version does not contain many features included in releases for modern operating systems. Some features not included:
Instead, this version employs a number of methods to identify fields that may contain PII, and then it lists those fields for the user to review and take action on outside of the application. Ensuring the dataset is devoid of PII is ultimately still your responsibility.
Plans for future development are included in the issues: https://github.com/PovertyAction/PII_detection/issues and on Asana: https://app.asana.com/0/418411014871343/543165118458083
*This is compatible with Windows 10 as well, though a separate Windows 10 release with more features is intended.