Bulk Reviewer is intended to help librarians, archivists, and others to identify, review, and remove sensitive files in directories and disk images. It is built using Django, Django Rest Framework, Celery, Django Channels, and Vue.js. Bulk Reviewer scans directories and disk images for personally identifying information (PII) and other sensitive information using bulk_extractor, a best-in-class digital forensics tool, and presents results in a review dashboard, enabling easier detection and dismissal of false positives. It provides the ability to generate CSV reports about inputs as well as the ability to export files from directories and disk images, separating problematic files from those that are free of sensitive information.
Initial development occurred while the author, Tim Walsh, was a 2018 Summer Fellow at the Library Innovation Lab at Harvard University. The application is currently under active development, and is still in the exploratory/prototype phase.
See more information on Github.