Projects

Brunnhilde

A Python command-line and GUI reporting tool for directories and disk images. Brunnhilde bundles tools such as Siegfried, ClamAV, bulk_extractor, The Sleuthkit (TSK)‘s tsk_recover, HFS Explorer, and fiwalk to streamline appraisal and minimal processing of digital archives.

Brunnhilde was written to supplement existing file format identification and characterization tools, with a focus on producing human-readable high-level reports for digital archives and digital preservation practitioners. It is installed as a standard utility in the BitCurator environment.

> Links: Github

Bulk Reviewer

Bulk Reviewer is a software program that aids in identification, review, and removal of sensitive files in directories and disk images. Bulk Reviewer scans directories and disk images for personally identifiable information (PII) and other sensitive information using bulk_extractor, a best-in-class digital forensics tool, and can optionally extract named entities (personal names as well as nationalities, religions, and political affiliations) using spaCy and Apache Tika. A browser application enables users to configure, start, and review scans, generate reports, and export files, separating problematic files (e.g., those requiring redaction or further review) from those that are free of sensitive information.

Bulk Reviewer consists of two separate applications that communicate via websockets and a REST API:

> Links: Github

METSFlask

A Flask web application for human-friendly exploration of Archivematica METS files.

> Links: Github, Live demo

CCA Tools (e.g. Disk Image Processor)

Digital archives processing tools intended for use in BitCurator, an open source Ubuntu-derived digital forensics environment for digital forensics and digital archiving.

> Links: Github

addext

Command-line Python utility to add file extensions to files without them, based on DROID/Siegfried PRONOM-based file format identification.

> Links: Github

Independent Study: Access to Born-Digital Architectural Records (Spring 2015)

This independent study supplemented other coursework taken in the MS in Library and Information Science program at Simmons College. The study focused on preservation and access for digital architecture and design records such as computer aided design (CAD) and Building Information Modeling (BIM), and resulted in a 49-page paper on the subject entitled “Preservation and Access of Born-Digital Architectural Design Records in an OAIS-Type Archive.”

> Links: Reading list and paper