DICAT is a simple graphical tool that facilitates DICOM (Digital Imaging and Communications in Medicine) de-identification directly on a local workstation. It was designed to work on all major operating systems (Windows, Linux and OSX) and is very light in terms of dependencies (Python). Binaries (with no dependencies) have been compiled for all operating systems.
With the increasing use of web-based database systems, such as LORIS (Das et al., 2011, Das et al., 2016), for large scale imaging studies, de-identification of DICOM datasets becomes a requirement before they can be uploaded in such databases.
Typical Flow Chart of DICOM de-identification.
Before DICOM datasets can be uploaded into a web-based database, identifying information stored in the DICOM header (such as patient name, date of birth) should be removed.
DICAT produces two archival outputs: a back-up of the original DICOM files, and a de-identified DICOM dataset that can then be uploaded or transferred to other systems.
DICAT also features an ID key log that can be used to keep a record of the original candidate name (participant/patient) linked to their anonymized study identifier, for reference by study coordinators.
DICAT was first developed during the 2014 and 2015 brainhacks held at the Organization of Human Brain Mapping (OHBM) conferences.
Installation instructions vary depending on the operating system used. See below for detailed information.
Running DICAT will open a window with three different tabs:
- A simple "Welcome to DICAT" tab giving a short description of the tool
- A "DICOM de-identifier" tab, in which de-identification of DICOMs will take place
- An "ID Key" tab, containing the key between candidate's names and their IDs
Welcome page of DICAT.
Executables of DICAT have been created for most systems and can be found in the executables
directory of the Github repository. They will be organized by versions of DICAT and operating systems. Download the executable relevant to your system and move it to any folder of your choice.
To open DICAT, simply double click on the executable.
To install DICAT source code on a computer, download and save the content of the current Github repository into a workstation.
Before running DICAT, make sure your systems contains a Python compiler with the TkInter library (usually, TkInter comes by default with most Python installations).
For Ubuntu distributions, TkInter can be installed via apt-get:
sudo apt-get install python-tk
The PyDICOM package is also required by DICAT.
DICAT can be started by executing DICAT.py
script with a Python compiler. On UNIX computers (Linux and Mac OS X), open a terminal, go to the main directory of DICAT source code (dicat
directory) and run the following:
python DICAT.py
DICOM de-identification with DICAT.
In the "DICOM de-identifier" tab (1), use the select button (2) to choose a directory containing DICOM files to de-identify.
Once a directory containing DICOM files have been selected (as described in the above section), the DICOM fields can be viewed when clicking on the “View DICOM fields” button (3).
The DICOM fields will be displayed in a table with editable fields in black (4) and non-editable fields greyed out (5). The non-editable fields will be replaced by empty strings in the DICOM files when running the de-identification, while the editable fields will be replaced by the value entered by the user. By default, editable fields are “PatientName”, “PatientBirthDate” and “PatientSex”.
The “Clear” button (6) will erase values from all editable fields.
Finally, once the user has finalized the edits, clicking on the “De-identify” button (7) will run the de-identification tool on the DICOM dataset.
The ID Key feature of DICAT allows storage of the key between identifiable candidates's information (Real Name and Date of Birth) and its study’s identifier. This information will be stored locally on the workstation within an XML file (candidate.xml) in DICAT's directory. See the following figure for detailed information on how to use this feature.
ID key feature of DICAT.
This feature (1) allows storage of the mapping information between candidates’s information and study IDs. This information will be stored in an XML file that can be either created (2) or opened (3). Changes will be automatically saved.
A candidate (participant/patient) can be looked up using the “Search candidate” button (5) after having entered either the “Identifier” or the “Real Name” text fields available in (4).
The “Clear fields” button (6) allows clearing the text in those text fields.
A new candidate can be registered using the “Add candidate” button (7) after having entered the “Identifier”, “Real Name” and “Date of birth” information in the text fields of (4).
Clicking on a subject row (8) of the table displayed at the bottom of the application will automatically populate the text fields (4) with the information of the candidate.
The “Real Name” or “Date of birth” of that candidate can be edited if needed by altering the field and clicking on the “Edit candidate” button (9).
Finally, the data table of candidate is sortable by clicking on any of the column headers (10).
Ayan Sengupta [email protected] - Concept, DICOM-toolkit implementation, Pydicom implementation
Cecile Madjar [email protected] - GUI implementation, PyDICOM implementation, python integration of DICOM-toolkit, ID key
Dave MacFarlane [email protected] - ID Key
Samir Das [email protected] - Concept and guidance
Daniel Krötz [email protected] - Documentation, testing on Windows
Christine Rogers [email protected] - Documentation
Leigh Evans [email protected] - Video tutorial
Derek Lo [email protected] - Logo design