Wiki Spaces
Documentation
Projects
Resources
Get Help from Others
Q&A: Ask OpenMRS
Discussion: OpenMRS Talk
Real-Time: IRC Chat | Slack
At times you will want to demonstrate the OpenMRS web application to people outside of your organization. Displaying real patient names would not be very appropriate.
OpenMRS has a built in way to simply obscure all patient's names in the system using your runtime properties:
obscure_patients=false obscure_patients.given_name=John obscure_patients.middle_name=P. obscure_patients.family_name=Smith
When you need to have (nearly) complete HIPAA compliant anonymity of patients and data you need to do a lot of scrubbing to the underlying database.
This archive:anonymizing sql script will scramble your patient names, patient addresses, locations, and any dates stored against patients. User names and passwords will be reset to username-"id" / test.
Do not run this on a live database. Only run this on a copy. Accurate data is impossible to recover from this.
This anonymization may not be fully HIPAA compliant. It is very difficult to truly anonymize (de-identify) data such that someone could not re-identify a patient. People (and a growing number of tools) can come up with clever ways of figuring out identities from supposedly anonymous data. Any free text (comments on observations, observations with text values, or any user-entered text anywhere else in the database) could accidentally reveal a patient's identity. Whenever possible, avoid sharing patient data publicly (even if you think you have anonymized it). If you want/need to share data publicly, be extremely vigilant in ensuring that the data are truly anonymized.
Ideally, all protected health information should be anonymized, including:
In OpenMRS, this means that not only names and birth dates get scrubbed, but all dates (including dates on encounters & observations) should be stripped of month & day and any internal identifiers (e.g., patient_id, person_id, etc.) must be randomly changed. Identifiers and dates should not be adjusted relative to their original value (e.g., adding or substracting a specific number), since patterns (like differences between dates or identifiers) could be used to re-identify patients.
1 Comment
Burke Mamlin
FYI – There was a GSoC 2013 project (De-Identified Patient Data Export) on creating de-identified data and I believe Mike Seaton made a tool to assist with extract data into a de-identified data set. If you are working in this area, it might be worth looking into those past efforts to see if there is any way to build on them or at least learn from them.