Scrubber 3.0 Beta Released!

Scrubber now uses Apache cTAKES to provide parallel concept extraction during de-identification. Apache cTAKES graciously invited us to port the Scrubber de-identification pipeline to the Apache-hosted codebase. The 2.X maintenance version will remain available. The publication describing this work has been accepted with minor revision; this site will be updated shortly to reflect the described methods and results.

McMurry* AJ, Fitch* B, Savova G, Kohane IS, Reis BY. “Improved de-identification of physician notes through integrative modeling of both identifying and non-identifying medical text.” BMC Medical Informatics and Decision Making. Accepted with minor revision, Jan 2013.

...



Motivation

"Free text" medical notes contain information that can be used to locate human biospecimens and even predict patient outcomes.
Because medical notes often contain Protected Health Information (PHI), it is necessary to "scrub" notes of sensitive information before sharing them with a clinical investigator. Toward this goal, we have developed Open Source software that removes PHI from raw text, XML, or databases.
The software has been approved for use by numerous hospital IRBs, and has been manually reviewed by physician experts.

...

This new approach uses machine learning to analyze similarities and differences between physician notes, medical dictionaries, and medical journal publications.

Notice: Scrubber 3.X Beta Release

...

Scrubber now uses Apache cTAKES to provide parallel concept extraction during de-identification. Apache cTAKES graciously invited us to port the Scrubber de-identification pipeline to the Apache-hosted codebase. The 2.X maintenance version will remain available, as will the 3.0 release candidate. The publication describing this work has been accepted with minor revision; this site will be updated shortly to reflect the described methods and results.

...