Note : Scrubber 3.X is being ported to Apache cTAKES, this is an interim BETA release.

1. Intended usages

1.1 Default configuration

We recommend starting with the default properties and prebuilt train/test models.
The train and test models are anonymized feature sets generated by scrubber runtime (NOT text).

scrubber.properties : all supported config options and features in one place.

Apache UIMA, Apache cTAKES, and WEKA distribution jars are loaded dynamically.

1.2 Customize NLP pipeline

1.3 Customize Classifier

2. Software Features

2.1 Annotation

2.2 Models

2.3 Classification

2.4 Compare Text

3. How To

3.X Install / Train / Test / Scrub

Scrubber Property KEY = VALUE

4. scrubber.properties

4.1 Java Object

4.2 Java Template

4.3 Shell scripts

4.4 Shell UnitTest