You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 6 Next »

3.X Diagram In Progress

Use Case: Tagging Noun Phrases and UMLS concepts

Precondition:

  • Training Data: Genia, PenTree Bank, Mayo Source
  • Software: cTakes using features POS tagger & UMLS CUID extractor

Steps:

  1. Block of text is sent to cTakes
  2. cTakes processing
    1. start & end position of all POS tags
    2. part of speech
      1. Most interested in Nouns because of PHI
    3. Need Info: are cUIDS associated with WORDS or PHRASES?

Post-condition:

  • Input document (either medical note OR publication) will have POS tagged and UMLS CUIDs.

Use Case: Meta-analysis of text

Precondition:

  • Tagging Noun Phrases
  • Scubber configured (with or without local dictionary/regex mods)

Steps:

  1. Each "scrubber" implementation procudes Recorder output
    1. Passthrough Imp
      1. Regex
      2. Word lists
    2. cTakes Impl (OpenNLP)
      1. Noun Phrases
      2. UMLS cuids
  2. Performance evaluation (ROC)
    1. Scrubber standalone
    2. Scrubber word lists limited by detected noun phrases
    3. Scrubber word lists limited by detected noun phrases and non-UMLS concepts

Post-Condition

  • Text is processed by more than one algorithm "ham vs spam"
  • No labels