Categories

Versions

Filter Tokens (by POS Tags) (Text Processing)

Synopsis

Filters tokens based on their POS tags.

Description

Filter tokens based on the specified types of POS tags. The possible POS tags are in STTS system for German tagging and in PENN system for English tagging and are defined by a regular expression of types. For example: the expression JJ.*|N.* would keep all adjectives and nouns.

Input

  • document

    The document port.

Output

  • document

    The document port.

Parameters

  • languageThe language for the used part of speech (POS) tagger.
  • expressionThe regular expression used for matching the POS tags (STTS for German and PENN for English).
  • invert filterIndicates if the filter should be inverted.