Filter Tokens (by POS Tags) (Text Processing)
Synopsis
Filters tokens based on their POS tags.
Description
Filters tokens according to the part-of-speech (POS) tags assigned to them. Tags follow the STTS tagset for German and the PENN tagset for English. The tags to keep are specified as a regular expression over the tag names. For example, with the PENN tagset the expression JJ.*|N.* keeps all adjectives and nouns.
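The matching rule described above can be sketched in a few lines of Python. This is only an illustration, not the operator's implementation: the function name filter_by_pos and the hand-tagged sample sentence are hypothetical, and the sketch assumes the expression has to match the whole tag rather than a part of it.

```python
import re

def filter_by_pos(tagged_tokens, expression):
    """Keep the tokens whose POS tag matches the expression.

    Assumption: the expression must match the entire tag,
    so re.fullmatch is used instead of re.search.
    """
    pattern = re.compile(expression)
    return [token for token, tag in tagged_tokens if pattern.fullmatch(tag)]

# Hand-tagged English sample using PENN tags (illustrative data only).
tagged = [
    ("The", "DT"), ("quick", "JJ"), ("fox", "NN"), ("jumps", "VBZ"),
    ("over", "IN"), ("lazy", "JJ"), ("dogs", "NNS"),
]

kept = filter_by_pos(tagged, r"JJ.*|N.*")
print(kept)  # ['quick', 'fox', 'lazy', 'dogs']
```

The trailing .* in each alternative is what lets one pattern cover a whole family of tags, for example NN and NNS for nouns.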
Input
- document
The document port.
Output
- document
The document port.
Parameters
- language
The language of the part-of-speech (POS) tagger to use.
- expression
The regular expression used for matching the POS tags (STTS for German and PENN for English).
- invert filter
Indicates whether the filter should be inverted.
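How the expression and invert filter parameters interact can be illustrated with a small Python sketch. It is not the operator's code: filter_tokens and its argument names are hypothetical, the German sample is hand-tagged with STTS tags, and it is assumed that inverting the filter keeps exactly the tokens whose tags do not match the expression.

```python
import re

def filter_tokens(tagged_tokens, expression, invert_filter=False):
    """Keep tokens whose tag matches the expression, or, when
    invert_filter is True, the tokens whose tag does not match.

    Assumption: the expression is matched against the whole tag.
    """
    pattern = re.compile(expression)
    result = []
    for token, tag in tagged_tokens:
        matches = pattern.fullmatch(tag) is not None
        if matches != invert_filter:
            result.append(token)
    return result

# Hand-tagged German sample using STTS tags (illustrative data only).
tagged_de = [
    ("Der", "ART"), ("schnelle", "ADJA"), ("Fuchs", "NN"), ("springt", "VVFIN"),
]

adjectives_and_nouns = filter_tokens(tagged_de, r"ADJ.*|N.*")
everything_else = filter_tokens(tagged_de, r"ADJ.*|N.*", invert_filter=True)
print(adjectives_and_nouns)  # ['schnelle', 'Fuchs']
print(everything_else)       # ['Der', 'springt']
```

Since STTS and PENN use different tag names, an expression written for one language generally has to be rewritten for the other: adjectives are ADJA and ADJD in STTS but JJ, JJR and JJS in PENN.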