Categories

Versions

Read Document (Text Processing)

Input

  • file (File)

    The file port.

Output

  • output

    The output port.

Parameters

  • fileName of the file to read the data from.
  • extract text onlyIf checked, structural information like xml or html tags will be ignored and discarded.
  • use file extension as typeIf checked, the type of the files will be determined by their extensions. Unknown extensions will be treated as text files.
  • content typeThe content type of the input texts
  • encodingThe encoding used for reading or writing files.