Read Document (Text Processing)
Input
- file (File)
The file port.
Output
- output
The output port.
Parameters
- file: Name of the file to read the data from.
- extract text only: If checked, structural information such as XML or HTML tags will be ignored and discarded.
- use file extension as type: If checked, the type of each file will be determined by its extension. Unknown extensions will be treated as text files.
- content type: The content type of the input texts.
- encoding: The encoding used for reading or writing files.
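
The way these parameters interact can be illustrated with a minimal Python sketch. This is not the operator's actual implementation; the function name read_document, the EXTENSION_TYPES mapping, and the default values are all assumptions made for illustration. It shows the type being picked from the file extension (falling back to plain text for unknown extensions), the file being decoded with the given encoding, and tags being discarded when only the text is wanted.

```python
from html.parser import HTMLParser
from pathlib import Path

# Hypothetical extension-to-type mapping (an assumption, not the operator's list).
EXTENSION_TYPES = {".txt": "txt", ".html": "html", ".htm": "html", ".xml": "xml"}


class _TagStripper(HTMLParser):
    """Collects character data and drops every tag."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(markup):
    """Return the text content of markup with all tags removed."""
    parser = _TagStripper()
    parser.feed(markup)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


def read_document(file, extract_text_only=True, use_file_extension_as_type=True,
                  content_type="txt", encoding="utf-8"):
    """Read a file as text, loosely mirroring the parameters listed above."""
    path = Path(file)
    if use_file_extension_as_type:
        # Unknown extensions are treated as plain text.
        content_type = EXTENSION_TYPES.get(path.suffix.lower(), "txt")
    text = path.read_text(encoding=encoding)
    if extract_text_only and content_type in ("html", "xml"):
        text = strip_tags(text)
    return text
```

In this sketch, a file with an unrecognized extension keeps any markup it contains, because it is read as plain text and tag stripping only applies to the html and xml types.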