You are viewing the RapidMiner Studio documentation for version 9.5.

Read Azure Data Lake Storage (Cloud Connectivity)

Synopsis

This operator downloads the specified file from Microsoft Azure Data Lake Storage.

Description

After you have configured your Azure Data Lake Storage Gen1 account, you can load the Azure Data Lake Storage file with this operator.

Be aware that the operator does not deliver the file as an example set; it only returns a file object. You must therefore connect the Read Azure Data Lake Storage operator to an appropriate reader operator to parse the file. For example, to load an Excel file from your Azure Data Lake Storage, connect the file output port of the Read Azure Data Lake Storage operator to the Read Excel operator to see the result.
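The two-step pattern described above (first obtain a raw file, then parse it with a separate reader) can be illustrated outside RapidMiner with a minimal Python sketch. This is only an analogy and not the operator's implementation: `download_file` and `read_csv` are hypothetical names, and the download is simulated with fixed bytes instead of a real call to Azure Data Lake Storage.

```python
import csv
import io


def download_file(path: str) -> bytes:
    """Hypothetical stand-in for the download step: returns raw bytes only."""
    # Assumption: a real implementation would fetch `path` from the storage
    # account. Here the content is hard-coded so the sketch is self-contained.
    return b"id,name\n1,alpha\n2,beta\n"


def read_csv(raw: bytes) -> list:
    """Separate parsing step, analogous to connecting a Read CSV operator."""
    text = io.StringIO(raw.decode("utf-8"))
    return [dict(row) for row in csv.DictReader(text)]


raw_file = download_file("/data/example.csv")  # a file object, not yet tabular
rows = read_csv(raw_file)                      # tabular rows after parsing
print(rows)
```

The point of the split is that the download step knows nothing about the file format, so the same file object could be passed to a different reader for a different format.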

Input

  • connection (Connection)

    This input port expects a Connection object, if one is provided. See the connection_entry parameter for more information.

Output

  • file (File)

    The downloaded file object is returned here. This port must be connected to an appropriate reader operator, for example Read Excel or Read CSV.

  • connection (Connection)

    This output port delivers the Connection object from the input port. If the input port is not connected, this port delivers nothing.

Parameters

  • connection_source This parameter indicates how the connection should be specified. It offers two options: predefined and repository. The parameter is not visible if the connection input port is connected. Range: selection
  • connection_entry This parameter is only available when the connection_source parameter is set to repository. It specifies a repository location that represents a connection entry. The connection can also be provided using the connection input port. Range: string
  • connection This parameter is only available when the connection_source parameter is set to predefined. It specifies the connection details for the Azure Data Lake Storage Gen1 connection. If you have already configured an Azure Data Lake Storage Gen1 connection, you can select it from the drop-down list. If you have not configured one yet, select the icon to the right of the drop-down list and create a new Azure Data Lake Storage Gen1 connection in the Manage connections box. The account name and account key are required. Range: configurable
  • file Select the Azure Data Lake Storage file you want to download. Note that you need Read and Execute permissions on the root directory to be able to list its contents. Range: selection
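How the three connection-related parameters interact can be summarized with a short Python sketch. It is a reading of the parameter descriptions above, not RapidMiner code: the function `resolve_connection` is hypothetical, connections are represented as plain strings, and the rule that a connected input port takes precedence is an assumption inferred from connection_source being hidden in that case.

```python
from typing import Optional


def resolve_connection(
    port_connection: Optional[str],
    connection_source: str,
    connection_entry: Optional[str] = None,
    connection: Optional[str] = None,
) -> str:
    """Pick the connection to use, following the parameter descriptions."""
    # Assumption: a connected input port wins, since connection_source is
    # not visible when the port is connected.
    if port_connection is not None:
        return port_connection
    if connection_source == "repository":
        # connection_entry is only available for the repository option.
        if not connection_entry:
            raise ValueError("connection_entry is required for 'repository'")
        return connection_entry
    if connection_source == "predefined":
        # connection is only available for the predefined option.
        if not connection:
            raise ValueError("connection is required for 'predefined'")
        return connection
    raise ValueError("connection_source must be 'predefined' or 'repository'")


print(resolve_connection(None, "repository", connection_entry="//Repo/adls"))
```

The repository path `//Repo/adls` is a made-up placeholder used only to exercise the function.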