Categories

Versions

You are viewing the RapidMiner Studio documentation for version 9.3 - Check here for latest version

Add to Solr (Documents) (Solr)

Synopsis

This operator adds collections of documents to Solr.

Description

To connect to a Solr server, you have to specify a Solr connection. This comprises the URL of a Solr server and an optional user/password combination for authentication. Typically, the Solr server URL ends with the string '/solr'.

The next step is to select a collection on the server. A collection can be imagined as a table. It is composed of several columns, which are called Solr fields. A Solr field has a type (e.g. number) and a key (the name of the column). Each entry in Solr can be imagined as a row and contains values for the respective fields.

A RapidMiner document has a set of metadata records, which consist of a key and a related value. The metadata keys are mapped to the Solr attributes. RapidMiner documents have an additional body. Therefore you can select a Solr field, in which the document body will be stored.

Input

  • connection (Connection)

    This input port expects a Connection object if any. See the parameter connection entry for more information.

  • documents (Collection)

    This port connects a collection of documents, which has to be added. This port is extendable.

Output

  • connection (Connection)

    This output port delivers the Connection object from the input port. If the input port is not connected the port delivers nothing.

  • documents (Collection)

    The added collection of documents are provided at this port. This port is extendable.

Parameters

  • connection_source This parameter indicates how the connection should be specified. It gives you two options, predefined and repository. The parameter is not visible if the connection input port is connected. Range: selection
  • connection_entry This parameter is only available when the connection source parameter is set to repository. This parameter is used to specify a repository location that represents a connection entry. The connection can also be provided using the connection input port. Range: string
  • connection This parameter is only available when the connection source parameter is set to predefined. The connection details for the Solr connection have to be specified. If you have already configured a Solr connection, you can select it from the drop-down list. If you have not configured a Solr connection yet, select the icon to the right of the drop-down list. Create a new Solr connection in the Manage connections dialog. The Solr server URL is required. Additionally, you can specify a username/password combination for authentication. Range: configurable
  • collection Provide the name of the Solr collection, which has to be used to access data. Range: string
  • document_body_field The Solr field, which is used for the RapidMiner document body. Range: string