Categories

Versions

Write Google BigQuery (Cloud Connectivity)

Synopsis

This operator writes an ExampleSet to a GBQ database.

Description

The Write GBQ operator is used for writing an ExampleSet to the specified GBQ database. You need to have at least basic understanding of databases and database connections in order to use this operator properly. Go through the parameters and the attached Example Process to understand the flow of this operator.

The user can specify the database connection and a table name. Please note that the table will be created during writing if it does not exist. The most convenient way of defining the necessary parameters is the Manage Database Connections wizard. The most important parameters (database URL and user name) will be automatically determined by this wizard. At the end, you only have to define the table name. This operator only supports the writing of the complete ExampleSet consisting of all regular and special attributes and all examples. If this is not desired, perform some preprocessing operators like the Select Attributes or Filter Examples operators before applying the Write Google BigQuery operator. Data from database tables can be read by using the Read Database operator.

Input

  • input (Data table)

    This input port expects an ExampleSet. It is output of the Retrieve operator in the attached Example Process.

  • connection (Connection)

    This input port expects a Connection object if any. See the parameter connection entry for more information.

Output

  • through (Data table)

    The ExampleSet that was provided at the input port is delivered through this output port without any modifications. This is usually used to reuse the same ExampleSet in further operators of the process.

  • connection (Connection)

    This output port delivers the Connection object from the input port. If the input port is not connected the port delivers nothing.

Parameters

  • connection_entry Indicates how the database connection should be specified. Range: string
  • schema_name Name of the schema. Range: string
  • table_nameThis parameter is used for selecting the required table from the specified database. Please note that you can also write a table name here, if the table does not exist it will be created during writing. Range: string
  • overwrite_modeThis parameter indicates if an existing table should be overwritten or data should be appended to the existing data. Range: selection
  • batch_sizeThis parameter specifies the number of rows which are written at once with one single query to the database. Larger values can greatly improve the speed. However, BigQuery has restrictions on the maximum number of values written at once. Range: integer
  • set_default_varchar_lengthThis parameter allows you to set varchar columns to default length. Range: boolean
  • default_varchar_lengthThis parameter is only available when the set default varchar length parameter is set to true. This parameter specifies the default length of varchar columns. Range: integer