Categories

Versions

You are viewing the RapidMiner Go documentation for version 10.0 - Check here for latest version

RapidMiner Go

RapidMiner Go (AM) is a Spring Boot application.

Configuration

There are 2 levels of configuration for RapidMiner Go:

  • Spring profiles
  • environment variables

Using Spring profiles

As RapidMiner Go is a Spring Boot application, it can be configured using the SPRING_PROFILES_ACTIVE environment variable in docker-compose-services.yml. This is a comma separated list of the active spring profiles.

Available profiles:

Spring profile Description
db-postgresql RapidMiner Go uses PostgreSQL database
on-prem On prem installation

The default value is db-postgresql,on-prem. This means that the default database is PostgreSQL. The on-prem installation implies that Go uses an on-prem license for the whole product.

Using environment variables

On top of these profiles you can also specify environment variables -- these will override the default value of the profiles.

Table of default environment variables -- these are the minimum required with the above profiles:

Environment variable name Description
DB_HOST docker service for DB
BASE_URL external base URL used on the frontend - e.g. https://localhost:30000
RMID_BASE_URL internal docker service URL for RMID -e.g. http://rapidminer-automodel-rmid:8080
AMQ_URL AMQ URL
AMQ_USERNAME AMQ username
AMQ_PASSWORD AMQ password
AUTH_SECRET authentication secret used between services -- i.e AM and RMID
AUTOMODELER_FRONTEND_TRACKING_ENABLED enable event tracking service (internal analytics)
JOB_QUEUE AMQ job queue name
JOB_STATUS_QUEUE AMQ status queue name
JOB_COMMAND_TOPIC AMQ topic name

Further useful environment variables:

Environment variable name Description
AUTOMODELER_FRONTEND_CLOUDAPPS_DROPBOXAPPID Dropbox application id for importing files from cloud
AUTOMODELER_FRONTEND_CLOUDAPPS_ONEDRIVEAPPID Microsoft One Drive application id for importing files from cloud
AUTOMODELER_FRONTEND_GOOGLEANALYTICSID Google Analytics id
SPRING_SERVLET_MULTIPART_MAX_FILE_SIZE Maximal Data file size has to be given in bytes upto 200MB
AUTOMODELER_FILESTORE_MIN_ROW_LIMIT Minimal no. of data rows for more reliable predictions (default 50)
AUTOMODELER_EXECUTION_TASK_LIMIT_PER_USER The no. of parallel tasks that can be submitted by a single user (default 4). A task can be an analysis or a prediction. A task consists of multiple jobs.
AUTOMODELER_EXECUTION_QUEUE_LIMIT_PER_USER The no. of jobs per user JobContainers will pick up concurrently. This number should be less than or equal the no. of JobContainer instances running simultaneously (default 1)
AUTOMODELER_EXECUTION_MAXIMUM_TOTAL_RESULT_SIZE Maximal size of RapidMiner result set given in MB
AUTOMODELER_DATA_IMPORT_MINIMUM_ATTRIBUTE_COUNT Minimum number of columns required for training
AUTOMODELER_DATA_IMPORT_MAXIMUM_ATTRIBUTE_COUNT Maximum number of columns accepted for training

Licensing

The application depends on the license file at licenses/rapidminer-go-on-prem directory - if this is not present uploading data will fail. This folder is automatically mounted into the file system of every RapidMiner Go and Job Container instance - so there's no need to copy it manually.