You are viewing the RapidMiner Go documentation for version 9.6 - Check here for latest version
RapidMiner Go
RapidMiner Go (AM) is a Spring Boot application.
Configuration
There are 2 levels of configuration for RapidMiner Go:
- Spring profiles
- environment variables
Using Spring profiles
As RapidMiner Go is a Spring Boot application, it can be configured using the SPRING_PROFILES_ACTIVE
environment variable in docker-compose-services.yml.
This is a comma separated list of the active spring profiles.
Available profiles:
Spring profile | Description |
---|---|
db-postgresql | RapidMiner Go uses PostgreSQL database |
db-h2 | RapidMiner Go uses H2 database |
exec-jc | Job Containers as execution back-end |
exec-js | Job Service as execution back-end |
broker-amq | AMQ for messaging |
The default value is db-postgresql,exec-jc,broker-amq
.
This means that the default database is PostgreSQL, and the default execution back-end is provided by Job Containers, and AMQ messaging is used between Automodel and the Job Container instances.
If no execution profile is set RapidMiner Go will run processes internally.
Using environment variables
On top of these profiles you can also specify environment variables -- these will override the default value of the profiles.
Table of default environment variables -- these are the minimum required with the above profiles:
Environment variable name | Description |
---|---|
DB_HOST | docker service for DB |
BASE_URL | external base URL used on the frontend - e.g. http://localhost:30000 |
RMID_BASE_URL | internal docker service URL for RMID -e.g. http://rapidminer-automodel-rmid:8080 |
AMQ_URL | AMQ URL |
AMQ_USERNAME | AMQ username |
AMQ_PASSWORD | AMQ password |
AUTH_SECRET | authentication secret used between services -- i.e AM and RMID |
AUTOMODELER_FRONTEND_TRACKING_ENABLED | enable event tracking service (internal analytics) |
JOB_QUEUE | AMQ job queue name |
JOB_STATUS_QUEUE | AMQ status queue name |
JOB_COMMAND_TOPIC | AMQ topic name |
Further useful environment variables:
Environment variable name | Description |
---|---|
AUTOMODELER_FRONTEND_CLOUDAPPS_DROPBOXAPPID | Dropbox application id for importing files from cloud |
AUTOMODELER_FRONTEND_CLOUDAPPS_ONEDRIVEAPPID | Microsoft One Drive application id for importing files from cloud |
AUTOMODELER_FRONTEND_GOOGLEANALYTICSID | Google Analytics id |
SPRING_SERVLET_MULTIPART_MAX_FILE_SIZE | Maximal Data file size has to be given in bytes upto 200MB |
AUTOMODELER_FILESTORE_MIN_ROW_LIMIT | Minimal no. of data rows for more reliable predictions (default 50) |
AUTOMODELER_EXECUTION_TASK_LIMIT_PER_USER | The no. of parallel task that can be submitted by a single user (default 4) |
AUTOMODELER_EXECUTION_QUEUE_LIMIT_PER_USER | The no. of modeling tasks that jobContainers can pick up concurrently. Note that this ultimately should match the no. of jobContainer instances running simultaneously (default 1) |
AUTOMODELER_EXECUTION_MAXIMUM_TOTAL_RESULT_SIZE | Maximal size of RapidMiner result set given in MB |
_JAVA_OPTIONS | Java options directly passed to the JVM. Use this to configure memory settings and others |
Licensing
The license determines how many data rows can be uploaded. The constraints for different license types are listed below:
Edition | Data rows | Logical processors | Memory |
---|---|---|---|
Free | 10.000 | 1 | Unlimited (due to 10.000 row limit) |
Unit | - | 8 / unit | 64 GB / unit |