Automatic Job Cleanup

If you’re not on 9.10.4, but any version from 9.10.0 to 9.10.3, please head over to the older documentation for Job Cleanup.

RapidMiner Server automatically saves information related to recently executed jobs. This includes the user who triggered the execution, the job's state, the queue on which the job has been executed and also the date on which the process was executed - this information can be reviewed on the Executions page within RapidMiner Server. Additionally, the Job Agent which was responsible for executing the job, created a dedicated working directory for it.

Those stored pieces of information and also the working directories can grow large. In order to avoid this, RapidMiner Server provides a job cleanup mechanism to wipe old jobs. If you like to configure it, the following properties need to be added to the file within the <rapidminer-home>/configuration folder of your RapidMiner Server home directory.

  1. jobservice.scheduled.archive.jobCleanup.enabled: Enable the job cleanup.

  2. jobservice.scheduled.archive.jobCleanup.cronExpression: This property defines the point in time when the automatic job cleanup will be executed with the help of a cron expression. By default, the cleanup task is configured to run hourly with the cron expression 0 0 * * * *. It follows the cron pattern <second> <minute> <hour> <day> <month> <weekday>. So 0 */30 * * * * would run the job cleanup every 30 minute whereas 0 0 0 * * * would run it daily.

  3. jobservice.scheduled.archive.jobCleanup.maxAge: This property defines the maximum age of jobs in minutes. By default, this is not configured. If configured, jobs older than the value set in the property will be cleaned up. Set this to any arbitrary number greater than zero. Please note that the property value needs to reflect the execution time. If you like to clean up all jobs which are older than 10 minutes, you also need to run the job every 10 minutes.