Page History

Table of Contents

	outlinh1. true
	outlinh1. true
1	printablefalse
2	stylh1. none
3	indent20px

Functioning of the Watchdog (heart_beat_watchdog_thread)

A Watchdog is started automatically each time a JobScheduler is started as part of a Cluster. Each Watchdog runs as a seperate thread alongside its respective JobScheduler and monitors that JobScheduler's heartbeat. The Watchdog stops its JobScheduler if the JobScheduler's heartbeat is missing for a predefined length of time.

...

This behaviour cannot be configured as it is an "emergency" procedure to ensure the reliable functioning of the cluster.

Possible reasons for a missing heartbeat

Database problems
Problems with the SMTP mail server
DNS problems
A heavily overload computer (e.g. lack of memory)
A change in system time

Ausgaben im scheduler.log

JobScheduler determines that its own heartbeat is missing 31 seconds after it was due. The warning is issued after a further delay of 3 seconds. The maximum delay that is tollerated is 55 seconds.

...

Space shortcuts

Page tree

Versions Compared

Old Version 5

New Version 6

Key

Functioning of the Watchdog (heart_beat_watchdog_thread)

Possible reasons for a missing heartbeat

Ausgaben im scheduler.log