Configuration item |
Configuration instructions |
Storm. zookeeper. Servers |
Zookeeper Server LIST |
Storm. zookeeper. Port |
Zookeeper connection Port |
Storm. Local. dir |
Local file system directory used by storm (must exist and the storm process can be read and written) |
Storm. Cluster. Mode |
Storm cluster running mode ([distributed | Local]) |
Storm. Local. mode. zmq |
Whether to use zeromq as the message system in local mode. If it is set to false, the Java Message System is used. The default value is false. |
Storm. zookeeper. Root |
Location of the storm root directory in zookeeper |
Storm. zookeeper. session. Timeout |
Client Connection zookeeper timeout |
Storm. ID |
The ID of the running topology, which consists of storm name and a unique random number. |
Nimbus. Host |
Nimbus server address |
Nimbus. Thrift. Port |
Nimbus's thrift listening port |
Nimbus. childopts |
Specifies the JVM option for the nimbus process during storm-deploy project deployment. |
Nimbus. task. Timeout. secs |
Heartbeat timeout. After the timeout, nimbus considers the task dead and assigns it to another address. |
Nimbus. Monitor. freq. secs |
Nimbus checks the interval between heartbeat and reallocation tasks. Note that if the machine goes down, nimbus takes over and processes it immediately. |
Nimbus. Supervisor. Timeout. secs |
The heartbeat timeout of the supervisor. Once Nimbus is exceeded, the supervisor is deemed dead and stops distributing new tasks for it. |
Nimbus. task. Launch. secs |
A special timeout setting for task startup. This value will be used to temporarily replace nimbus. task. Timeout. secs. |
Nimbus. Reassign |
Whether Nimbus re-allocates and executes tasks when they fail. It is true by default and is not recommended to be modified. |
Nimbus. file. Copy. expiration. secs |
Nimbus determines the upload/download link time-out. When the idle time exceeds this setting, nimbus considers the Link Dead and actively disconnects it. |
Ui. Port |
Storm UI service port |
DRPC. Servers |
DRPC server list, so that drpcspout knows who to communicate |
DRPC. Port |
Storm DRPC service port |
Supervisor. Slots. Ports |
List of ports that can run workers on the supervisor. each worker occupies one port, and each port runs only one worker. this configuration allows you to adjust the number of workers running on each machine. (adjust the number of slots per server) |
Supervisor. childopts |
Used in the storm-deploy project to configure JVM options for the supervisor daemon |
Supervisor. Worker. Timeout. secs |
Worker heartbeat timeout in the supervisor. Once the supervisor times out, the supervisor tries to restart the worker process. |
Supervisor. Worker. Start. Timeout. secs |
When the supervisor is initially started, the worker's heartbeat timeout time is reached. When this time is exceeded, the supervisor tries to restart the worker. Due to the additional consumption caused by the Initial Startup and configuration of JVM, the first heartbeat will exceed the settings of supervisor. Worker. Timeout. secs. |
Supervisor. Enable |
Whether the supervisor should run the workers assigned to him. The default value is true. This option is used for storm unit testing and should not be modified. |
Supervisor. Heartbeat. frequency. secs |
Supervisor heartbeat sending frequency (how often) |
Supervisor. Monitor. frequency. secs |
How often does the supervisor check the worker heartbeat? |
Worker. childopts |
The JVM option used by the supervisor to start the worker. All the "% ID %" strings will be replaced with the corresponding worker identifier. |
Worker. Heartbeat. frequency. secs |
Interval of Heartbeat sending of worker |
Task. Heartbeat. frequency. secs |
Interval of Heartbeat tasks |
Task. Refresh. Poll. secs |
The frequency of link synchronization between tasks and other tasks. (if the task is re-allocated, other tasks need to refresh the connection to send messages to it ). generally, other tasks will be notified when the reallocation occurs. This configuration is only used to prevent notifications. |
Topology. Debug |
If it is set to true, storm records each piece of information that is sent. |
Topology. Optimize |
Whether the master can optimize topologies by running multiple tasks in a single thread at an appropriate time. |
Topology. Workers |
Number of processes that should be started when the topology cluster is executed. Each process executes a certain number of tasks. Topology components in the thread mode to optimize the performance based on the parameter and concurrency prompt. |
Topology. ackers |
Number of Acker tasks started in topology. acker stores the records of tuples sent by spout and detects when tuple is fully processed. when Acker detects that tuple has been processed, it sends a confirmation message to spout. generally, the number of ackers should be determined based on the topology throughput, but generally it does not need too much. when the value is set to 0, message reliability is disabled. Storm will confirm the message immediately after spout sends tuples. |
Topology. Message. Timeout. secs |
The maximum timeout time for processing spout messages in topology. If a message is not successful in this time window, storm will notify spout that the message fails. Some spouts enable the failed message replay function. |
Topology. kryo. Register |
List of serialization solutions registered to kryo (the underlying serialization framework of storm). the serialization solution can be a class name or the implementation of COM. esotericsoftware. kryo. serializer. |
Topology. Skip. Missing. kryo. registrations |
Storm should skip the unrecognized kryo serialization scheme. If it is set to no, the task may fail to be loaded or an error will be thrown during runtime. |
Topology. Max. task. Parallelism |
Maximum degree of parallelism allowed in a topology. This configuration is mainly used to test the thread count limit in local mode. |
Topology. Max. spout. Pending |
The maximum number of tuples in a spout task in the pending state. This configuration is applied to a single task, rather than the entire spouts or topology. |
Topology. state. synchronization. Timeout. secs |
Maximum timeout value of the component synchronization status Source (retain option, not used currently) |
Topology. Stats. sample. Rate |
Tuples sampling percentage used to generate task statistics |
Topology. Fall. back. On. java. serialization |
Whether Java serialization scheme is used in Topology |
Zmq. threads |
Number of threads used for zeromq communication in each worker process |
Zmq. Linger. millis |
When the connection is closed, the link tries to resend the duration of the message to the target host. This is an uncommon advanced option and can be ignored. |
Java. Library. Path |
Java. Library. Path setting when JVM is started (such as Nimbus, supervisor, and workers). This option tells JVM in which paths to locate the local library. |