Concept
Ambari metrics is a functional component in Ambari that is responsible for monitoring cluster status. It has some of the following key concepts:
Terminology |
Description |
Ambari Metrics System ("AMS") |
The built-in metrics collection system for Ambari. |
Metrics Collector |
The standalone server, collects metrics, aggregates metrics, serves metrics from the Hadoop service sinks and the MetR ICS Monitor. |
Metrics Hadoop Sinks |
Plugs into the various Hadoop components sinks to send Hadoop metrics to the metrics Collector. |
Metrics Monitor |
Installed on all host in the cluster to collect system-level metrics and forward to the metrics Collector. |
Simply put, Ambari collects two types of information on the collector:
1. "System level" metrics for each node
2. Metrics for each component of Hadoop
The former is collected by the metrics Monitor (which is the agent) installed on each node, which is collected by sink (the same conceptually and Flume sink) for a particular Hadoop component.
Finally add one by one points, collector is to use HBase to store metrics data.
Architecture
Configuration configuration Ambari metrics for distributed mode
The default installation is Ambari metrics is embedded mode, so that all collected data is stored locally on the Collector node, and a large amount of metrics data is crowding out a large amount of local storage, which is distributed after the metrics data is placed on the HDFs , so this is usually a necessary operation after installing Ambari. The specific operation can refer to: http://docs.hortonworks.com/HDPDocuments/Ambari-2.1.0.0/bk_ambari_reference_guide/content/_ Configuring_ambari_metrics_for_distributed_mode.html
Configuring the life cycle of metrics data
A large number of metrics occupy very large storage space, it is necessary to set the metrics data retention time (TTL), the control metrics data Retention time parameters in Ams-site.xml, the following are related configuration items:
Configuration Items |
Default Value |
Description |
Timeline.metrics.host.aggregator.ttl |
86400 |
1 minute resolution data purge interval. Default is 1 day. |
Timeline.metrics.host.aggregator.minute.ttl |
604800 |
Host based X minutes resolution data purge interval. Default is 7 days. (X = configurable interval, default interval is 2 minutes) |
Timeline.metrics.host.aggregator.hourly.ttl |
2592000 |
Host based hourly resolution data purge interval. Default is. |
Timeline.metrics.host.aggregator.daily.ttl |
31536000 |
Host based daily resolution data purge interval. Default is 1 year. |
Timeline.metrics.cluster.aggregator.minute.ttl |
2592000 |
Cluster wide minute resolution data purge interval. Default is. |
Timeline.metrics.cluster.aggregator.hourly.ttl |
31536000 |
Cluster wide hourly resolution data purge interval. Default is 1 year. |
Timeline.metrics.cluster.aggregator.daily.ttl |
63072000 |
Cluster wide daily resolution data purge interval. Default is 2 years. |
Copyright NOTICE: This article for Bo Master original article, without Bo Master permission not reproduced.
Ambari Metrics Introduction