Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add mapreduce config specs #7178

Merged
merged 4 commits into from
Aug 18, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
94 changes: 94 additions & 0 deletions mapreduce/assets/configuration/spec.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,94 @@
name: MapReduce
files:
- name: mapreduce.yaml
options:
- template: init_config
options:
- name: general_counters
description: |
`general_counters` are job agnostic metrics that create a metric for each specified counter
Create a an object with the following layout:

- counter_group_name: '<COUNTER_GROUP_NAME>'
counters:
- counter_name: 'MAP_INPUT_RECORDS'
- counter_name: 'MAP_OUTPUT_RECORDS'
- counter_name: 'REDUCE_INPUT_RECORDS'
- counter_name: 'REDUCE_OUTPUT_RECORDS'

For more information on counters visit the MapReduce documentation page:
https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapredAppMasterRest.html#Job_Counters_API /noqa
value:
example:
- counter_group_name: 'org.apache.hadoop.mapreduce.FileSystemCounter'
counters:
- counter_name: 'HDFS_BYTES_READ'
type: array
items:
type: object

- name: job_specific_counters
description: |
`job_specific_counters` are metrics that are specific to a particular job.
Create an object with the following layout:

- job_name: <JOB_NAME>
metrics:
- counter_group_name: <COUNTER_GROUP_NAME>
counters:
- counter_name: <COUNTER_NAME>

For more information on counters visit the MapReduce documentation page:
https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapredAppMasterRest.html#Job_Counters_API /noqa
value:
example:
- job_name: '<JOB_NAME>'
metrics:
- counter_group_name: 'org.apache.hadoop.mapreduce.FileSystemCounter'
counters:
- counter_name: 'FILE_BYTES_WRITTEN'
- counter_name: 'HDFS_BYTES_WRITTEN'
- counter_group_name: 'org.apache.hadoop.mapreduce.FileSystemCounter'
counters:
- counter_name: 'HDFS_BYTES_READ'
type: array
items:
type: object

- template: init_config/http
- template: init_config/default
- template: instances
options:
- name: resourcemanager_uri
description: |
The MapReduce check retrieves metrics from YARNS's ResourceManager. This
check must be run from the Master Node and the ResourceManager URI must
be specified below. The ResourceManager URI is composed of the
ResourceManager's hostname and port.

The ResourceManager hostname can be found in the yarn-site.xml conf file
under the property yarn.resourcemanager.address

The ResourceManager port can be found in the yarn-site.xml conf file under
the property yarn.resourcemanager.webapp.address
required: true
value:
example: http://localhost:8088
type: string
- name: cluster_name
description: A friendly name for the cluster.
required: true
value:
example: "<MAPREDUCE_CLUSTER_NAME>"
type: string
- template: instances/http
- template: instances/default

- template: logs
example:
- type: file
path: /usr/local/hadoop/logs/hadoop-root-namenode-localhost.log
source: mapreduce
- type: file
path: /usr/local/hadoop/logs/hadoop-root-datadnode-localhost.log
source: mapreduce
Loading