Datadog Agent 6 contains a large number of changes. While we attempted to make it a drop in replacement, there were a small number of deprecations or changes in behavior which will be listed in this document. For a list of features that haven't been ported, see this doc.
Note: If you see anything that's incorrect about this document, do not hesitate to open an issue or submit a Pull Request.
- Configuration Files
- GUI
- CLI
- Logs
- APM agent
- Process agent
- Docker
- Kubernetes
- Autodiscovery
- Python Modules
- JMX
- GCE hostname
Prior releases of Datadog Agent stored configuration files in /etc/dd-agent
.
Starting with the 6.0 release configuration files will now be stored in
/etc/datadog-agent
.
In addition to the location change, the primary agent configuration file has been
transitioned from INI format to YAML to better support complex configurations and
for a more consistent experience across the Agent and the Checks; as such datadog.conf
is now retired in favor of datadog.yaml
.
To automatically transition between agent configuration paths and formats, you
may use the agent command: sudo -u dd-agent -- datadog-agent import
.
The command will parse an existing datadog.conf
and convert all the bits that
the new Agent still supports to the new format, in the new file. It also copies
configuration files for checks that are currently enabled.
Please refer to this section of the documentation for a detailed list of the configuration options that were either changed or deprecated in the new Agent.
In order to provide a more flexible way to define the configuration for a check,
from version 6.0.0 the Agent will load any valid YAML file contained in the folder
/etc/datadog-agent/conf.d/<check_name>.d/
.
This way, complex configurations can be broken down into multiple files: for example,
a configuration for the http_check
might look like this:
/etc/datadog-agent/conf.d/http_check.d/
├── backend.yaml
└── frontend.yaml
The agent won't load configuration files from any sub-directories within a
<check_name>.d
folder. This means this configuration will NOT be loaded:
/etc/datadog-agent/conf.d/http_check.d/prod.d/
├── backend.yaml
Autodiscovery template files will be stored in the configuration folder as well,
for example this is how the redisdb
check configuration folder looks like:
/etc/datadog-agent/conf.d/redisdb.d/
├── auto_conf.yaml
└── conf.yaml.example
The yaml files within a <check_name.d>
folder can have any name, as long as
they have a .yaml
or .yml
extension. Only their content and the folder name
matter.
To keep backwards compatibility, the Agent will still pick up configuration files
in the form /etc/datadog-agent/conf.d/<check_name>.yaml
but migrating to the
new layout is strongly recommended.
When running the agent in a container, it is also possible to set some of the configuration through environment variables. The environment variables used in the agent 6 are different from those available in agent 5.
To not miss some specific configuration details, if you are currently using environment variables to setup your agent's v5 configuration, consult the new list of environment variables for Agent v6.
The agent proxy settings can be overridden with the standard *_PROXY
environment variables:
HTTP_PROXY
: an http URL to use as a proxy forhttp
requests.HTTPS_PROXY
: an http URL to use as a proxy forhttps
requests.NO_PROXY
: a comma-separated list of URLs for which no proxy should be used.
Notice: these variables don't use the DD_
prefix.
The behaviour of the Agent 6 is different from Agent 5:
The Agent 6 will first use the environment variables and the configuration file (as for every other setting available through environment variables). This is the opposite of Agent 5, which would always use the proxy from the configuration file if set.
For proxies, the Agent 6 will override the values from the configuration file
with the ones in the environment. This means that if both a HTTP
and HTTPS
proxy are set in the configuration file but only the HTTPS_PROXY
is set in
the environment, the agent will use the HTTPS
value from the environment and
the HTTP
value from the configuration file.
Agent 6 deprecated Agent5's Windows Agent Manager GUI, replacing it with a browser-based, cross-platform one. See the specific docs for more details.
The new command line interface for the Agent is sub-command based:
Command | Notes |
---|---|
check | Run the specified check |
configcheck | Print all configurations loaded & resolved of a running agent |
diagnose | Execute some connectivity diagnosis on your system |
flare | Collect a flare and send it to Datadog |
health | Print the current agent health |
help | Help about any command |
hostname | Print the hostname used by the Agent |
import | Import and convert configuration files from previous versions of the Agent |
installservice | Installs the agent within the service control manager |
launch-gui | starts the Datadog Agent GUI |
regimport | Import the registry settings into datadog.yaml |
remove-service | Removes the agent from the service control manager |
restart-service | restarts the agent within the service control manager |
start | Start the Agent |
start-service | starts the agent within the service control manager |
status | Print the current status |
stopservice | stops the agent within the service control manager |
jmx | JMX troubleshooting |
version | Print the version info |
To see the list of available sub-commands on your platform, run:
<agent_binary> --help
To run a sub-command, the Agent binary must be invoked like this:
<agent_binary> <sub_command> <options>
Some options have their own set of flags and options detailed in a help message.
For example, to see how to use the check
sub-command, run:
<agent_binary> check --help
There are a few major changes:
- only the lifecycle commands (i.e.
start
/stop
/restart
/status
on the Agent service) should be run withsudo service
/sudo initctl
/sudo systemctl
- all the other commands need to be invoked on the Agent binary, located in the
PATH
(/usr/bin
) asdatadog-agent
by default. Thedd-agent
command is not available anymore. - the
info
command has been renamedstatus
- the Agent 6 does not ship a SysV-init script (previously located at
/etc/init.d/datadog-agent
)
The lifecycle commands didn't change if the service
wrapper command is available on your system.
For example, on Ubuntu, the lifecycle commands are:
Command | Notes |
---|---|
sudo service datadog-agent start |
Start the Agent as a service |
sudo service datadog-agent stop |
Stop the Agent service |
sudo service datadog-agent restart |
Restart the Agent service |
sudo service datadog-agent status |
Print the status of the Agent service |
If the service
wrapper command is not available on your system, use:
- on
upstart
-based systems:sudo start/stop/restart/status datadog-agent
- on
systemd
-based systems:sudo systemctl start/stop/restart/status datadog-agent
If you're unsure which init system your distribution uses by default, please refer to the table below:
distribution \ init system | upstart |
systemd |
Notes |
---|---|---|---|
Amazon Linux (<= 2017.09) | ✅ | ||
Amazon Linux 2 (>= 2017.12) | ✅ | ||
CentOS/RHEL 6 | ✅ | ||
CentOS/RHEL 7 | ✅ | ||
Debian 7 (wheezy) | currently unsupported unless you install systemd | ||
Debian 8 (jessie) & 9 (stretch) | ✅ | ||
SUSE 11 | currently unsupported unless you install systemd | ||
SUSE 12 | ✅ | ||
Ubuntu < 15.04 | ✅ | ||
Ubuntu >= 15.04 | ✅ |
Other functionalities are now provided by the Agent binary itself as sub-commands and shouldn't be invoked with service
/systemctl
/initctl
. Here are a few examples:
Agent5 Command | Agent6 Command | Notes |
---|---|---|
sudo service datadog-agent info |
sudo datadog-agent status |
Status page of a running Agent |
sudo service datadog-agent flare |
sudo datadog-agent flare |
Send flare |
sudo service datadog-agent |
sudo datadog-agent --help |
Display Agent usage |
sudo -u dd-agent -- dd-agent check <check_name> |
sudo -u dd-agent -- datadog-agent check <check_name> |
Run a check |
There are a few major changes
- the main executable name is now
agent.exe
(it wasddagent.exe
previously) - Commands should be run with the command line
"c:\program files\datadog\datadog agent\embedded\agent.exe" <command>
from an "As Administrator" command prompt. - The configuration GUI is now a web-based configuration application, it can be easily accessed by running
the command
"c:\program files\datadog\datadog agent\embedded\agent.exe" launch-gui
or using the systray app. - The Windows service is now started "Automatic-Delayed"; it is started automatically on boot, but after all other services. This will result in a small delay in reporting of metrics after a reboot of a Windows device.
- The Windows GUI and Windows system tray icon are now implemented separately. See the specific docs for more details.
- the lifecycle commands (former
datadog-agent start
/stop
/restart
/status
on the Agent 5) are replaced bylaunchctl
commands on thecom.datadoghq.agent
service, and should be run under the logged-in user. For these commands, you can also use the Datadog Agent systray app - all the other commands can still be run with the
datadog-agent
command (located in thePATH
(/usr/local/bin/
) by default) - the
info
command has been renamedstatus
- The configuration GUI is now a web-based configuration application, it can be easily accessed by running
the command
datadog-agent launch-gui
or using the systray app.
A few examples:
Agent5 Command | Agent6 Command | Notes |
---|---|---|
datadog-agent start |
launchctl start com.datadoghq.agent or systray app |
Start the Agent as a service |
datadog-agent stop |
launchctl stop com.datadoghq.agent or systray app |
Stop the Agent service |
datadog-agent restart |
run stop then start or systray app |
Restart the Agent service |
datadog-agent status |
launchctl list com.datadoghq.agent or systray app |
Print the Agent service status |
datadog-agent info |
datadog-agent status or web GUI |
Status page of a running Agent |
datadog-agent flare |
datadog-agent flare or web GUI |
Send flare |
not implemented | datadog-agent --help |
Display command usage |
datadog-agent check <check_name> |
datadog-agent check <check_name> |
Run a check (unchanged) |
The Agent logs are still located in the /var/log/datadog/
directory. On Windows, the logs are still located in the c:\programdata\Datadog\logs
directory.
Prior releases were logging to multiple files in that directory (collector.log
,
forwarder.log
, dogstatsd.log
, etc). Starting with 6.0 the Agent logs to a single log file, agent.log
.
The APM agent (also known as trace agent) is shipped by default with the Agent 6 in the Linux, MacOS and Windows packages.
The APM agent is enabled by default on linux.
To enable the check on other platforms or disable it on linux,
you can update the apm_config
key in your datadog.yaml
:
apm_config:
enabled: true
For the Docker image, the APM agent is disabled by default. You can enable it by setting
the DD_APM_ENABLED
envvar to true
. It will listen to all interfaces by default.
If you want to listen to non-local trafic on any other platform, you can set
apm_config.apm_non_local_traffic = true
in your datadog.yaml
.
The process agent is shipped by default with the Agent 6 in the Linux packages only.
The process agent is not enabled by default. To enable the check you can update your datadog.yaml
file to add the following:
process_config:
enabled: "true"
The enabled
value is a string with the following options:
"true"
: Enable the process-agent to collect processes and containers."false"
: Only collect containers if available (the default)"disabled"
: Don't run the process-agent at all.
The Docker check has been rewritten in Go to take advantage of the new internal architecture of the Agent, mainly bringing a consistent behaviour across every container related component. Therefore the Python version will never work within Agent 6.
-
The new check is named
docker
and no longerdocker_daemon
. All features are ported, excepted the following deprecations:- the
url
,api_version
andtags*
options are deprecated, direct use of the standard docker environment variables is encouraged - the
ecs_tags
,performance_tags
andcontainer_tags
options are deprecated. Every relevant tag is now collected by default - the
collect_container_count
option to enable thedocker.container.count
metric is not supported.docker.containers.running
and.stopped
are to be used
- the
-
Some options have moved from
docker_daemon.yaml
to the maindatadog.yaml
:collect_labels_as_tags
has been renameddocker_labels_as_tags
and now supports high cardinality tags, see the details indatadog.yaml.example
exclude
andinclude
lists have been renamedac_include
andac_exclude
. In order to make filtering consistent accross all components of the agent, we had to drop filtering on arbitrary tags. The only supported filtering tags areimage
(image name) andname
(container name). Regexp filtering is still available, seedatadog.yaml.example
for examplesdocker_root
option has been split in two optionscontainer_cgroup_root
andcontainer_proc_root
exclude_pause_container
has been added to exclude pause containers on Kubernetes and Openshift (default to true). This will avoid removing them from the exclude list by error
The import
will help you converting the old
docker_daemon.yaml
to the new docker.yaml
. The command will also move
needed settings from docker_daemon.yaml
to datadog.yaml
.
Agent 6 currently supports Kubernetes versions 1.7.6 and above. Support for previous versions will be added in a future release.
The kubernetes
integration insights are provided combining:
- The
kubelet
check retrieving metrics from the kubelet - The
kubernetes_apiserver
check retrieving events and service checks from the apiserver
The agent import
command (in versions 6.2 and higher) will import settings from the legacy kubernetes.yaml
configuration, if found. The following options are deprecated:
- API Server credentials (
api_server_url
,apiserver_client_crt
,apiserver_client_key
,apiserver_ca_cert
) please provide a a kubeconfig file to the agent via thekubernetes_kubeconfig_path
option use_histogram
: please contact support to determine the best alternative for younamespaces
,namespace_name_regexp
: Agent6 now collects metrics from all available namespaces
The upgrade logic enables the new prometheus metric collection, that is compatible with Kubernetes versions 1.7.6 and up. If you run an older version or want to revert to the cadvisor collection logic, you can set the cadvisor_port
option back to 4194
(or the port your kubelet exposes cadvisor at).
The kubernetes_state
integration works on both versions of the agent.
While Agent5 automatically collected every pod label as tags, Agent6 needs you to whilelist
labels that are relevant to you. This is done with the kubernetes_pod_labels_as_tags
option
in datadog.yaml
.
The following options and tags are deprecated:
- `label_to_tag_prefix` option is superseeded by kubernetes_pod_labels_as_tags
- `container_alias` tags are not collected anymore
- `kube_replicate_controller` is only added if the pod is created by a replication controller,
not systematically. Use the relevant creator tag (`kube_deployment` / `kube_daemon_set`...)
We reworked the Autodiscovery system from the ground up to be faster and more reliable.
We also worked on decoupling container runtimes and orchestrators, to be more flexible in the future. This includes the move from docker_images
to ad_identifiers
in templates.
All documented use cases are supported, please contact our support team if you run into issues.
When using Kubernetes, the Autodiscovery system now sources information from the kubelet, instead of the Docker daemon. This will allow AD to work without access to the Docker socket, and enable a more consistent experience accross all parts of the agent. Also, the default behaviour is to source AD templates from pod annotations. You can enable the docker
config-provider to use container labels, and replace the kubelet
listener by the kubelet
one if you need AD on containers running out of pods.
When specifying AD templates in pod annotations, the new annotation name prefix is ad.datadoghq.com/
. the previous annotation prefix
service-discovery.datadoghq.com/
is still supported for Agent6 but support will be removed in Agent7.
Autodiscovery templates in Docker labels still work, with the same com.datadoghq.ad.
name prefix.
The identifier override label has been renamed from com.datadoghq.sd.check.id
to com.datadoghq.ad.check.id
for consistency. The previous
name is still supported for Agent6 but support will be removed in Agent7.
While we are continuing to ship the python libraries that ship with Agent 5,
some of the embedded libraries have been removed. util.py
and its associated
functions have been removed from the agent. util.headers(...)
is still included
in the agent, but implemented in C and Go and passed through to the check.
Note: all the official integrations have had these obsolete modules removed from them, so these changes will only affect custom checks.
Much of the utils
directory has been removed from the agent as well. However,
most of what was removed was not diretly related to checks and wouldn't be imported
in almost anyone's checks. The flare module, for example, was removed and
reimplemented in Go, but is unlikely to have been used by anyone in a custom check.
To learn more, you can read about the details in the development documentation.
Even if the new Agent fully supports Python checks, a few of those provided by integrations-core are expected to fail if run within the Agent as they are not currently implemented:
- agent_metrics
- docker_daemon replaced by a new
docker
check - kubernetes replaced by new checks
The base class for python checks remains AgentCheck
, and you will import it in
the same way. However, there are a number of things that have been removed or
changed in the class API. In addition, each check instance is now its own instance
of the class. So you cannot share state between them.
Some methods in the AgentCheck
class are not currently implemented. These include:
service_metadata
get_service_metadata
generate_historate_func
generate_histogram_func
stop
The function signature of the metric senders changed from:
gauge(self, metric, value, tags=None, hostname=None, device_name=None, timestamp=None)
to:
gauge(self, name, value, tags=None, hostname=None, device_name=None)
The following methods have been permanently removed from AgentCheck
:
_roll_up_instance_metadata
instance_count
is_check_enabled
read_config
set_check_version
set_manifest_path
_get_statistic_name_from_method
_collect_internal_stats
_get_internal_profiling_stats
_set_internal_profiling_stats
get_library_versions
get_library_info
from_yaml
get_service_checks
has_warnings
get_metrics
has_events
get_events
All the official integrations have had these methods removed from them, so these will only affect custom checks.
With Agent 6, the order of precedence between custom
checks (i.e. checks in the /etc/datadog-agent/checks.d/
folder by default on Linux) and the checks shipped
with the Agent by default (i.e. checks from integrations-core
) has changed: the
integrations-core
checks now have precedence over custom checks.
This affects your setup if you have custom checks that have the same name as existing integrations-core
checks: these custom checks will now be ignored, and the integrations-core
checks loaded instead.
To fix your custom check setup with Agent 6, rename your affected custom checks to a new and unused name,
and rename the related .yaml
configuration files accordingly.
If you happen to use custom checks, there's a chance your code depends on py code that was bundled with agent5 that may not longer be available in the with the new Agent 6 package. This is a list of packages no longer bundled with the Agent:
- backports.ssl-match-hostname
- datadog
- decorator
- future
- futures
- google-apputils
- pycurl
- pyOpenSSL
- python-consul
- python-dateutil
- python-etcd
- python-gflags
- pytz
- PyYAML
- rancher-metadata
- tornado
- uptime
- websocket-client
If your code depends on any of those packages, it'll break. You can fix that by running the following:
sudo -u dd-agent -- /opt/datadog-agent/embedded/bin/pip install <dependency>
Similarly, you may have added a pip package to meet a requirement for a custom check while on Agent 5. If the added pip package had inner dependencies with packages already bundled with Agent 5 (see list above), those dependencies will be missing after the upgrade to Agent 6 and your custom checks will break. You will have to install the missing dependencies manually as described above.
The Agent 6 ships JMXFetch, with the following changes:
The Agent 6 does not ship the jmxterm
JAR. If you wish to download and use jmxterm
, please refer to the upstream project.
Troubleshooting commands syntax have changed. These commands are available since v6.2.0, for earlier v6 versions please refer to the earlier docs:
-
sudo -u dd-agent datadog-agent jmx list matching
: List attributes that match at least one of your instances configuration. -
sudo -u dd-agent datadog-agent jmx list limited
: List attributes that do match one of your instances configuration but that are not being collected because it would exceed the number of metrics that can be collected. -
sudo -u dd-agent datadog-agent jmx list collected
: List attributes that will actually be collected by your current instances configuration. -
sudo -u dd-agent datadog-agent jmx list not-matching
: List attributes that don’t match any of your instances configuration. -
sudo -u dd-agent datadog-agent jmx list everything
: List every attributes available that has a type supported by JMXFetch. -
sudo -u dd-agent datadog-agent jmx collect
: Start the collection of metrics based on your current configuration and display them in the console.
By default theses command will run on all the configured jmx checks. If you want to
use them for specific checks, you can specify them using the --checks
flag :
sudo datadog-agent jmx list collected --checks tomcat
Only affects Agents running on GCE
When running on GCE, by default, Agent 6 uses the instance's hostname provided by GCE as its hostname.
This matches the behavior of Agent 5 (since v5.5.1) if gce_updated_hostname
is set to true in datadog.conf
,
which is recommended.
If you're upgrading from an Agent 5 with gce_updated_hostname
unset or set to false, and the hostname
of the Agent is not hardcoded in datadog.conf
/datadog.yaml
, the reported hostname on Datadog
will change from the GCE instance name to the full GCE instance hostname (which includes the GCE project id).