Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Restructure the logs page #4949

Merged
merged 7 commits into from
Aug 5, 2024
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
205 changes: 172 additions & 33 deletions content/en/docs/concepts/signals/logs.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,31 +5,178 @@ description: A recording of an event.
---

A **log** is a timestamped text record, either structured (recommended) or
unstructured, with metadata. Of all telemetry signals, logs have the biggest
legacy. Most programming languages have built-in logging capabilities or
well-known, widely used logging libraries. Although logs are an independent data
source, they may also be attached to spans. In OpenTelemetry, any data that is
not part of a distributed trace or a metric is a log. For example, _events_ are
a specific type of log. Logs often contain detailed debugging/diagnostic info,
such as inputs to an operation, the result of the operation, and any supporting
metadata for that operation.
unstructured, with optional metadata. Of all telemetry signals, logs have the
biggest legacy. Most programming languages have built-in logging capabilities or
well-known, widely used logging libraries.

For traces and metrics, OpenTelemetry takes the approach of a clean-sheet
design, specifies a new API, and provides full implementations of this API in
multiple language SDKs.
## OpenTelemetry logs

OpenTelemetry's approach with logs is different. Because existing logging
solutions are widespread in language and operational ecosystems, OpenTelemetry
acts as a "bridge" between those logs, the tracing and metrics signals, and
other OpenTelemetry components. In fact, the API for logs is called the "Logs
Bridge API" for this reason.
OpenTelemetry does not define a bespoke API or SDK to create logs. Instead,
OpenTelemetry logs are the existing logs you already have from a logging
framework or infrastructure component. OpenTelemetry SDKs and
autoinstrumentation utilize several components to automatically correlate logs
with [traces](/docs/concepts/signals/traces).

The [logs specification][] contains more details on this philosophy.
OpenTelemetry's support for logs is designed to be fully compatible with what
you already have, providing capabilities to wrap those logs with additional
context and a common toolkit to parse and manipulate logs into a common format
across many different sources.

To understand how logging in OpenTelemetry works, let's look at a list of
components that will play a part in instrumenting our code.
### OpenTelemetry logs in the OpenTelemetry Collector

## Log Appender / Bridge
The [OpenTelemetry Collector](/docs/collector) provides several tools to work
with logs:

- Several receivers which parse logs from specific, known sources of log data.
- The filelogreceiver, which reads logs from any file and provides features to
parse them from different formats or use a regular expression.
- Processors like the transformprocessor which lets you parse nested data,
flatten nested structures, add/remove/update values, and more.
- Exporters that let you emit log data in a non-OpenTelemetry format.

The first step in adopting OpenTelemetry frequently involves deploying a
Collector as a general-purposes logging agent.

### OpenTelemetry logs for applications

In applications, OpenTelemetry logs are created with any logging library or
built-in logging capabilities. When you add autoinstrumentation or activate an
SDK, OpenTelemetry will automatically correlate your existing logs with any
active trace and span, wrapping the log body with their IDs. In other words,
OpenTelemetry automatically correlates your logs and traces.

### Language support

Logs are a [stable](/docs/specs/otel/versioning-and-stability/#stable) signal in
the OpenTelemetry specification. For the individual language specific
implementations of the Logs API & SDK, the status is as follows:

{{% signal-support-table "logs" %}}

## Structured, unstructured, and semistructured logs

OpenTelemetry does not technically distinguish between structured and
unstructured logs. You can use any log you have with OpenTelemetry. However, not
all log formats are equally useful! Structured logs, in particular, are
recommended for production observability because they are easy to parse and
analyze at scale. The following section explains the differences between
structured, unstructured, and semistructured logs.

### Structured logs

A structured log is a log whose textual format follows a consistent,
machine-readable format. For applications, one of the most common formats is
JSON:

```json
{
"timestamp": "2024-08-04T12:34:56.789Z",
"level": "INFO",
"service": "user-authentication",
"environment": "production",
"message": "User login successful",
"context": {
"userId": "12345",
"username": "johndoe",
"ipAddress": "192.168.1.1",
"userAgent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.0.0 Safari/537.36"
},
"transactionId": "abcd-efgh-ijkl-mnop",
"duration": 200,
"request": {
"method": "POST",
"url": "/api/v1/login",
"headers": {
"Content-Type": "application/json",
"Accept": "application/json"
},
"body": {
"username": "johndoe",
"password": "******"
}
},
"response": {
"statusCode": 200,
"body": {
"success": true,
"token": "jwt-token-here"
}
}
}
```

and for infrastructure components, Common Log Format (CLF) is commonly used:

```text
127.0.0.1 - johndoe [04/Aug/2024:12:34:56 -0400] "POST /api/v1/login HTTP/1.1" 200 1234
```

It is also common to have different structured log formats mixed together. For
example, an Extended Log Format (ELF) log can mix JSON with the
whitespace-separated data in a CLF log.

```text
192.168.1.1 - johndoe [04/Aug/2024:12:34:56 -0400] "POST /api/v1/login HTTP/1.1" 200 1234 "http://example.com" "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.0.0 Safari/537.36" {"transactionId": "abcd-efgh-ijkl-mnop", "responseTime": 150, "requestBody": {"username": "johndoe"}, "responseHeaders": {"Content-Type": "application/json"}}
```

To make the most use of this log, parse both the JSON and the ELF-related pieces
into a shared format to make analysis on an observability backend easier. The
`filelogreceiver` in the [OpenTelemetry Collector](/docs/collector) contains
standardized ways to parse logs like this.

Structured logs are the preferred way to use logs. Because structured logs are
emitted in a consistent format, they are straightforward to parse, which makes
them easier to preprocess in an OpenTelemetry Collector, correlate with other
data, and ultimate analyze in an Observability backend.

### Unstructured logs

Unstructured logs are logs that don't follow a consistent structure. They may be
more human-readable, and are often used in development. However, it is not
preferred to use unstructured logs for production observability purposes, since
they are much more difficult to parse and analyze at scale.

Examples of unstructured logs:

```text
[ERROR] 2024-08-04 12:45:23 - Failed to connect to database. Exception: java.sql.SQLException: Timeout expired. Attempted reconnect 3 times. Server: db.example.com, Port: 5432

System reboot initiated at 2024-08-04 03:00:00 by user: admin. Reason: Scheduled maintenance. Services stopped: web-server, database, cache. Estimated downtime: 15 minutes.

DEBUG - 2024-08-04 09:30:15 - User johndoe performed action: file_upload. Filename: report_Q3_2024.pdf, Size: 2.3 MB, Duration: 5.2 seconds. Result: Success
```

It is possible to store and analyze Unstructured logs in production, although
you may need to do substantial work to parse or otherwise pre-process them to be
machine-readable. For example, the above three logs will require a regular
expression to parse their timestamps and custom parsers to consitently extract
the bodies of the log message. This will typically be necessary for a logging
backend to know how to sort and organize the logs by timestamp. Although it's
possible to parse unstructured logs for analysis purposes, doing this may be
more work than switching to structured logging, such as via a standard logging
framework in your applications.

### Semistructured logs

A semistructured log is a log that does use some self-consistent patterns to
distinguish data so that it's machine-readable, but may not use the same
formatting and delimiters between data across different systems.

Example of a semistructured log:

```text
2024-08-04T12:45:23Z level=ERROR service=user-authentication userId=12345 action=login message="Failed login attempt" error="Invalid password" ipAddress=192.168.1.1 userAgent="Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.0.0 Safari/537.36"
```

Although machine-readable, semistructured logs may require several different
parsers to allow for analysis at scale.

## OpenTelemetry logging components

The following lists of concepts and components power OpenTelemetry's logging
support.

### Log Appender / Bridge

As an application developer, the **Logs Bridge API** should not be called by you
directly, as it is provided for logging library authors to build log appenders /
Expand All @@ -39,7 +186,7 @@ OpenTelemetry LogRecordExporter.

OpenTelemetry language SDKs offer this functionality.

## Logger Provider
### Logger Provider

> Part of the **Logs Bridge API** and should only be used if you are the author
> of a logging library.
Expand All @@ -49,20 +196,20 @@ A Logger Provider (sometimes called `LoggerProvider`) is a factory for
lifecycle matches the application's lifecycle. Logger Provider initialization
also includes Resource and Exporter initialization.

## Logger
### Logger

> Part of the **Logs Bridge API** and should only be used if you are the author
> of a logging library.

A Logger creates log records. Loggers are created from Log Providers.

## Log Record Exporter
### Log Record Exporter

Log Record Exporters send log records to a consumer. This consumer can be
standard output for debugging and development-time, the OpenTelemetry Collector,
or any open source or vendor backend of your choice.

## Log Record
### Log Record

A log record represents the recording of an event. In OpenTelemetry a log record
contains two kinds of fields:
Expand All @@ -89,15 +236,7 @@ The top-level fields are:
For more details on log records and log fields, see
[Logs Data Model](/docs/specs/otel/logs/data-model/).

## Language Support

Logs are a [stable](/docs/specs/otel/versioning-and-stability/#stable) signal in
the OpenTelemetry specification. For the individual language specific
implementations of the Logs API & SDK, the status is as follows:

{{% signal-support-table "logs" %}}

## Specification
### Specification

To learn more about logs in OpenTelemetry, see the [logs specification][].

Expand Down