Fast access logging for analytics #537

Open · krizhanovsky opened this issue May 27, 2016 · 5 comments
Labels: crucial, enhancement, TDB (Tempesta DB module and related issues)

krizhanovsky commented May 27, 2016

Currently everything, from config file parsing errors to client blocking, is written to dmesg. Instead, the following logs on top of Tempesta DB must be introduced:

  • access log - TfwHttpReq must be passed as an argument to the tfw_log_access() function (see the sketch after this list).
  • security log - all security events must be logged there (e.g. blocking a client IP, Frang rate limiting etc.). There could be several logging functions writing events at different layers. Since we log blocking events, a DoS attack might overflow the log. At the moment the events are logged using net_warn_ratelimited(), so we're covered by the system rate limiter. We should either leave the security log in the kernel log or implement our own rate limiter.
  • error log - all HTTP processing events must go there (e.g. an invalid URI); records must also be written in TfwHttpReq context.
  • probably stress (performance) events.
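
A minimal sketch of the logging entry points in C (hypothetical prototypes; the issue only names tfw_log_access() and its TfwHttpReq argument):

    /* Hypothetical prototypes: only tfw_log_access(TfwHttpReq *) is
     * fixed by this issue; the other two just illustrate the idea of
     * per-log entry points. */
    void tfw_log_access(TfwHttpReq *req);
    void tfw_log_security(const TfwAddr *addr, unsigned int event);
    void tfw_log_error(TfwHttpReq *req, const char *reason);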

Two modes of logging must be implemented:

  1. TDB tables with automatic eviction of old records (a ring buffer) to fit in RAM
  2. log transmission to a remote server using TCP synchronous sockets (so that records aren't lost under peak load)

The logs should be configured by independent configuration options:

    log_access /opt/tempesta/db/log_access.tdb <variables>
    log_error /opt/tempesta/db/log_error.tdb <variables>
    log_security 192.168.1.1:5000 <variables>

<variables> is the list of variables to log (a concrete example follows the list):

  1. remote_addr - remote user IP
  2. time - local time (milliseconds since epoch)
  3. method - request method
  4. uri
  5. resp_status - response status
  6. body_sent - response bytes sent
  7. resp_time - response time
  8. values of any special header, with srvhdr_ or clnthdr_ prefixes and - changed to _, e.g. srvhdr_set_cookie or clnthdr_user_agent
  9. cache miss/hit
  10. origin IP
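
For illustration, an access log configured with a subset of these variables could look like this (a hypothetical combination, following the option syntax above):

    log_access /opt/tempesta/db/log_access.tdb remote_addr time method uri resp_status body_sent resp_time clnthdr_user_agent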

In general, Tempesta DB should provide the streaming data processing foundation (#515 and #516) for the logging application. Probably we need to keep all request and response headers with their metadata (e.g. timings, chunk information, TCP/IP and TLS layers etc.) for a relatively short sliding window. Such data is extremely useful to query when debugging immediate performance issues and DDoS attacks.

TBD: a possible application is batching events in front of a time series DB, e.g. ClickHouse, InfluxDB or TimescaleDB.

We need to support per-vhost logging, i.e. thousands of vhosts with several logs each. This could be done using either a secondary index (#733) or the ability to scale to thousands of TDB tables.

For better performance, the logs must use a simple sequential ring-buffer TDB table format w/o any indexes (#516). Log records must be stored as structured TDB records. Probably we don't even need TDB for this and can just mmap the ring buffer into user space.

The binary log format could be

    <event_type><timestamp><var0><var1>...<varN>

where event_type defines the event type and its format (the number of variables and their types, e.g. client address, URI, HTTP Host header etc.). In this release the format must be simple and hardcoded.
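
A sketch of such a record in C (hypothetical names and field widths; only the <event_type><timestamp><var0>...<varN> order is fixed above):

    #include <stdint.h>

    /* Only the field order is fixed by the issue; the names and widths
     * below are illustrative. */
    struct tfw_bin_log_rec {
            uint16_t        event_type; /* defines number/types of vars */
            uint64_t        timestamp;  /* e.g. milliseconds since epoch */
            unsigned char   vars[];     /* var0..varN, layout defined by
                                         * event_type */
    } __attribute__((packed));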

A simple user-space retrieval tool, similar to varnishlog, must be developed on top of tdbq to print the logs and/or write them to files in human-readable or JSON format. The tool must also be able to run in daemon mode, reading the TDB tables and flushing the log records to files or syslogd.

The human-readable text format should be compatible with the W3C draft, but should also provide more information.

Also see TUX and HAProxy, which also use(d) binary logging.

krizhanovsky commented Jan 18, 2024

At the moment there are hundreds of log messages of various levels with generic printf()-like formats. Fortunately, all of them are printed with macros like T_ERR or T_WARN, so all of them can be preprocessed by a tool which builds a C table of indexes and compiled formats to avoid format conversion at runtime. See qrintf.
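
A sketch of what the generated table could look like (hypothetical names; the issue only asks for indexes and compiled formats):

    #include <stdint.h>

    enum tfw_arg_type { ARG_INT, ARG_STR, ARG_ADDR };

    struct tfw_log_fmt {
            const char *fmt;            /* original printf()-like format */
            uint8_t    n_args;          /* number of arguments */
            uint8_t    arg_types[8];    /* argument types, in order */
    };

    /*
     * Emitted by the preprocessing tool: each T_ERR/T_WARN call site gets
     * an index, so at runtime only the index and the raw argument values
     * are written; the log reader does the format conversion offline.
     */
    static const struct tfw_log_fmt tfw_log_fmts[] = {
            { "cannot parse '%s' at line %d", 2, { ARG_STR, ARG_INT } },
            /* ...one entry per T_ERR/T_WARN call site... */
    };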

One more approach is to log only binary data (e.g. integers, floats and nanoseconds since epoch) into a ring buffer and use a log reading tool to process the binary data. This is very close to what HAProxy does: https://www.youtube.com/watch?v=762owEyCI4o

The access log is a separate case: it can be used to compute advanced statistics - a larger log allows statistics over a longer period. E.g. with the current access log we can compute statistics for each return code, so we don't actually need the counters implemented in #2023 (#1454). The access log can also be extended with:

  • whether a response was served from the cache or fetched from an upstream
  • the request size, to estimate the number of bytes sent to an upstream

Probably #2023 should be reverted; alternatively, we could provide an application layer parser which computes the statistics. This can probably be done with the same library as tdbq, see #279 and #528.

krizhanovsky commented Sep 28, 2024

The problem is crucial because of the low performance of the kernel log and the absence of analytics capabilities for DDoS incident response.

To react to a DDoS incident we need to extend the access log with JA5 fingerprints (#2052).

The access log must use per-cpu, user-space mapped ring buffers (see, for example, how tcp_mmap maps user-space pages). (Please check the current state of the generic kernel ring buffers and add a TODO comment, and probably an issue, to use them; also look at their implementation to borrow some code.) The access_log configuration option must be extended with the size of a per-cpu buffer (1MB minimum and by default) and the current mode: dmesg or mmap; a possible syntax is sketched below. The access log records must be written in a binary format (string parameters must use a 2-byte length and be truncated at 64KB). Truncated strings must carry an explicit truncation attribute.
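
For example, the extended option could look like this (a hypothetical syntax, not final):

    access_log mmap buffer_size=1M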

The log records and the whole mmaped buffers must be defined with C structures, which will later be extended, e.g. with a record type for error and security events. The buffer structure must have two integer fields: head and tail. head is the offset of the last byte written by softirq; tail is the next byte to be read by a user-space daemon. If there is not enough space to insert a new log record, softirq must count the skipped records, and the next inserted record should carry that counter in its descriptor data structure.
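
A minimal sketch of these structures in C (hypothetical names and field widths; only head, tail and the skipped-records counter are fixed above):

    #include <stdint.h>

    /* Per-CPU buffer header shared between softirq and the daemon. */
    struct tfw_mmap_log_hdr {
            uint64_t head;  /* offset of the last byte written by softirq */
            uint64_t tail;  /* next byte to be read by the daemon */
    };

    /* Record descriptor; the type field allows later extension with
     * error and security events. */
    struct tfw_mmap_log_rec {
            uint16_t        type;       /* access now; error/security later */
            uint16_t        len;        /* total record length in bytes */
            uint64_t        timestamp;
            uint64_t        dropped;    /* records skipped before this one
                                         * because the buffer was full */
            unsigned char   data[];     /* binary fields; strings carry a
                                         * 2-byte length and a truncation
                                         * flag */
    } __attribute__((packed));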

A C++ daemon must spawn N threads, where N is defined by a command line argument. During startup the daemon should define the set of CPUs each thread processes. The daemon must use the ClickHouse client library to connect to a ClickHouse instance at an address specified by another command line argument. Each thread must process the mmaped buffers of its assigned CPUs in round-robin fashion, prepare ClickHouse batches of a configured size and send them to ClickHouse. A code example from ChatGPT:

#include <clickhouse/client.h>
using namespace clickhouse;

int main() {
    // Establish a connection to the ClickHouse server.
    Client client(ClientOptions().SetHost("localhost"));

    // Fill the columns first: clickhouse-cpp fixes the block's row count
    // when columns are attached, so append the data before AppendColumn().
    auto col1 = std::make_shared<ColumnUInt64>();
    auto col2 = std::make_shared<ColumnString>();
    for (uint64_t i = 0; i < 1000; ++i) {
        col1->Append(i);
        col2->Append("value" + std::to_string(i));
    }

    // Assemble the batch.
    Block block;
    block.AppendColumn("column1", col1);
    block.AppendColumn("column2", col2);

    // Insert the batch into ClickHouse.
    client.Insert("default.my_table", block);

    return 0;
}

If a thread reaches the head of all its designated log buffers, it should sleep for a short period of time, e.g. 0.1s or 0.01s (futexes aren't available for us).
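
A sketch of such a polling loop in C (hypothetical helper names, following the head/tail contract above):

    #include <stdbool.h>
    #include <unistd.h>

    struct tfw_mmap_log_hdr;    /* layout sketched above */

    /* Hypothetical helpers: consume one record at tail (returns false if
     * the buffer is empty) and flush the ClickHouse batch once it is big
     * enough or a deadline passes. */
    bool log_consume_one(struct tfw_mmap_log_hdr *h);
    void batch_flush_if_ready(void);

    /* One daemon thread drains its assigned per-CPU buffers round-robin
     * and sleeps briefly when all of them are empty. */
    void
    consumer_loop(struct tfw_mmap_log_hdr **bufs, int n_bufs)
    {
            for (;;) {
                    bool progress = false;

                    for (int i = 0; i < n_bufs; ++i)
                            while (log_consume_one(bufs[i]))
                                    progress = true;

                    batch_flush_if_ready();

                    if (!progress)
                            usleep(10000);      /* 0.01s */
            }
    }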

All the log records must contain a current timestamp (like jiffies), which must be lightweight to obtain and accessible from user space. There is no need to sort the records - leave that to ClickHouse.

The daemon should use C++23 and Boost, but not Asio due to its poor performance.

The daemon is supposed to be extended to write to a file, but for now let's just keep the current dmesg implementation for this case.

Testing & doc

Please update the wiki.

The testing issue is #2269

krizhanovsky commented:

Also, for slow DDoS attack detection, let's add the time spent sending the response to the access log, like <response bytes sent>/<time in seconds>. A good thing to start with, in a separate pull request.

ai-tmpst added commits referencing this issue (Oct 10 - Oct 30, 2024):

In #537 we need a way to deliver log data to userspace. Introduce a set of
per-cpu ring buffers mapped to userspace.

Signed-off-by: Alexander Ivanov <[email protected]>

ai-tmpst linked a pull request Oct 10, 2024 that will close this issue.
ai-tmpst commented:

To create the access log table, connect with the clickhouse client and execute:

    CREATE TABLE IF NOT EXISTS access_log (
        timestamp DateTime64,
        address IPv6,
        method UInt8,
        version UInt8,
        status UInt16,
        response_content_length UInt32,
        response_time UInt32,
        vhost String,
        uri String,
        referer String,
        user_agent String,
        dropped_events UInt64,
        PRIMARY KEY(timestamp)
    )

krizhanovsky commented:

@ai-tmpst this table creation should live either in our Wiki installation guide, in the client handling code, or in the Tempesta scripts.
