HTTP tables #731

krizhanovsky · 2017-05-13T19:35:02Z

The current HTTP scheduler must be reworked into more generic HTTP tables (similar to iptables or nftables) that cooperate with the lower-level TCP/IP filtering logic through the nftables mark. An example of the new HTTP tables (based on the example from https://github.com/tempesta-tech/tempesta/wiki/Scheduling-and-Load-Balancing#http-scheduler-for-server-groups):

# Server groups, just the same as previously.
srv_group waf { ... }
srv_group static { ... }
srv_group foo_app { ... }
srv_group foo_app_backup { ... }

# Logical HTTP tables are defined before their usage and have names.
# This requirement enforces the absence of loops.
http_chain resource_chain {
        uri == "*.php" -> static;
        host == "static.*" -> static;
        host == "foo.example.com" -> foo_app backup=foo_app_backup;
        hdr_raw == "X-Custom-Bar-Hdr: *" -> static;
}

# WAF redirection rules from issue #760.
http_chain waf_chain {
        # 4. ...the mark should be used to forward the request to backend.
        mark == 2 -> resource_chain;
        # 2. Tempesta must forward it to WAF by some scheduling policy.
        # Set the mark to pass information to nftables.
        -> mark = 3;
        -> waf;
}

# The main HTTP table w/o name. Always the last one.
http_chain {
        # Block DDoS requests early using nftables mark.
        # Process value of skb->mark set by nftables
        mark != 1 -> waf_chain;
        referer != "*hacked.com" -> waf_chain;
        # Default rule instead of "match <SRV_GROUP> * * *":
        -> block;
}

Note the rule referer != "*hacked.com" -> waf_chain: this is essentially an L7 filtering rule. Many such rules are expected to be produced at run time, so all the rules must be dynamically reconfigurable in the sense of #51.

Several aspects of the rework are described below.

Extend http_match headers set

Currently tfw_http_match_fld_t does not handle all special headers; for example, X-Forwarded-For, User-Agent and Cookie are missing. The Referer header must also be made special and handled properly by http_match, which is important for handling iFrame attacks correctly. Update https://github.com/tempesta-tech/tempesta/wiki/DDoS-mitigation#iframes-on-a-busy-site accordingly.

Syntax changes

The long prefix and suffix keywords are unnecessary: they must be replaced by a * at the end (prefix match) or at the beginning (suffix match) of the pattern, as in usual wildcards.
eq is replaced by ==.
The redundant match keyword at the beginning of scheduler rules is dropped.
New keywords and operators are introduced: ->, ==, !=, =, mark, block, referer, user_agent, cookie, and possibly other special headers forgotten here.
sched_http_rules is renamed to http_table, with an optional name.
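
The wildcard convention in the list above can be illustrated with a minimal sketch. It assumes a single * either at the start or at the end of a pattern, as in the examples in this issue; the function name wildcard_match is hypothetical and is not part of the Tempesta FW code base.

```python
def wildcard_match(pattern: str, value: str) -> bool:
    """Match value against a pattern with one leading or trailing '*'.

    Assumption: only the forms used in this issue are covered, i.e.
    "*suffix", "prefix*" and an exact string. A '*' in the middle of a
    pattern, or on both sides, is not treated specially here.
    """
    if pattern.startswith("*"):
        # "*.php" replaces the old suffix keyword
        return value.endswith(pattern[1:])
    if pattern.endswith("*"):
        # "static.*" replaces the old prefix keyword
        return value.startswith(pattern[:-1])
    # No wildcard: plain equality, the old eq
    return value == pattern


if __name__ == "__main__":
    print(wildcard_match("*.php", "/index.php"))
    print(wildcard_match("static.*", "static.example.com"))
    print(wildcard_match("foo.example.com", "bar.example.com"))
```

A != rule would simply negate the result of the same match.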

Rules processing

All current functions in tfw_sched_http.c, as well as the file itself, must be renamed to use the tfw_http_tbl prefix. Basically, the core logic of tfw_http_match_req() remains the same: the rules in an http_table are processed one after another. However, do_match() must be renamed to something like do_eval() and reworked accordingly, since it must now be able to (1) set skb->mark for all skbs inside msg and (2) handle != as well as == (eq). For a -> mark = 3 rule, do_eval() can simply return false to move on to the next rule in the table.

The new keyword block must block the current request according to the specified block_action, after which chain processing for the request finishes, just as it does now when a server group is chosen. If block requires tighter integration with the HTTP code, it probably makes sense to move the module out of the /sched directory and build it within the main Tempesta FW module.

The mark keyword works in two modes: reading skb->mark as in #844, and setting it for all skbs in the current msg.

The -> operator is just syntactic sugar for now, separating the rule condition from the rule action. Later it will allow us to extend the rule language with more complex expressions (e.g. ... -> mark = 3, waf instead of the 2 lines in the example above).
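
The rule processing described in this section can be sketched as a small interpreter. This is only an illustration under stated assumptions, not the kernel implementation: the names Request, Rule, wc, eval_chain and CHAINS are hypothetical, the mark field stands in for skb->mark, and the sketch assumes that when a nested chain produces no verdict, evaluation continues with the next rule of the calling chain (the issue does not specify this). The rules mirror the first configuration example, except for the hdr_raw rule and the backup= option, which are omitted for brevity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Request:
    mark: int = 0      # stands in for skb->mark
    uri: str = ""
    host: str = ""
    referer: str = ""


@dataclass
class Rule:
    cond: Optional[Callable[[Request], bool]]  # None: the rule always applies
    action: str                                # "mark", "chain", "group" or "block"
    arg: object = None


def wc(pattern: str, value: str) -> bool:
    """Wildcard match with a single leading or trailing '*'."""
    if pattern.startswith("*"):
        return value.endswith(pattern[1:])
    if pattern.endswith("*"):
        return value.startswith(pattern[:-1])
    return value == pattern


def eval_chain(name: str, chains: Dict[str, List[Rule]], req: Request) -> Optional[str]:
    """Process the rules of a chain one after another and return a verdict."""
    for rule in chains[name]:
        if rule.cond is not None and not rule.cond(req):
            continue
        if rule.action == "mark":
            # "-> mark = N": set the mark, then move on to the next rule
            req.mark = rule.arg
            continue
        if rule.action == "chain":
            verdict = eval_chain(rule.arg, chains, req)
            if verdict is not None:
                return verdict
            continue  # assumption: fall back to the calling chain
        if rule.action == "block":
            return "block"
        return rule.arg  # a server group finishes the processing
    return None


CHAINS: Dict[str, List[Rule]] = {
    "resource_chain": [
        Rule(lambda r: wc("*.php", r.uri), "group", "static"),
        Rule(lambda r: wc("static.*", r.host), "group", "static"),
        Rule(lambda r: r.host == "foo.example.com", "group", "foo_app"),
    ],
    "waf_chain": [
        Rule(lambda r: r.mark == 2, "chain", "resource_chain"),
        Rule(None, "mark", 3),
        Rule(None, "group", "waf"),
    ],
    "main": [
        Rule(lambda r: r.mark != 1, "chain", "waf_chain"),
        Rule(lambda r: not wc("*hacked.com", r.referer), "chain", "waf_chain"),
        Rule(None, "block"),
    ],
}

if __name__ == "__main__":
    req = Request(mark=2, host="foo.example.com", uri="/")
    print(eval_chain("main", CHAINS, req))
```

Keeping the condition and the action as separate fields of a rule follows the role of the -> operator, and it leaves room for compound actions such as -> mark = 3, waf later.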

Documentation & testing

Please revise the following docs:

https://github.com/tempesta-tech/tempesta/wiki/Scheduling-and-Load-Balancing

I created issue #883 for the test. Please add other interesting cases to the task.


krizhanovsky · 2018-02-10T00:01:18Z

A use case (please document it in the Wiki): protecting many backend servers with different IP addresses by Tempesta FW alone, without introducing a third-party WAF (issue #907 addresses the case with a third-party WAF). This can be configured efficiently by scheduling on mark (integer values that nftables derives from destination IP addresses) instead of string matching on Host values:

http_chain {
        mark == 2 -> backend_0;
        mark == 3 -> backend_1;
        mark == 4 -> backend_2;
        mark == 5 -> backend_3;
        ....
}

This scheduling (routing) method can also be useful for correctly routing HTTP/1.0 requests that have no Host header.

While Tempesta FW is able to operate on marks and the routing table directly, the problem with this scenario is that Tempesta FW is a full TCP proxy with separate TCP connections to the client and to the server, so we can't just pass a packet to the right server; we have to resend it through a server socket.

Note that this is also expected to work in busy environments with a large and constantly varying number of backend servers, so these rules must also be dynamically reconfigurable in the sense of #51.
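
The mark-based routing above amounts to an integer lookup rather than a string comparison on Host. A minimal sketch, assuming a hypothetical MarkRouter class and the mark values from the example; it says nothing about how Tempesta FW or #51 actually implement reconfiguration:

```python
from typing import Dict, Optional


class MarkRouter:
    """Map an integer mark (as set by nftables) to a server group name."""

    def __init__(self, table: Dict[int, str]) -> None:
        self._table = dict(table)

    def route(self, mark: int) -> Optional[str]:
        # Works the same for HTTP/1.0 requests without a Host header,
        # because only the mark is consulted.
        return self._table.get(mark)

    def reconfigure(self, table: Dict[int, str]) -> None:
        # Build the new table completely, then replace the reference in
        # one assignment, so a lookup never sees a half-updated table.
        self._table = dict(table)


if __name__ == "__main__":
    router = MarkRouter({2: "backend_0", 3: "backend_1", 4: "backend_2", 5: "backend_3"})
    print(router.route(3))
    router.reconfigure({2: "backend_0", 6: "backend_4"})
    print(router.route(6))
```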

krizhanovsky · 2018-03-29T14:42:51Z

Since #688 depends on this issue and on #471, which changes how server groups are handled, the configuration example above, with #471 in mind, would look like:

# Server groups, just the same as previously.
srv_group waf { ... }
srv_group static { ... }
srv_group foo_app { ... }
srv_group foo_app_backup { ... }

vhost natsys {
        proxy_pass static;
}

vhost tempesta-tech {
        location "?" {
                proxy_pass foo_app backup=foo_app_backup;
                cache_fulfill; # cache all proxied data
        }
}

http_chain resource_chain {
        uri == "*.php" -> natsys;
        host == "static.*" -> natsys;
        # The right side of '->' operator can have server group or vhost
        host == "foo.example.com" -> tempesta-tech;
        hdr_raw == "X-Custom-Bar-Hdr: *" -> natsys;
}

# WAF redirection rules from issue #760.
http_chain waf_chain {
        # 4. ...the mark should be used to forward the request to backend.
        mark == 2 -> resource_chain;
        # 2. Tempesta must forward it to WAF by some scheduling policy.
        # Set the mark to pass information to nftables.
        -> mark = 3;
        -> waf;
}

# The main HTTP table w/o name. Always the last one.
http_chain {
        # Block DDoS requests early using nftables mark.
        # Process value of skb->mark set by nftables
        mark != 1 -> waf_chain;
        referer != "*hacked.com" -> waf_chain;
        # Default rule instead of "match <SRV_GROUP> * * *":
        -> block;
}

Fix #731: HTTP tables introduction.

krizhanovsky added crucial enhancement security labels May 13, 2017

krizhanovsky added this to the 1.0 WebOS milestone May 13, 2017

krizhanovsky mentioned this issue May 13, 2017

HTTP QoS for asymmetric DDoS mitigation #488

Open

krizhanovsky mentioned this issue Jul 3, 2017

nftables mark processing and scheduling #760

Closed

krizhanovsky mentioned this issue May 13, 2017

TDBv0.2: Cache background revalidation and eviction #515

Open

22 tasks

krizhanovsky changed the title ~~Multi-layer firewall~~ HTTP tables Jan 14, 2018

krizhanovsky modified the milestones: backlog, 0.6 KTLS Jan 14, 2018

This was referenced Jan 14, 2018

Functional test for HTTP tables #883

Closed

Fast HTTP match #732

Open

krizhanovsky mentioned this issue Feb 10, 2018

Variables and conditions for custom HTTP headers #907

Open

krizhanovsky assigned aleksostapenko Feb 18, 2018

krizhanovsky mentioned this issue May 15, 2017

TDB secondary index #733

Open

krizhanovsky mentioned this issue Mar 29, 2018

Web-server mode #471

Open

aleksostapenko added a commit that referenced this issue May 15, 2018

Add HTTP tables processing (#731).

126ed95

aleksostapenko added a commit that referenced this issue May 15, 2018

Minor corrections (#731).

0c892e2

aleksostapenko added a commit that referenced this issue May 16, 2018

Correction of http table cleanup procedure (#731).

07a322c

vankoven mentioned this issue May 17, 2018

Fix #731: HTTP tables introduction. #1017

Merged

aleksostapenko added a commit that referenced this issue May 18, 2018

Resolve bug during cleanup (#731).

e6688c7

aleksostapenko added a commit that referenced this issue May 18, 2018

Change manual in tempesta_fw.conf (#731).

73a1a0e

aleksostapenko added a commit that referenced this issue Jun 7, 2018

Changes according review comments (#731).

bbea5b4

aleksostapenko added a commit that referenced this issue Jun 8, 2018

Add processing of 'method' field into HTTP tables (#731).

fccc9e2

aleksostapenko added a commit that referenced this issue Jun 9, 2018

Additional changes according comments (#731).

7eb43b4

aleksostapenko closed this as completed in 187b357 Jun 9, 2018

aleksostapenko added a commit that referenced this issue Jun 9, 2018

Merge pull request #1017 from tempesta-tech/ao-731

333ec1c

Fix #731: HTTP tables introduction.

dalf mentioned this issue Jul 12, 2019

antibot: how ? searx/searx-docker#2

Open
