Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Planetfm/dsos 2408/change access logs implementation #330

Merged
Merged
Show file tree
Hide file tree
Changes from 71 commits
Commits
Show all changes
74 commits
Select commit Hold shift + click to select a range
2e67770
change default value to null and see what breaks
robertsweetman Dec 12, 2023
7bbdced
revert existing bucket default value change
robertsweetman Dec 13, 2023
581c5f7
change storage descriptor
robertsweetman Dec 13, 2023
97fb888
use athena db variable name
robertsweetman Dec 13, 2023
6510108
use existing db name
robertsweetman Dec 13, 2023
098b46a
put elasticloadalancing endpoint back
robertsweetman Dec 13, 2023
d39368a
terraform-docs: automated action
github-actions[bot] Dec 13, 2023
b731c5a
Commit changes made by code formatters
github-actions[bot] Dec 13, 2023
e6d42bd
fix de-serialisation, change names and types for columns
robertsweetman Dec 13, 2023
2ee64bb
terraform-docs: automated action
github-actions[bot] Dec 13, 2023
db4d031
remove timestamp types and use serde regex instead
robertsweetman Dec 13, 2023
e5698d3
set athena engine version and update regex
robertsweetman Dec 13, 2023
1c7ccfa
capture IP addresses and ports separately
robertsweetman Dec 14, 2023
d7836f9
fix terraform escape sequences
robertsweetman Dec 14, 2023
b5e584e
fix incorrect character
robertsweetman Dec 14, 2023
a0d4eee
correct regex column number
robertsweetman Dec 14, 2023
ffaa6e5
add nlb log format and change log path
robertsweetman Dec 14, 2023
e8ed350
terraform-docs: automated action
github-actions[bot] Dec 14, 2023
3c48a34
remove glue crawler
robertsweetman Dec 14, 2023
6d1eda3
remove glue crawler, change permissions
robertsweetman Dec 14, 2023
ac3f887
terraform-docs: automated action
github-actions[bot] Dec 14, 2023
3fbe164
formatting
robertsweetman Dec 14, 2023
883d7f2
add missing definition
robertsweetman Dec 14, 2023
de504ba
terraform-docs: automated action
github-actions[bot] Dec 14, 2023
baaa6cf
policy names must be unique
robertsweetman Dec 14, 2023
53b42fb
update s3 bucket ref, make table resources conditional
robertsweetman Dec 15, 2023
3d4afd7
terraform-docs: automated action
github-actions[bot] Dec 15, 2023
022aca7
use correct version tag syntax
robertsweetman Dec 15, 2023
415766f
terraform-docs: automated action
github-actions[bot] Dec 15, 2023
b98408b
remove un-used variable
robertsweetman Dec 18, 2023
3723cec
terraform-docs: automated action
github-actions[bot] Dec 18, 2023
7098f8a
add partition keys
robertsweetman Dec 18, 2023
7f9e66a
change table parameters
robertsweetman Dec 19, 2023
3eeab77
set parameters as strings
robertsweetman Dec 19, 2023
80a7cff
deal with string literal issues
robertsweetman Dec 19, 2023
a7c05f5
Commit changes made by code formatters
github-actions[bot] Dec 19, 2023
9b24fcc
switch value back
robertsweetman Dec 19, 2023
74945f0
add partition key
robertsweetman Dec 19, 2023
412af78
change partition format
robertsweetman Dec 19, 2023
8217173
Commit changes made by code formatters
github-actions[bot] Dec 19, 2023
07ff44d
remove partition keys
robertsweetman Dec 19, 2023
11453bc
update storage location template path
robertsweetman Dec 19, 2023
55407fc
update partition keys and projection
robertsweetman Dec 19, 2023
f308254
Commit changes made by code formatters
github-actions[bot] Dec 19, 2023
33ac38a
remove location trailing slash
robertsweetman Dec 19, 2023
ff1ce5b
use regex from docs
robertsweetman Dec 19, 2023
1474e04
update network log table to match docs
robertsweetman Dec 19, 2023
79802c3
add network LB policy
robertsweetman Dec 19, 2023
0ae136c
terraform-docs: automated action
github-actions[bot] Dec 19, 2023
3af43e1
specify glue policy based on load balancer type
robertsweetman Dec 19, 2023
61512ee
terraform-docs: automated action
github-actions[bot] Dec 19, 2023
1bd29ac
fix spelling
robertsweetman Dec 19, 2023
b643c66
update readme, use same permissions for both lb types
robertsweetman Dec 20, 2023
9b6c9fe
terraform-docs: automated action
github-actions[bot] Dec 20, 2023
5e37079
Commit changes made by code formatters
github-actions[bot] Dec 20, 2023
3f3961a
flatten resources
robertsweetman Dec 21, 2023
c931885
fix glue_s3 resource
robertsweetman Dec 21, 2023
8d18454
very dubious check
robertsweetman Dec 21, 2023
d6307d8
specify default as sse-s3 for network loadbalancer to work
robertsweetman Dec 21, 2023
8e62cce
make sse_algorithm value optional
robertsweetman Jan 2, 2024
53ca6d3
remove default from module
robertsweetman Jan 2, 2024
a96194c
specify sse_algorithm based on type
robertsweetman Jan 2, 2024
bfd10f6
update readme ref: bucket permissions
robertsweetman Jan 2, 2024
0d4f8e3
Commit changes made by code formatters
github-actions[bot] Jan 2, 2024
d3efe8f
add depends_on s3 bucket module
robertsweetman Jan 3, 2024
775e510
Commit changes made by code formatters
github-actions[bot] Jan 3, 2024
242ff4e
update readme
robertsweetman Jan 3, 2024
2fe90ed
remove sql create table as this manual step is no longer needed
robertsweetman Jan 3, 2024
1e89cab
fix alb test
robertsweetman Jan 3, 2024
014b886
Commit changes made by code formatters
github-actions[bot] Jan 3, 2024
b0f45e9
typo
robertsweetman Jan 3, 2024
af233ca
use version GUID for s3 bucket module not version tag
robertsweetman Jan 3, 2024
b464eb4
terraform-docs: automated action
github-actions[bot] Jan 3, 2024
9e3b4de
force checks to rerun
robertsweetman Jan 3, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
108 changes: 92 additions & 16 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,9 +2,9 @@

[![repo standards badge](https://img.shields.io/endpoint?labelColor=231f20&color=005ea5&style=for-the-badge&label=MoJ%20Compliant&url=https%3A%2F%2Foperations-engineering-reports.cloud-platform.service.justice.gov.uk%2Fapi%2Fv1%2Fcompliant_public_repositories%2Fendpoint%2Fmodernisation-platform-terraform-loadbalancer&logo=)](https://operations-engineering-reports.cloud-platform.service.justice.gov.uk/public-report/modernisation-platform-terraform-loadbalancer)

A Terraform module that creates application loadbalancer (with loadbalancer security groups) in AWS with logging enabled, s3 to store logs and Athena DB to query logs.
A Terraform module that creates an application loadbalancer (with loadbalancer security groups) or network loadbalancer in AWS with logging enabled, s3 to store logs and Athena DB to query logs.

An s3 bucket name can be provided in the module by adding the `existing_bucket_name` variable and adding the bucket name. Otherwise, if no bucket exists one will be created and no variable needs to be set in the module. If using an existing bucket the logs will need to be moved with the following folder structure {application_name}/AWSLogs/{account_number}/elasticloadbalancing/ otherwise you will experience errors with the gluecrawler function.
An s3 bucket name can be provided in the module by adding the `existing_bucket_name` variable and adding the bucket name. Otherwise, if no bucket exists one will be created and no variable needs to be set in the module. Application loadbalancers and network loadbalancers do not log to the same S3 bucket location. If you're using existing buckets they also need to have specific permissions applied to them. See the [External buckets](#external-buckets) section for more information.

Either pass in existing security group(s) to attach to the load balancer using the `security_groups` variable, or define `loadbalancer_ingress_rules` and `loadbalancer_egress_rules` variables to create a new security group within the module.

Expand Down Expand Up @@ -45,14 +45,91 @@ locals {

Loadbalancer target groups and listeners need to be created separately.

To run queries in Athena do the following:
Go to the Athena console and click on Saved Queries https://console.aws.amazon.com/athena/saved-queries/home
The use of "aws_glue_catalog_table" resources for application and network loadbalancers means that logs appearing in the S3 bucket will be available to query via Athena without having to carry out any manual Athena config steps.

Click the new saved query that is named `<custom_name>`-create-table and Run it. You only have to do it once.
## Module created S3 access_logs bucket

Try a query like `select * from lb_logs limit 100;`
By default the loadbalancer will set up an access_logs bucket for you, unless you set access_logs = false initially for testing or some other reason. Setting this back to true after the lb has been deployed will then create the bucket for you. The reason for the 'depends_on' here is that without the module.s3-bucket resource being created first, the module.lb resource will fail with a validation error.

```hcl
depends_on = [
module.s3-bucket
]
```

## External buckets

If you decide to use externally created buckets they need to have been created and have appropriate permissions applied to them BEFORE `access_logs = true` and `existing_bucket_name` values are added to the lb code. If you add these values before the bucket is created you will get an error because the lb module will run a check to see if the s3 bucket is writeable and if it is not it will fail.

So to use `external_bucket_name` the deployment steps are:

1. Set `access_logs = false` in the lb create code & create the lb
2. Create the bucket - making sure the appropriate permissions are applied
3. Set `existing_bucket_name` in the lb create code as your-bucket-name-GUID

### External bucket permissions

For simplicity the bucket can be created with the following policy attached to it. This applies whether the loadbalancer is an "application" or "network" loadbalancer. This uses the bucket_policy_v2 implementation using the s3_bucket module:

```hcl
public-lb-logs-bucket = {
sse_algorithm = "AES256" # required for Network Loadbalancers
bucket_policy_v2 = [
{
effect = "Allow"
actions = [
"s3:PutObject",
]
principals = {
identifiers = ["arn:aws:iam::652711504416:root"]
type = "AWS"
}
},
{
effect = "Allow"
actions = [
"s3:PutObject"
]
principals = {
identifiers = ["delivery.logs.amazonaws.com"]
type = "Service"
}

conditions = [
{
test = "StringEquals"
variable = "s3:x-amz-acl"
values = ["bucket-owner-full-control"]
}
]
},
{
effect = "Allow"
actions = [
"s3:GetBucketAcl"
]
principals = {
identifiers = ["delivery.logs.amazonaws.com"]
type = "Service"
}
}
]
iam_policies = module.baseline_presets.s3_iam_policies
}
```

If you want to see exactly what policies are needed for each then refer to [NLB Requirements](https://docs.aws.amazon.com/elasticloadbalancing/latest/network/load-balancer-access-logs.html#access-logging-bucket-requirements) and [ALB Requirements](https://docs.aws.amazon.com/elasticloadbalancing/latest/application/enable-access-logging.html#attach-bucket-policy)

## Network Loadbalancer caveats

* Access logs are created only if the load balancer has a TLS listener and they contain information only about TLS requests.
* Network loadbalancers only support SSE-S3 encryption for access logs, not aws:kms (AWS managed keys).
* They can support customer managed keys but this is not currently supported by this module.
* No "verify bucket permissions" test file is created in the relevant bucket, only that the terraform apply step will fail with a validation error if the permissions and the bucket encryption parameters are not correct.

## Application Loadbalancer caveats

* It's worth noting that Application LB's will create a test file in the S3 bucket to verify that the bucket permissions are correct.

## Usage

Expand Down Expand Up @@ -163,28 +240,28 @@ If you're looking to raise an issue with this module, please create a new issue

| Name | Source | Version |
|------|--------|---------|
| <a name="module_s3-bucket"></a> [s3-bucket](#module\_s3-bucket) | github.com/ministryofjustice/modernisation-platform-terraform-s3-bucket | 8688bc15a08fbf5a4f4eef9b7433c5a417df8df1 |
| <a name="module_s3-bucket"></a> [s3-bucket](#module\_s3-bucket) | github.com/ministryofjustice/modernisation-platform-terraform-s3-bucket | v7.1.0 |

## Resources

| Name | Type |
|------|------|
| [aws_athena_database.lb-access-logs](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/athena_database) | resource |
| [aws_athena_named_query.main](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/athena_named_query) | resource |
| [aws_athena_workgroup.lb-access-logs](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/athena_workgroup) | resource |
| [aws_glue_crawler.ssm_resource_sync](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/glue_crawler) | resource |
| [aws_iam_policy.lb_glue_crawler](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_policy) | resource |
| [aws_iam_role.lb_glue_crawler](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_role) | resource |
| [aws_iam_role_policy_attachment.lb_glue_crawler](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_role_policy_attachment) | resource |
| [aws_iam_role_policy_attachment.lb_glue_service](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_role_policy_attachment) | resource |
| [aws_glue_catalog_table.application_lb_logs](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/glue_catalog_table) | resource |
| [aws_glue_catalog_table.network_lb_logs](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/glue_catalog_table) | resource |
| [aws_iam_policy.glue_s3](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_policy) | resource |
| [aws_iam_role.glue](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_role) | resource |
| [aws_iam_role_policy_attachment.glue_s3](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_role_policy_attachment) | resource |
| [aws_iam_role_policy_attachment.glue_service](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/iam_role_policy_attachment) | resource |
| [aws_lb.loadbalancer](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/lb) | resource |
| [aws_lb_target_group.this](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/lb_target_group) | resource |
| [aws_lb_target_group_attachment.this](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/lb_target_group_attachment) | resource |
| [aws_security_group.lb](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/security_group) | resource |
| [aws_elb_service_account.default](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/elb_service_account) | data source |
| [aws_iam_policy_document.bucket_policy](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/iam_policy_document) | data source |
| [aws_iam_policy_document.lb_glue_crawler](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/iam_policy_document) | data source |
| [aws_iam_policy_document.lb_glue_crawler_assume](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/iam_policy_document) | data source |
| [aws_iam_policy_document.glue_assume](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/iam_policy_document) | data source |
| [aws_iam_policy_document.glue_s3](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/iam_policy_document) | data source |
| [aws_vpc.shared](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/data-sources/vpc) | data source |

## Inputs
Expand All @@ -205,7 +282,6 @@ If you're looking to raise an issue with this module, please create a new issue
| <a name="input_load_balancer_type"></a> [load\_balancer\_type](#input\_load\_balancer\_type) | application or network | `string` | `"application"` | no |
| <a name="input_loadbalancer_egress_rules"></a> [loadbalancer\_egress\_rules](#input\_loadbalancer\_egress\_rules) | Create new security group with these egress rules for the loadbalancer. Or use the security\_groups var to attach existing group(s) | <pre>map(object({<br> description = string<br> from_port = number<br> to_port = number<br> protocol = string<br> security_groups = list(string)<br> cidr_blocks = list(string)<br> }))</pre> | `{}` | no |
| <a name="input_loadbalancer_ingress_rules"></a> [loadbalancer\_ingress\_rules](#input\_loadbalancer\_ingress\_rules) | Create new security group with these ingress rules for the loadbalancer. Or use the security\_groups var to attach existing group(s) | <pre>map(object({<br> description = string<br> from_port = number<br> to_port = number<br> protocol = string<br> security_groups = list(string)<br> cidr_blocks = list(string)<br> }))</pre> | `{}` | no |
| <a name="input_log_schedule"></a> [log\_schedule](#input\_log\_schedule) | n/a | `string` | `"cron(15 1 ? * MON *)"` | no |
| <a name="input_public_subnets"></a> [public\_subnets](#input\_public\_subnets) | Badly named variable, use subnets instead. Keeping for backward compatibility | `list(string)` | `[]` | no |
| <a name="input_region"></a> [region](#input\_region) | AWS Region where resources are to be created | `string` | n/a | yes |
| <a name="input_s3_versioning"></a> [s3\_versioning](#input\_s3\_versioning) | A boolean that determines whether s3 will have versioning | `bool` | `true` | no |
Expand Down
Loading
Loading