[Security Solution][Alerts] Replace schemas derived from FieldMaps with versioned alert schema #127218

marshallmain · 2022-03-08T21:58:57Z

Summary

The goal of this PR is to create a system of schemas that are:

Easy to read
Usable historical records of alert schemas from previous Kibana versions
Accurate for every field
Usable on both server and client side

Current alert schemas fail these criteria because they are generated from FieldMaps (adding complexity), updated in place, fail to document all fields accurately (e.g. kibana.alert.group.index is not required by the schema, but should exist for EQL alerts), and are defined server side.

Motivation - Development speed and quality

We have already run into one bug where a required field was not populated in some alert documents. Once a bug ships that creates documents incorrectly, any fix requires user action to initiate a re-index of the alerts in addition to the developer time to create and validate the fix. The changes proposed here would catch this bug at compile time. These sorts of bugs become harder to catch as the schema evolves over time and fields get added, removed, and changed. Keeping the schemas separated by version will help reduce the risk of repeated schema changes over time causing fields to be incorrectly included in or omitted from alert documents.

We are also spending more time than necessary communicating details of the alerts schema over Slack and Zoom. It will be far more efficient for the code to clearly communicate more details about the alert schema. With a more comprehensive static schema, the knowledge will transfer to new developers more efficiently.

Static types are a powerful tool for ensuring code correctness. However, each deviation of the static type from the actual runtime structure adds places where developers may need to cast, assert, or use conditional logic to satisfy the compiler. The current static types require frequent workarounds when the static types don't match what developers know or believe is true about the runtime type of the alert documents. These runtime workarounds establish patterns that evade the type system - costing developer time to create and maintain in addition to increasing the risk of bugs due to the additional complexity. Accurate static types are excellent documentation of the data structures we use but it's crucial that the static types are comprehensive to minimize cases where runtime checks are needed.

Current Structure

Alert schemas are defined by the FieldMap structures that also define the Elasticsearch alert index mappings. These FieldMaps are unversioned and may be modified in the future to add more fields. In addition, the Elasticsearch index mappings do not always align perfectly with the _source of the alerts that we intend to write. For example, the kibana.alert.rule.parameters field is type flattened in Elasticsearch mappings but we know that alerts from a particular rule type will populate kibana.alert.rule.parameters with some specific set of fields defined by the rule type. Also, multiple types of alerts may live in the same alerts index with similar but not identical sets of fields.

Proposed structure - Common Alert Schema Directory

This PR has 2 primary code changes: (1) separate the alert document schemas from the FieldMaps, and (2) set up a code structure that enables easy versioning of alert schemas. During the Detection Engine migration to the rule registry we used the FieldMaps to define the alert schema, but ended up with numerous type casts and some bugs in the process. This PR creates a new common directory x-pack/plugins/security_solution/common/detection_engine/schemas/alerts to store the various Security alert schemas by Kibana version. This PR also fully removes the RACAlert and WrappedRACAlert types that were derived from the FieldMaps. However, the schemas defined in this PR are not 100% comprehensive - some fields are not typed as strictly as they could be, particularly in the kibana.alert.rule.parameters field. Follow up work will add more detail to the schemas and provide more comprehensive coverage.

x-pack/plugins/security_solution/common/detection_engine/schemas/alerts initially contains index.ts and one folder, 8.0.0. index.ts imports the schemas from 8.0.0 and re-exports them as ...Latest, denoting that those are the "write" schemas. The reason for this is that as we add new schemas, there are many places server side where we want to ensure that we're writing the latest alert schema. By having index.ts re-export 8.0.0 schemas, when we add make a new alert schema in the future (e.g. adding an additional field in 8.x) we can simply update index.ts to re-export the new schema instead of the previous schema. index.ts also exports a DetectionAlert which is the "read" schema - this type will be maintained as a union of all versioned alert schemas, which is needed to accurately type alerts that are read from the alerts index.

Reading vs writing alerts

When writing code that deals with creating a new alert document, always use the schema from alerts/index.ts, not from a specific version folder. This way when the schema is updated in the future, your code will automatically use the latest alert schema and the static type system will tell us if code is writing alerts that don't conform to the new schema.

When writing code that deals with reading alerts, it must be able to handle alerts from any schema version. The "read schema" in index.ts DetectionAlert is a union of all of the versioned alert schemas since a valid alert from the .alerts index could be from any version. Initially there is only one versioned schema, so DetectionAlert is identical to DetectionAlert800.

Generally, Solution code should not be directly importing alert schemas from a specific version. Alert writing code should use the latest schema, and alert reading code should use the union of all schemas.

Adding new schemas

In the future, when we want to add new fields, we should create a new folder named with the version the field is being added in, create the updated schema in the new folder, and update index.ts to re-export the schemas for the new version instead of the previous version. Also, update the "read schema" DetectionAlert type in index.ts to include the new schema in addition to the previous schemas. The schema in the new version folder can either build on the previous version, e.g. 8.4.0 could import the schema from 8.0.0 and simply add a few new fields, or for larger changes the new version could build the schema from scratch. Old schemas should not change when new fields are added!

Changing existing schemas

The schema in the 8.0.0 folder, and any future versioned folders after the version is released, should not be updated with new fields. Old schemas should only be updated if a bug is discovered and it is determined that the schema does not accurately represent the alert documents that were actually written by that version, e.g. if a field is typed as string in the schema but was actually written as string[]. The goal of these schemas is to represent documents accurately as they were written and since we aren't changing the documents that already exist, the schema should generally not change.

No changes

If a version of Kibana makes no changes to the schema, a new folder for that version is not needed.

Design decisions

Why not combine the FieldMaps and alert schema, creating a single structure that can define both?

FieldMaps are integrated tightly with Elasticsearch mappings already, with minimal support for accurate TypeScript types of the fields. I wanted to avoid adding tons of extra information in to the FieldMaps that would not be used for the Elasticsearch mappings. Instead later we can write a bit of code to ensure that the alert schemas are compatible with the FieldMap schemas, essentially ensuring that the alert schemas extend the FieldMap schemas.

Why is | undefined used in field definitions instead of making fields optional?

Making all fields required, but some | undefined in the type, helps ensure that we don't forget to copy over fields that may be undefined. If the field is optional, e.g. [ALERT_RULE_NOTE]?: string, then the compiler won't complain if the field is completely left out when we build the alert document. However, when it's defined as [ALERT_RULE_NOTE]: string | undefined instead, the field must be explicitly provided when creating an object literal of the alert type - even if the value is undefined. This makes it harder to forget to populate all of the fields. This can be seen in build_alert.ts where removing one of the optional fields from the return value results in a compiler error.

Why do we need to version the schemas instead of adding all new fields as | undefined?

Adding new fields as | undefined when they're actually required reduces the accuracy of the schema, which makes it less useful and harder to work with. If we decide to add a new field and always populate it going forward then accurately representing that in the static type makes it easier to work with alerts during the alert creation process. When a field is typed as | undefined but a developer knows that it should always exist, it encourages patterns that fight the type system through type-casting, assertions, using ?? <some default value>, etc. This makes the code harder to read, harder to reason about, and thus harder to maintain because the knowledge of "this field is typed as | undefined but actually always exists here" is not represented in the code and only lives in developers minds. Versioned alert schemas aim to turn the static types into an asset that precisely documents what the alert document structure is.

Potential Future Work

Add code to ensure that the alert schema extends the FieldMap schema, so that we know the alerts are compatible with the ES mappings
Create versioned io-ts rules schemas and use those to define the kibana.alert.rule.parameters type for each type of alert
Separate all field maps by version
Add schemas for legacy signals
Create standardized, type-safe access patterns for alerts that can translate between legacy and AAD field names on both frontend and backend
Create static types for alerts from other Security Solution rule types

ecezalp · 2022-03-14T13:38:46Z

@elasticmachine merge upstream

ecezalp

LGTM. This is a really cool idea and I thought that these comments were very helpful.

On a slightly related note I was wondering about what you may think about versioning alert instances. On the product roadmap we have "Persistence of Investigation Time Enrichments", but I can also see it being useful for things like Host Risk Score trend of an alert etc. And conceptually we could have a record of the migrations that an alert may go through.

ecezalp · 2022-03-14T13:43:15Z

x-pack/plugins/rule_registry/common/schemas/8.0.0/index.ts

+  ALERT_RULE_TAGS,
+  TIMESTAMP,
+];
+export type CommonAlertFieldName800 = Values<typeof commonAlertFieldNames>;


curious to hear your thoughts about the naming convention here with CommonAlertFieldName800. If we plan to associate the version numbers with the release numbers I can see it getting a little awkward with no separators for the 800 part. If we are going to bump it by one with each change then we probably don't need to start with 800. What do you think?

I agree that ending with 800 is a bit awkward. I initially wanted to use something more like CommonAlertFieldName_8_0_0 at first, but the naming convention linter complains about using underscores if the type isn't all caps. Alerting uses a similar convention in the SO migrations logic which seems to have worked so far so I adopted that instead of trying to disable the linter rule for these type names.

If we are going to bump it by one with each change then we probably don't need to start with 800

With this proposal we wouldn't bump the number by 1, instead we'd increment it to the version of Kibana that's shipping the change, e.g. the next one might be CommonAlertFieldName830 for 8.3.0 if we make changes soon. But we haven't made changes in 8.1 or 8.2 so there would never be a CommonAlertFieldName810 or 820.

In the past we used integer version numbers, starting at 1 and incrementing on each Kibana version that shipped changes, for the legacy signals mappings and it was pretty confusing IMO. Most of the time when looking at version numbers what I wanted to now was what version of Kibana a particular version number shipped with so it would have saved time to just version the signals mappings with the Kibana version. Instead with the integer it has to be cross-referenced with Git history to figure out when it shipped.

The other possible option I considered was dropping the 800 from the name entirely and relying on the 8.0.0 in the folder path to differentiate this CommonAlertFieldName type from possible future CommonAlertFieldName types. I worried that that would make it too easy to accidentally import the wrong type though.

This is pretty nitpicky, but I'd be in favor of dropping the suffix entirely. This naming schema will become ambiguous if there's ever a minor or patch version > 10, or once we hit 10.x. Sure, it makes it easier to import the wrong version, but import paths matter and devs should be able to navigate that. If nothing else, code reviewers should be able to mitigate this possibility.

ecezalp · 2022-03-14T13:52:40Z

x-pack/plugins/rule_registry/server/utils/persistence_types.ts

@@ -27,7 +29,9 @@ export type PersistenceAlertService = <T>(
 ) => Promise<PersistenceAlertServiceResult<T>>;

 export interface PersistenceAlertServiceResult<T> {
-  createdAlerts: Array<T & { _id: string; _index: string }>;
+  createdAlerts: Array<
+    AlertWithCommonFieldsLatest<T> & { _id: string; _index: string; [key: string]: SearchTypes }


should it not be possible to include { _id: string; _index: string; [key: string]: SearchTypes } in AlertWithCommonFieldsLatest by default, or use a more specific T?

So what's happening here is the PersistenceAlertService takes in documents of type T and writes them to Elasticsearch after injecting the common fields into each document - making them AlertWithCommonFieldsLatest<T>. AlertWithCommonFieldsLatest<T> is intended to be the type of the _source of the written documents. When the function returns though, it also includes the _id and _index metadata that Elasticsearch returns so that we can make those values available in the actions context for the Cases connector, even though those metadata fields aren't part of _source.

I think you're right that [key: string]: SearchTypes index signature is unnecessary though so I'll remove it.

marshallmain · 2022-03-15T20:18:12Z

@elasticmachine merge upstream

elasticmachine · 2022-03-22T18:10:18Z

Pinging @elastic/security-detections-response (Team:Detections and Resp)

elasticmachine · 2022-03-22T18:10:19Z

Pinging @elastic/security-solution (Team: SecuritySolution)

kobelb

The changes that have been made to the rule_registry plugin are fine.

Having separate types for creating new alerts versus reading/updating existing alerts makes a lot of sense to me and I see the benefit there. Additionally, I get the reason why we'd want a historical record of the alerts schema during the different versions. However, the approach that this PR implements will result in quite a few type definitions that aren't used, as only the latest types should actually be used. It'll be interesting to see how this approach works out, but it seems fine for now.

madirey · 2022-03-29T19:04:22Z

x-pack/plugins/rule_registry/common/schemas/8.0.0/index.ts

+If you are adding new fields for a new release of Kibana, create a new sibling folder to this one
+for the version to be released and add the field(s) to the schema in that folder.
+
+Then, update `../index.ts` to import from the new folder that has the latest schemas and add the


Might also want to mention updating the *Latest exports below. https://github.com/elastic/kibana/pull/127218/files#diff-f3b181c03305d4fb9648d9940a91dc9e1ac8411d6e1be7dee7b1013700892032R15

madirey · 2022-03-29T19:07:27Z

x-pack/plugins/security_solution/common/detection_engine/schemas/alerts/8.0.0/index.ts

+
+export type GenericAlert800 = AlertWithCommonFields800<BaseFields800>;
+
+// This is the type of the final generated alert including base fields, common fields


This is all really great, thank you!

madirey · 2022-03-29T19:08:10Z

x-pack/plugins/security_solution/common/detection_engine/schemas/alerts/8.0.0/index.ts

+for the version to be released and add the field(s) to the schema in that folder.
+
+Then, update `../index.ts` to import from the new folder that has the latest schemas and add the
+new schemas to the union of all alert schemas.


Again, might want to call out https://github.com/elastic/kibana/pull/127218/files#diff-f839c699bba062f8cf03c73c58a1ac4d0dcb905c3692dba9259031125bdaf03eR21

madirey · 2022-03-29T19:09:10Z

...security_solution/server/lib/detection_engine/notifications/schedule_notification_actions.ts


 export type NotificationRuleTypeParams = RuleParams & {
  id: string;
  name: string;
 };

-const convertToLegacyAlert = (alert: RACAlert) =>
+const convertToLegacyAlert = (alert: DetectionAlert) =>


madirey · 2022-03-29T19:45:56Z

I believe the comment above is inaccurate, as every schema will be used at least for reading (in the intersection of types).

madirey

Code LGTM... couple minor things that could be addressed later, but seems to work as-is. There's a lot here, so it's possible something got by me... but I performed some sanity checks, such as: created query, threshold, and eql rules. Verified that the generated alerts have a proper UUID and ancestry tree. Building block alerts are shown correctly for EQL alerts. Timelines load properly for each rule type. Pre-built rules load and can be activated. Types resolve correctly in IDE (VSCode). Removing a required field from an alert or changing a value to an invalid type causes static type errors.

This was a monumental undertaking and will be really, really great for development moving forward. Thanks, @marshallmain !

YulNaumenko

LGTM

banderror

@marshallmain I think it's a fantastic proposal and I find the concept itself very much right.
Have some questions about the implementation details though 🙂

I went through most of the code changes and left some comments, some of which I find important, especially the one about the union types.

I'm going to post another bunch of comments and questions that are related to some of the ideas mentioned in the PR description.

x-pack/plugins/rule_registry/common/schemas/8.0.0/index.ts

banderror · 2022-03-29T21:57:23Z

x-pack/plugins/rule_registry/common/schemas/8.0.0/index.ts

+const commonAlertIdFieldNames = [ALERT_INSTANCE_ID, ALERT_UUID];
+export type CommonAlertIdFieldName800 = Values<typeof commonAlertIdFieldNames>;


Why these two id fields are not included in CommonAlertFields800 and have their own union type?

These are used by Observability for the lifecycle rule executor logic. I'm not sure why they're separated out, but I kept them the same way they were in get_common_alert_fields.ts

x-pack/plugins/security_solution/common/detection_engine/schemas/alerts/8.0.0/index.ts

banderror · 2022-03-29T22:07:57Z

x-pack/plugins/security_solution/common/detection_engine/schemas/alerts/8.0.0/index.ts

+  [ALERT_RULE_CONSUMER]: string;
+  [ALERT_ANCESTORS]: Ancestor800[];
+  [ALERT_STATUS]: string;
+  [ALERT_WORKFLOW_STATUS]: string;


Off-topic, but it would be great to eventually have a jsdoc comment for every field in the alert schema containing a description (what it means), examples of values, and any other important information.

banderror · 2022-03-29T22:45:03Z

x-pack/plugins/security_solution/common/detection_engine/schemas/alerts/8.0.0/index.ts

+
+// This is the type of the final generated alert including base fields, common fields
+// added by the alertWithPersistence function, and arbitrary fields copied from source documents
+export type DetectionAlert800 = GenericAlert800 | EqlShellAlert800 | EqlBuildingBlockAlert800;


I'm not sure how union type is gonna work here, maybe missing something.

Please check this example where properties x, y and z are not accessible without an additional type casting:

I'd expect similar issues with access to fields in the real DetectionAlert800: ALERT_GROUP_INDEX is defined as a number but available as SearchTypes in DetectionAlert800

export interface EqlBuildingBlockFields800 extends BaseFields800 { [ALERT_GROUP_ID]: string; [ALERT_GROUP_INDEX]: number; [ALERT_BUILDING_BLOCK_TYPE]: 'default'; }

I think it's going to get worse when we'll need to start adding more versions of DetectionAlert to the final union type:

// When new Alert schemas are created for new Kibana versions, add the DetectionAlert type from the new version // here, e.g. `export type DetectionAlert = DetectionAlert800 | DetectionAlert820` if a new schema is created in 8.2.0 export type DetectionAlert = DetectionAlert800;

I think what we need here instead of using the union type is constructing an interface manually that would:

Make all the common (base) fields required

Make all the alert type-specific fields optional (x | undefined)

export type DetectionAlert800 = CommonAlertFields800 & BaseFields800 & Partial<EqlShellFields800> & Partial<EqlBuildingBlockFields800>;

Still, combining multiple versions into a single DetectionAlert interface is a little bit unclear to me.

Also, TS has a pretty nasty bug in the implementation of deeply nested type intersection (microsoft/TypeScript#47935) which can become a source of bugs in the code (unless all the fields in the alert schema will be flat).

I'd expect similar issues with access to fields in the real DetectionAlert800: ALERT_GROUP_INDEX is defined as a number but available as SearchTypes in DetectionAlert800

This is working as intended IMO - when an alert is first retrieved, without additional checks at runtime the type of ALERT_GROUP_INDEX really could be anything. Once we add the full types for ALERT_RULE_PARAMETERS though, the field alert[ALERT_RULE_PARAMETERS].type can be used as a discriminant at runtime to narrow the type down to EQL alerts only. At that point the type should be EqlBuildingBlockAlert800 | EqlShellAlert800, so we'd still need some other discriminant between those 2 types. But in general for alerts from different types of rules, we can use a known field like alert[ALERT_RULE_PARAMETERS].type as a discriminant.

Alternatively, developers can fetch ALERT_GROUP_INDEX, accept that its type is SearchTypes, and then do extra runtime validation on the retrieved value to ensure that it's not undefined or an object or some other unexpected value.

In both cases though I think it's a feature that ALERT_GROUP_INDEX has a very general type if it's retrieved from DetectionAlert - we're warning developers that the value could be anything and they need to do more validation there. For fields that are common and required across all alert types, e.g. ALERT_RULE_DESCRIPTION, the type system can convey that the value received from any DetectionAlert will always be string and extra validation isn't needed.

banderror · 2022-03-30T00:41:49Z

A few other thoughts and questions.

Reading vs writing alerts.

Fully support the idea of having separate write and read models! How could we make it more explicit in the code? E.g. it's not immediately obvious that *Latest naming represents the write model. DetectionAlert doesn't have any suffixes and it's also not obvious that it represents the read model.

Changing existing schemas
The schema in the 8.0.0 folder, and any future versioned folders after the version is released, should not be updated with new fields. Old schemas should only be updated if a bug is discovered and it is determined that the schema does not accurately represent the alert documents that were actually written by that version

I remember discussions within RAC around breaking changes in the alert schema and how we could support schema evolution with runtime fields. By adding runtime fields to an old concrete index, we'd effectively change the schema of the old alerts and would have to reflect it in the corresponding DetectionAlert{Version} interface. Just a thought.

Regarding breaking changes. I remember the last decision was "we're not gonna introduce them". But let's say at some point it happens. How would we support (in the static TS schema) something like a type change of a field, field rename, etc?

Add code to ensure that the alert schema extends the FieldMap schema, so that we know the alerts are compatible with the ES mappings

Extends in the sense of interface X extends Y? Extending an interface like that allows to override properties, for instance Y.a: number with X.a: string which can lead to the final alert schema becoming incompatible with the ES mappings generated from the FieldMap. Perhaps the hand-made alert interface should match the interface inferred from the FieldMap, but should not extend it?

Create versioned io-ts rules schemas and use those to define the kibana.alert.rule.parameters type for each type of alert

It would definitely be great to fix our rule schemas (#80792). Regarding versioning, I think whatever is written to kibana.alert.rule.parameters needs to be versioned within the versioned alert schema, but:

I'm not sure I see use cases for old rule parameters in rule executors, rule management, etc.
I think the dependency goes from the Rules subdomain to the Alerts subdomain (Rules -> Alerts) because rules create and write alerts and know about their schema. I don't think it should go vice versa (Alerts -> Rules) because it would create an unnecessary circular dependency between them. In other words, rules should know about alerts, but alerts shouldn't know about rules.
I would define versioned rule parameters within the versioned alerts schema.
Then rules could depend on the "ParamsLatest" schema and use it in their definitions. The caveat here is the need to implement validation for params which shouldn't be a concern of the alerts schema and subdomain imho.
Alternatively, we could keep versioned kibana.alert.rule.parameters schemas and rule params schemas separate and just make sure they match.

kibana-ci · 2022-03-30T02:18:29Z

💚 Build Succeeded

Metrics [docs]

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`@kbn/rule-data-utils`	69	70	+1

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`observability`	426.5KB	426.5KB	+88.0B
`securitySolution`	4.8MB	4.8MB	+276.0B
`triggersActionsUi`	697.2KB	697.3KB	+88.0B
total			+452.0B

Public APIs missing exports

Total count of every type that is part of your API that should be exported but is not. This will cause broken links in the API documentation system. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats exports for more detailed information.

id	before	after	diff
`ruleRegistry`	7	8	+1

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`apm`	31.1KB	31.2KB	+88.0B
`cases`	86.0KB	86.1KB	+88.0B
`infra`	89.6KB	89.7KB	+88.0B
`securitySolution`	250.1KB	250.2KB	+88.0B
`timelines`	286.3KB	286.4KB	+88.0B
`uptime`	24.6KB	24.6KB	+88.0B
total			+528.0B

Unknown metric groups

API count

id	before	after	diff
`@kbn/rule-data-utils`	72	73	+1

ESLint disabled in files

id	before	after	diff
`apm`	15	14	-1
`osquery`	5	4	-1
`securitySolution`	69	68	-1
`uptime`	7	6	-1
total			-4

ESLint disabled line counts

id	before	after	diff
`apm`	88	85	-3
`enterpriseSearch`	9	7	-2
`fleet`	47	46	-1
`osquery`	122	119	-3
`uptime`	49	43	-6
total			-15

References to deprecated APIs

id	before	after	diff
`canvas`	70	64	-6
`dashboard`	78	72	-6
`data`	475	465	-10
`dataEnhanced`	55	49	-6
`discover`	26	20	-6
`fleet`	20	19	-1
`lens`	18	14	-4
`management`	2	1	-1
`maps`	456	330	-126
`monitoring`	40	28	-12
`upgradeAssistant`	12	7	-5
`visDefaultEditor`	205	155	-50
`visTypeVega`	4	3	-1
`visualizations`	17	13	-4
total			-238

Total ESLint disabled count

id	before	after	diff
`apm`	103	99	-4
`enterpriseSearch`	9	7	-2
`fleet`	55	54	-1
`osquery`	127	123	-4
`securitySolution`	509	508	-1
`uptime`	56	49	-7
total			-19

History

💔 Build #34736 failed 8ab1020
💚 Build #34671 succeeded 63d0545
💔 Build #34559 failed 52ec700
💚 Build #34087 succeeded ed48a31
💚 Build #32763 succeeded dac9ded
💚 Build #30723 succeeded 3371fb8

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

marshallmain · 2022-03-30T04:14:49Z

Fully support the idea of having separate write and read models! How could we make it more explicit in the code?

I'm not sure how far we want to go on naming types based on how they're used (read vs write) rather than what they are (v8.0.0, Latest, etc). E.g. if we end up wanting to write an old version for some reason, maybe updating alerts at some point, it may break the naming model if we try to name the types based on how we think they're being used. That's not to say we can never change the names, but the Latest and version suffix are intrinsic properties of the types right now.

I remember discussions within RAC around breaking changes in the alert schema and how we could support schema evolution with runtime fields. By adding runtime fields to an old concrete index, we'd effectively change the schema of the old alerts and would have to reflect it in the corresponding DetectionAlert{Version} interface. Just a thought.

Regarding breaking changes. I remember the last decision was "we're not gonna introduce them". But let's say at some point it happens. How would we support (in the static TS schema) something like a type change of a field, field rename, etc?

For these types we wouldn't support runtime fields. The types here only represent the _source of the documents. Access through fields and runtime fields is a much more complex issue that I think we should address, and perhaps the types here could be part of the basis for a long term solution, but I would keep these types unchanged so the record of _source remains unchanged.

Perhaps the hand-made alert interface should match the interface inferred from the FieldMap, but should not extend it?

Afaik B extends A means B must be assignable to A. (TS Playground) But either way the important part is ensuring that the alert schema is assignable to the ES mapping schema.

Regarding versioning, I think whatever is written to kibana.alert.rule.parameters needs to be versioned within the versioned alert schema, but:

All great points here. The rule schemas are fairly complex in their own right (part of why I avoided including them in this PR) so I'm a bit nervous about re-implementing any parts of them and attempting to ensure that the re-implementation is in sync with the source-of-truth io-ts schemas. I agree that there isn't a ton of use in rule management/executors for old rule schemas since we can ensure that rules get migrated, but we'll need them in some form to include in the alert schemas.

marshallmain · 2022-03-30T04:15:38Z

Thanks to everyone who reviewed this for the thoughtful comments and feedback!

marshallmain and others added 10 commits March 8, 2022 12:09

Replace schemas derived from FieldMaps with versioned alert schema

7dff107

Merge branch 'main' into versioned-alert-schemas

b8c3c82

Import fixes and comment

694180d

Another import fix

2ce2c37

Separate read and write schemas

2f60cf6

Separate read and write schemas for common alert fields

ecd7f49

fix import

3f29171

Update ALERT_RULE_PARAMETERS type

b1787a6

Fix getField type

96a1a71

Fix more types

4753672

Merge branch 'main' into versioned-alert-schemas

179c195

ecezalp reviewed Mar 14, 2022

View reviewed changes

Remove unneeded index signature from PersistenceAlertServiceResult

a52cf69

kibanamachine and others added 2 commits March 15, 2022 16:18

Merge branch 'main' into versioned-alert-schemas

3371fb8

Merge branch 'main' into versioned-alert-schemas

dac9ded

marshallmain marked this pull request as ready for review March 22, 2022 18:04

marshallmain requested review from a team as code owners March 22, 2022 18:04

marshallmain added the release_note:skip Skip the PR/issue when compiling release notes label Mar 23, 2022

kobelb approved these changes Mar 24, 2022

View reviewed changes

Merge branch 'main' into versioned-alert-schemas

ed48a31

marshallmain and others added 2 commits March 29, 2022 09:08

Merge branch 'main' into versioned-alert-schemas

52ec700

Fix types and tests

be7965f

madirey reviewed Mar 29, 2022

View reviewed changes

Update comment describing new schema process

63d0545

madirey approved these changes Mar 29, 2022

View reviewed changes

YulNaumenko approved these changes Mar 29, 2022

View reviewed changes

banderror reviewed Mar 29, 2022

View reviewed changes

marshallmain added 4 commits March 29, 2022 16:45

Update Ancestor800 type

d8bae7e

Add modified PR description as initial README

8ab1020

Remove duplication in CommonAlertFields definition

63223fe

Add explicit undefined value for rule in mock

70f24ad

marshallmain merged commit 482f819 into elastic:main Mar 30, 2022

This was referenced Mar 30, 2022

[Security Solution][Alerts] Add TS types for remaining alert types #128937

Open

[Security Solution][Alerts] Integrate security rule schemas with alert schemas #128950

Open

[Security Solution][Alerts] Refactor alert document creation logic #119926

Closed

marshallmain mentioned this pull request Apr 19, 2022

[RAC][Rule Registry] Improve the API of RuleDataService and RuleDataClient #106421

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Security Solution][Alerts] Replace schemas derived from FieldMaps with versioned alert schema #127218

[Security Solution][Alerts] Replace schemas derived from FieldMaps with versioned alert schema #127218

marshallmain commented Mar 8, 2022 •

edited

Loading

ecezalp commented Mar 14, 2022

ecezalp left a comment

ecezalp Mar 14, 2022

marshallmain Mar 15, 2022

madirey Mar 29, 2022

ecezalp Mar 14, 2022

marshallmain Mar 15, 2022 •

edited

Loading

marshallmain commented Mar 15, 2022

elasticmachine commented Mar 22, 2022

elasticmachine commented Mar 22, 2022

kobelb left a comment

madirey Mar 29, 2022

madirey Mar 29, 2022

madirey Mar 29, 2022

madirey Mar 29, 2022

madirey commented Mar 29, 2022

madirey left a comment

YulNaumenko left a comment

banderror left a comment

banderror Mar 29, 2022

marshallmain Mar 29, 2022

banderror Mar 29, 2022

banderror Mar 29, 2022

banderror Mar 29, 2022

marshallmain Mar 29, 2022 •

edited

Loading

banderror commented Mar 30, 2022

kibana-ci commented Mar 30, 2022

API count

ESLint disabled in files

ESLint disabled line counts

References to deprecated APIs

Total ESLint disabled count

marshallmain commented Mar 30, 2022

marshallmain commented Mar 30, 2022


		export type GenericAlert800 = AlertWithCommonFields800<BaseFields800>;

		// This is the type of the final generated alert including base fields, common fields

		const commonAlertIdFieldNames = [ALERT_INSTANCE_ID, ALERT_UUID];
		export type CommonAlertIdFieldName800 = Values<typeof commonAlertIdFieldNames>;

[Security Solution][Alerts] Replace schemas derived from FieldMaps with versioned alert schema #127218

[Security Solution][Alerts] Replace schemas derived from FieldMaps with versioned alert schema #127218

Conversation

marshallmain commented Mar 8, 2022 • edited Loading

Summary

Motivation - Development speed and quality

Current Structure

Proposed structure - Common Alert Schema Directory

Reading vs writing alerts

Adding new schemas

Changing existing schemas

No changes

Design decisions

Potential Future Work

ecezalp commented Mar 14, 2022

ecezalp left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

marshallmain Mar 15, 2022 • edited Loading

Choose a reason for hiding this comment

marshallmain commented Mar 15, 2022

elasticmachine commented Mar 22, 2022

elasticmachine commented Mar 22, 2022

kobelb left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

madirey commented Mar 29, 2022

madirey left a comment

Choose a reason for hiding this comment

YulNaumenko left a comment

Choose a reason for hiding this comment

banderror left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

marshallmain Mar 29, 2022 • edited Loading

Choose a reason for hiding this comment

banderror commented Mar 30, 2022

kibana-ci commented Mar 30, 2022

💚 Build Succeeded

Metrics [docs]

Public APIs missing comments

Async chunks

Public APIs missing exports

Page load bundle

API count

ESLint disabled in files

ESLint disabled line counts

References to deprecated APIs

Total ESLint disabled count

History

marshallmain commented Mar 30, 2022

marshallmain commented Mar 30, 2022

marshallmain commented Mar 8, 2022 •

edited

Loading

marshallmain Mar 15, 2022 •

edited

Loading

marshallmain Mar 29, 2022 •

edited

Loading