[CI] IndexingIT testIndexing failing #100371

Closed
mark-vieira opened this issue Oct 5, 2023 · 13 comments
Labels
blocker, :ml, Team:ML, >test-failure

Comments

@mark-vieira
Contributor

This has started failing a lot over the last few days. Frankly, I have no idea which team this should be assigned to.

Build scan:
https://gradle-enterprise.elastic.co/s/rnuiwsuovl67g/tests/:x-pack:qa:rolling-upgrade:v8.2.2%23twoThirdsUpgradedTest/org.elasticsearch.upgrades.IndexingIT/testIndexing
Reproduction line:

./gradlew ':x-pack:qa:rolling-upgrade:v8.2.2#twoThirdsUpgradedTest' -Dtests.class="org.elasticsearch.upgrades.IndexingIT" -Dtests.method="testIndexing" -Dtests.seed=5FCCBD08EED2E750 -Dtests.bwc=true -Dtests.locale=ar-DZ -Dtests.timezone=Etc/GMT-12 -Druntime.java=21

Applicable branches:
8.11, main

Reproduces locally?:
No

Failure history:
https://gradle-enterprise.elastic.co/scans/tests?tests.container=org.elasticsearch.upgrades.IndexingIT&tests.test=testIndexing
Failure excerpt:

java.lang.AssertionError: timed out waiting for green state for index [] cluster state [{
  "cluster_name" : "v8.2.2",
  "cluster_uuid" : "mVPEGDh-SdCu0v5EceFlWA",
  "version" : 680,
  "state_uuid" : "4fJnJEsdRyWShvznWMlEeg",
  "master_node" : "WYDNNPxcSHCzUlxU4LJM9A",
  "blocks" : {
    "indices" : {
      "mounted_index_shared_cache" : {
        "8" : {
          "description" : "index write (api)",
          "retryable" : false,
          "levels" : [
            "write"
          ]
        }
      },
      "mounted_index_full_copy" : {
        "8" : {
          "description" : "index write (api)",
          "retryable" : false,
          "levels" : [
            "write"
          ]
        }
      }
    }
  },
  "nodes" : {
    "lGvXtOvMSZmPEwEQeeGjBw" : {
      "name" : "v8.2.2-1",
      "ephemeral_id" : "bpIxschHTD-M27zJJvM85w",
      "transport_address" : "127.0.0.1:38157",
      "external_id" : "v8.2.2-1",
      "attributes" : {
        "testattr" : "test",
        "ml.config_version" : "11.0.0",
        "xpack.installed" : "true",
        "transform.config_version" : "10.0.0",
        "ml.allocated_processors" : "32",
        "ml.machine_memory" : "101258194944",
        "ml.max_jvm_size" : "536870912",
        "upgraded" : "true",
        "ml.allocated_processors_double" : "32.0"
      },
      "roles" : [
        "data",
        "data_cold",
        "data_content",
        "data_frozen",
        "data_hot",
        "data_warm",
        "ingest",
        "master",
        "ml",
        "remote_cluster_client",
        "transform"
      ],
      "version" : "8.12.0",
      "min_index_version" : 7000099,
      "max_index_version" : 8500003
    },
    "WYDNNPxcSHCzUlxU4LJM9A" : {
      "name" : "v8.2.2-0",
      "ephemeral_id" : "X1i8SReGSlO72K9B5b7IMA",
      "transport_address" : "127.0.0.1:45279",
      "external_id" : "v8.2.2-0",
      "attributes" : {
        "testattr" : "test",
        "ml.config_version" : "11.0.0",
        "xpack.installed" : "true",
        "transform.config_version" : "10.0.0",
        "ml.allocated_processors" : "32",
        "ml.machine_memory" : "101258194944",
        "ml.max_jvm_size" : "536870912",
        "upgraded" : "true",
        "ml.allocated_processors_double" : "32.0"
      },
      "roles" : [
        "data",
        "data_cold",
        "data_content",
        "data_frozen",
        "data_hot",
        "data_warm",
        "ingest",
        "master",
        "ml",
        "remote_cluster_client",
        "transform"
      ],
      "version" : "8.12.0",
      "min_index_version" : 7000099,
      "max_index_version" : 8500003
    }
  },
  "nodes_versions" : [
    {
      "node_id" : "lGvXtOvMSZmPEwEQeeGjBw",
      "transport_version" : "8512000",
      "mappings_versions" : {
        ".secrets-inference" : {
          "version" : 1,
          "hash" : -1434574148
        },
        ".triggered_watches" : {
          "version" : 1,
          "hash" : -502826165
        },
        ".fleet-agents-7" : {
          "version" : 1,
          "hash" : -644511608
        },
        ".fleet-servers-7" : {
          "version" : 1,
          "hash" : -916922632
        },
        ".fleet-policies-leader-7" : {
          "version" : 1,
          "hash" : -1108172796
        },
        ".ml-config" : {
          "version" : 1,
          "hash" : 1455005892
        },
        ".security-tokens-7" : {
          "version" : 1,
          "hash" : 576296021
        },
        ".geoip_databases" : {
          "version" : 1,
          "hash" : -305757839
        },
        ".snapshot-blob-cache" : {
          "version" : 1,
          "hash" : 632712485
        },
        ".security-profile-8" : {
          "version" : 1,
          "hash" : -909540896
        },
        ".search-app-1" : {
          "version" : 1,
          "hash" : -501711141
        },
        ".watches" : {
          "version" : 1,
          "hash" : -1045118511
        },
        ".fleet-artifacts-7" : {
          "version" : 1,
          "hash" : -1593703898
        },
        ".query-rules-1" : {
          "version" : 1,
          "hash" : 1647955624
        },
        ".transform-internal-007" : {
          "version" : 1,
          "hash" : 1144737897
        },
        ".fleet-enrollment-api-keys-7" : {
          "version" : 1,
          "hash" : -840564854
        },
        ".fleet-actions-7" : {
          "version" : 1,
          "hash" : 961753072
        },
        ".tasks" : {
          "version" : 0,
          "hash" : -945584329
        },
        ".fleet-secrets-7" : {
          "version" : 1,
          "hash" : -745394230
        },
        ".ml-meta" : {
          "version" : 1,
          "hash" : 976416370
        },
        ".security-7" : {
          "version" : 1,
          "hash" : -1061511639
        },
        ".logstash" : {
          "version" : 1,
          "hash" : -1058806351
        },
        ".ml-inference-000005" : {
          "version" : 2,
          "hash" : -1534099917
        },
        ".async-search" : {
          "version" : 0,
          "hash" : -1403744380
        },
        ".inference" : {
          "version" : 1,
          "hash" : 645017346
        },
        ".fleet-policies-7" : {
          "version" : 1,
          "hash" : 762674907
        },
        ".synonyms-2" : {
          "version" : 1,
          "hash" : -888080772
        }
      }
    },
    {
      "node_id" : "WYDNNPxcSHCzUlxU4LJM9A",
      "transport_version" : "8512000",
      "mappings_versions" : { }
    }
  ],
  "metadata" : {
    "cluster_uuid" : "mVPEGDh-SdCu0v5EceFlWA",
    "cluster_uuid_committed" : true,
    "cluster_coordination" : {
      "term" : 4,
      "last_committed_config" : [
        "WYDNNPxcSHCzUlxU4LJM9A",
        "lGvXtOvMSZmPEwEQeeGjBw",
        "DGulxgtJQKKeIF6J-oYHRA"
      ],
      "last_accepted_config" : [
        "WYDNNPxcSHCzUlxU4LJM9A",
        "lGvXtOvMSZmPEwEQeeGjBw",
        "DGulxgtJQKKeIF6J-oYHRA"
      ],
      "voting_config_exclusions" : [ ]
    },
    "templates" : {
      ".monitoring-kibana" : {
        "order" : 0,
        "version" : 8080099,
        "index_patterns" : [
          ".monitoring-kibana-7-*"
        ],
        "settings" : {
          "index" : {
            "format" : "7",
            "codec" : "best_compression",
            "number_of_shards" : "1",
            "auto_expand_replicas" : "0-1",
            "number_of_replicas" : "0"
          }
        },
        "mappings" : {
          "_doc" : {
            "dynamic" : false,
            "properties" : {
              "cluster_uuid" : {
                "type" : "keyword"
              },
              "timestamp" : {
                "type" : "date",
                "format" : "date_time"
              },
              "interval_ms" : {
                "type" : "long"
              },
              "type" : {
                "type" : "keyword"
              },
              "source_node" : {
                "properties" : {
                  "uuid" : {
                    "type" : "keyword"
                  },
                  "host" : {
                    "type" : "keyword"
                  },
                  "transport_address" : {
                    "type" : "keyword"
                  },
                  "ip" : {
                    "type" : "keyword"
                  },
                  "name" : {
                    "type" : "keyword"
                  },
                  "timestamp" : {
                    "type" : "date",
                    "format" : "date_time"
                  }
                }
              },
              "kibana_stats" : {
                "properties" : {
                  "usage" : {
                    "properties" : {
                      "index" : {
                        "type" : "keyword"
                      }
                    }
                  },
                  "kibana" : {
                    "properties" : {
                      "uuid" : {
                        "type" : "keyword"
                      },
                      "name" : {
                        "type" : "keyword"
                      },
                      "host" : {
                        "type" : "keyword"
                      },
                      "transport_address" : {
                        "type" : "keyword"
                      },
                      "version" : {
                        "type" : "keyword"
                      },
                      "snapshot" : {
                        "type" : "boolean"
                      },
                      "status" : {
                        "type" : "keyword"
                      },
                      "statuses" : {
                        "properties" : {
                          "name" : {
                            "type" : "keyword"
                          },
                          "state" : {
                            "type" : "keyword"
                          }
                        }
                      }
                    }
                  },
                  "cloud" : {
                    "properties" : {
                      "name" : {
                        "type" : "keyword"
                      },
                      "id" : {
                        "type" : "keyword"
                      },
                      "vm_type" : {
                        "type" : "keyword"
                      },
                      "region" : {
                        "type" : "keyword"
                      },
                      "zone" : {
                        "type" : "keyword"
                      },
                      "metadata" : {
                        "type" : "object"
                      }
                    }
                  },
                  "os" : {
                    "properties" : {
                      "load" : {
                        "properties" : {
                          "1m" : {
                            "type" : "half_float"
                          },
                          "5m" : {
                            "type" : "half_float"
                          },
                          "15m" : {
                            "type" : "half_float"
                          }
                        }
                      },
                      "memory" : {
                        "properties" : {
                          "total_in_bytes" : {
                            "type" : "float"
                          },
                          "free_in_bytes" : {
                            "type" : "float"
                          },
                          "used_in_bytes" : {
                            "type" : "float"
                          }
                        }
                      },
                      "uptime_in_millis" : {
                        "type" : "long"
                      }
                    }
                  },
                  "process" : {
                    "properties" : {
                      "memory" : {
                        "properties" : {
                          "heap" : {
                            "properties" : {
                              "total_in_bytes" : {
                                "type" : "float"
                              },
                              "used_in_bytes" : {
                                "type" : "float"
                              },
                              "size_limit" : {
                                "type" : "float"
                              }
                            }
                          },
                          "resident_set_size_in_bytes" : {
                            "type" : "float"
                          }
                        }
                      },
                      "event_loop_delay" : {
                        "type" : "float"
                      },
                      "uptime_in_millis" : {
                        "type" : "long"
                      }
                    }
                  },
                  "sockets" : {
                    "properties" : {
                      "http" : {
                        "properties" : {
                          "total" : {
                            "type" : "long"
                          }
                        }
                      },
                      "https" : {
                        "properties" : {
                          "total" : {
                            "type" : "long"
                          }
                        }
                      }
                    }
                  },
                  "timestamp" : {
                    "type" : "date"
                  },
                  "requests" : {
                    "properties" : {
                      "disconnects" : {
                        "type" : "long"
                      },
                      "total" : {
                        "type" : "long"
                      },
                      "status_codes" : {
                        "type" : "object"
                      }
                    }
                  },
                  "response_times" : {
                    "properties" : {
                      "average" : {
                        "type" : "float"
                      },
                      "max" : {
                        "type" : "float"
                      }
                    }
                  },
                  "concurrent_connections" : {
                    "type" : "long"
                  }
                }
              }
            }
          }
        },
        "aliases" : { }
      },
      ".monitoring-beats" : {
        "order" : 0,
        "version" : 8080099,
        "index_patterns" : [
          ".monitoring-beats-7-*"
        ],
        "settings" : {
          "index" : {
            "format" : "7",
            "codec" : "best_compression",
            "number_of_shards" : "1",
            "auto_expand_replicas" : "0-1",
            "number_of_replicas" : "0"
          }
        },
        "mappings" : {
          "_doc" : {
            "dynamic" : false,
            "properties" : {
              "beats_state" : {
                "properties" : {
                  "beat" : {
                    "properties" : {
                      "host" : {
                        "type" : "keyword"
                      },
                      "name" : {
                        "type" : "keyword"
                      },
                      "type" : {
                        "type" : "keyword"
                      },
                      "uuid" : {
                        "type" : "keyword"
                      },
                      "version" : {
                        "type" : "keyword"
                      }
                    }
                  },
                  "state" : {
                    "properties" : {
                      "beat" : {
                        "properties" : {
                          "name" : {
                            "type" : "keyword"
                          }
                        }
                      },
                      "host" : {
                        "properties" : {
                          "architecture" : {
                            "type" : "keyword"
                          },
                          "name" : {
                            "type" : "keyword"
                          },
                          "hostname" : {
                            "type" : "keyword"
                          },
                          "os" : {
                            "properties" : {
                              "build" : {
                                "type" : "keyword"
                              },
                              "family" : {
                                "type" : "keyword"
                              },
                              "platform" : {
                                "type" : "keyword"
                              },
                              "version" : {
                                "type" : "keyword"
                              }
                            }
                          }
                        }
                      },
                      "input" : {
                        "properties" : {
                          "count" : {
                            "type" : "long"
                          },
                          "names" : {
                            "type" : "keyword"
                          }
                        }
                      },
                      "module" : {
                        "properties" : {
                          "count" : {
                            "type" : "long"
                          },
                          "names" : {
                            "type" : "keyword"
                          }
                        }
                      },
                      "output" : {
                        "properties" : {
                          "name" : {
                            "type" : "keyword"
                          }
                        }
                      },
                      "service" : {
                        "properties" : {
                          "id" : {
                            "type" : "keyword"
                          },
                          "name" : {
                            "type" : "keyword"
                          },
                          "version" : {
                            "type" : "keyword"
                          }
                        }
                      }
                    }
                  },
                  "timestamp" : {
                    "format" : "date_time",
                    "type" : "date"
                  }
                }
              },
              "beats_stats" : {
                "properties" : {
                  "beat" : {
                    "properties" : {
                      "host" : {
                        "type" : "keyword"
                      },
                      "name" : {
                        "type" : "keyword"
                      },
                      "type" : {
                        "type" : "keyword"
                      },
                      "uuid" : {
                        "type" : "keyword"
                      },
                      "version" : {
                        "type" : "keyword"
                      }
                    }
                  },
                  "metrics" : {
                    "properties" : {
                      "beat" : {
                        "properties" : {
                          "cgroup" : {
                            "properties" : {
                              "cpu" : {
                                "properties" : {
                                  "id" : {
                                    "type" : "keyword"
                                  },
                                  "cfs" : {
                                    "properties" : {
                                      "period" : {
                                        "properties" : {
                                          "us" : {
                                            "type" : "long"
                                          }
                                        }
                                      },
                                      "quota" : {
                                        "properties" : {
                                          "us" : {
                                            "type" : "long"
                                          }
                                        }
                                      }
                                    }
                                  },
                                  "stats" : {
                                    "properties" : {
                                      "periods" : {
                                        "type" : "long"
                                      },
                                      "throttled" : {
                                        "properties" : {
                                          "periods" : {
                                            "type" : "long"
                                          },
                                          "ns" : {
                                            "type" : "long"
                                          }
                                        }
                                      }
                                    }
                                  }
                                }
                              },
                              "cpuacct" : {
                                "properties" : {
                                  "id" : {
                                    "type" : "keyword"
                                  },
                                  "total" : {
                                    "properties" : {
                                      "ns" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              },
                              "memory" : {
                                "properties" : {
                                  "id" : {
                                    "type" : "keyword"
                                  },
                                  "mem" : {
                                    "properties" : {
                                      "limit" : {
                                        "properties" : {
                                          "bytes" : {
                                            "type" : "long"
                                          }
                                        }
                                      },
                                      "usage" : {
                                        "properties" : {
                                          "bytes" : {
                                            "type" : "long"
                                          }
                                        }
                                      }
                                    }
                                  }
                                }
                              }
                            }
                          },
                          "cpu" : {
                            "properties" : {
                              "system" : {
                                "properties" : {
                                  "ticks" : {
                                    "type" : "long"
                                  },
                                  "time" : {
                                    "properties" : {
                                      "ms" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              },
                              "total" : {
                                "properties" : {
                                  "value" : {
                                    "type" : "long"
                                  },
                                  "ticks" : {
                                    "type" : "long"
                                  },
                                  "time" : {
                                    "properties" : {
                                      "ms" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              },
                              "user" : {
                                "properties" : {
                                  "ticks" : {
                                    "type" : "long"
                                  },
                                  "time" : {
                                    "properties" : {
                                      "ms" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              }
                            }
                          },
                          "info" : {
                            "properties" : {
                              "ephemeral_id" : {
                                "type" : "keyword"
                              },
                              "uptime" : {
                                "properties" : {
                                  "ms" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "version" : {
                                "type" : "keyword"
                              }
                            }
                          },
                          "memstats" : {
                            "properties" : {
                              "gc_next" : {
                                "type" : "long"
                              },
                              "memory_alloc" : {
                                "type" : "long"
                              },
                              "memory_total" : {
                                "type" : "long"
                              },
                              "rss" : {
                                "type" : "long"
                              }
                            }
                          },
                          "handles" : {
                            "properties" : {
                              "open" : {
                                "type" : "long"
                              },
                              "limit" : {
                                "properties" : {
                                  "hard" : {
                                    "type" : "long"
                                  },
                                  "soft" : {
                                    "type" : "long"
                                  }
                                }
                              }
                            }
                          }
                        }
                      },
                      "apm-server" : {
                        "properties" : {
                          "acm" : {
                            "properties" : {
                              "request" : {
                                "properties" : {
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "response" : {
                                "properties" : {
                                  "count" : {
                                    "type" : "long"
                                  },
                                  "errors" : {
                                    "properties" : {
                                      "validate" : {
                                        "type" : "long"
                                      },
                                      "internal" : {
                                        "type" : "long"
                                      },
                                      "queue" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      },
                                      "decode" : {
                                        "type" : "long"
                                      },
                                      "toolarge" : {
                                        "type" : "long"
                                      },
                                      "unavailable" : {
                                        "type" : "long"
                                      },
                                      "forbidden" : {
                                        "type" : "long"
                                      },
                                      "method" : {
                                        "type" : "long"
                                      },
                                      "notfound" : {
                                        "type" : "long"
                                      },
                                      "invalidquery" : {
                                        "type" : "long"
                                      },
                                      "ratelimit" : {
                                        "type" : "long"
                                      },
                                      "closed" : {
                                        "type" : "long"
                                      },
                                      "unauthorized" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "valid" : {
                                    "properties" : {
                                      "notmodified" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      },
                                      "ok" : {
                                        "type" : "long"
                                      },
                                      "accepted" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "unset" : {
                                    "type" : "long"
                                  },
                                  "request" : {
                                    "properties" : {
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              }
                            }
                          },
                          "server" : {
                            "properties" : {
                              "request" : {
                                "properties" : {
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "concurrent" : {
                                "properties" : {
                                  "wait" : {
                                    "properties" : {
                                      "ms" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              },
                              "response" : {
                                "properties" : {
                                  "count" : {
                                    "type" : "long"
                                  },
                                  "errors" : {
                                    "properties" : {
                                      "count" : {
                                        "type" : "long"
                                      },
                                      "toolarge" : {
                                        "type" : "long"
                                      },
                                      "validate" : {
                                        "type" : "long"
                                      },
                                      "ratelimit" : {
                                        "type" : "long"
                                      },
                                      "queue" : {
                                        "type" : "long"
                                      },
                                      "closed" : {
                                        "type" : "long"
                                      },
                                      "forbidden" : {
                                        "type" : "long"
                                      },
                                      "concurrency" : {
                                        "type" : "long"
                                      },
                                      "unauthorized" : {
                                        "type" : "long"
                                      },
                                      "internal" : {
                                        "type" : "long"
                                      },
                                      "decode" : {
                                        "type" : "long"
                                      },
                                      "method" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "valid" : {
                                    "properties" : {
                                      "ok" : {
                                        "type" : "long"
                                      },
                                      "accepted" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              }
                            }
                          },
                          "decoder" : {
                            "properties" : {
                              "deflate" : {
                                "properties" : {
                                  "content-length" : {
                                    "type" : "long"
                                  },
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "gzip" : {
                                "properties" : {
                                  "content-length" : {
                                    "type" : "long"
                                  },
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "uncompressed" : {
                                "properties" : {
                                  "content-length" : {
                                    "type" : "long"
                                  },
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "reader" : {
                                "properties" : {
                                  "size" : {
                                    "type" : "long"
                                  },
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "missing-content-length" : {
                                "properties" : {
                                  "count" : {
                                    "type" : "long"
                                  }
                                }
                              }
                            }
                          },
                          "processor" : {
                            "properties" : {
                              "metric" : {
                                "properties" : {
                                  "decoding" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "validation" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "transformations" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "sourcemap" : {
                                "properties" : {
                                  "counter" : {
                                    "type" : "long"
                                  },
                                  "decoding" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "validation" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              },
                              "transaction" : {
                                "properties" : {
                                  "decoding" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "validation" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "transformations" : {
                                    "type" : "long"
                                  },
                                  "transactions" : {
                                    "type" : "long"
                                  },
                                  "spans" : {
                                    "type" : "long"
                                  },
                                  "stacktraces" : {
                                    "type" : "long"
                                  },
                                  "frames" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "error" : {
                                "properties" : {
                                  "decoding" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "validation" : {
                                    "properties" : {
                                      "errors" : {
                                        "type" : "long"
                                      },
                                      "count" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "transformations" : {
                                    "type" : "long"
                                  },
                                  "errors" : {
                                    "type" : "long"
                                  },
                                  "stacktraces" : {
                                    "type" : "long"
                                  },
                                  "frames" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "span" : {
                                "properties" : {
                                  "transformations" : {
                                    "type" : "long"
                                  }
                                }
                              }
                            }
                          },
                          "sampling" : {
                            "properties" : {
                              "transactions_dropped" : {
                                "type" : "long"
                              },
                              "tail" : {
                                "properties" : {
                                  "dynamic_service_groups" : {
                                    "type" : "long"
                                  },
                                  "storage" : {
                                    "properties" : {
                                      "lsm_size" : {
                                        "type" : "long"
                                      },
                                      "value_log_size" : {
                                        "type" : "long"
                                      }
                                    }
                                  },
                                  "events" : {
                                    "properties" : {
                                      "processed" : {
                                        "type" : "long"
                                      },
                                      "dropped" : {
                                        "type" : "long"
                                      },
                                      "stored" : {
                                        "type" : "long"
                                      },
                                      "sampled" : {
                                        "type" : "long"
                                      },
                                      "head_unsampled" : {
                                        "type" : "long"
                                      }
                                    }
                                  }
                                }
                              }
                            }
                          }
                        }
                      },
                      "libbeat" : {
                        "properties" : {
                          "config" : {
                            "properties" : {
                              "module" : {
                                "properties" : {
                                  "running" : {
                                    "type" : "long"
                                  },
                                  "starts" : {
                                    "type" : "long"
                                  },
                                  "stops" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "reloads" : {
                                "type" : "long"
                              }
                            }
                          },
                          "output" : {
                            "properties" : {
                              "events" : {
                                "properties" : {
                                  "acked" : {
                                    "type" : "long"
                                  },
                                  "active" : {
                                    "type" : "long"
                                  },
                                  "batches" : {
                                    "type" : "long"
                                  },
                                  "dropped" : {
                                    "type" : "long"
                                  },
                                  "duplicates" : {
                                    "type" : "long"
                                  },
                                  "failed" : {
                                    "type" : "long"
                                  },
                                  "total" : {
                                    "type" : "long"
                                  },
                                  "toomany" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "read" : {
                                "properties" : {
                                  "bytes" : {
                                    "type" : "long"
                                  },
                                  "errors" : {
                                    "type" : "long"
                                  }
                                }
                              },
                              "type" : {
                                "type" : "keyword"
                              },
                              "write" : {
                                "properties" : {
                                  "bytes" : {
                                    "type" : "long"
                                  },
                                  "errors" : {
                                    "type" : "long"
                                  }
                                }
                              }
                            }
                          },
                          "pipeline" : {
                            "properties" : {
                              "clients" : {
                                "type" : "long"
                              },
                              "events" : {
                                "properties" : {
                                  "active" : {
                                    "type" : "long"
                                  },
                                  "dropped" : {
                                    "type" : "long"
                                  },
                                  "failed" : {
                                    "type" : "long"
                                  },
                                  "filtered" : {
                                    "type" : "long"
                                  },
                                  "published" : {
                                    "type" : "long"
                                  },
                                  "retry" : {
                                    "type" : "long"
                                  },...
@mark-vieira mark-vieira added :Data Management/Other >test-failure Triaged test failures from CI labels Oct 5, 2023
@elasticsearchmachine
Collaborator

Pinging @elastic/es-data-management (Team:Data Management)

@elasticsearchmachine elasticsearchmachine added the Team:Data Management Meta label for data/management team label Oct 5, 2023
@benwtrent
Member

OK, this particular failure might be due to ML. Attaching the logs. (I am not sure if this is indicative of all the "failure to get green" BWC tests).

In the logs for the X-Pack rolling upgrade, I see the following on some of the nodes (one node failed to start, so the cluster couldn't get to green).

@elastic/ml-core What do y'all think?

[2023-10-05T13:20:58,984][TRACE][o.e.x.m.i.a.TrainedModelAllocationClusterService] [v8.2.2-1] [upgrade-deployment-test-stop-mixed-cluster] [lGvXtOvMSZmPEwEQeeGjBw] current metadata before update {"upgrade-deployment-test-stop-mixed-cluster":{"task_parameters":{"model_id":"upgrade-deployment-test-stop-mixed-cluster","model_bytes":1630,"inference_threads":1,"model_threads":1,"queue_capacity":1024},"routing_table":{"lGvXtOvMSZmPEwEQeeGjBw":{"routing_state":"started","reason":""},"DGulxgtJQKKeIF6J-oYHRA":{"routing_state":"started","reason":""},"WYDNNPxcSHCzUlxU4LJM9A":{"routing_state":"starting","reason":""}},"allocation_state":"stopping","reason":"client API call","start_time":"2023-10-05T13:19:17.77625836Z"}}
[2023-10-05T13:20:58,988][DEBUG][o.e.x.m.p.ProcessResultsParser] [v8.2.2-1] io error while parsing
org.elasticsearch.xcontent.XContentEOFException: com.fasterxml.jackson.core.io.JsonEOFException: Unexpected end-of-input: expected close marker for Array (start marker at [Source: (FileInputStream); line: 2, column: 1])
 at [Source: (FileInputStream); line: 3, column: 1]
	at org.elasticsearch.xcontent.provider.json.JsonXContentParser.nextToken(JsonXContentParser.java:60) ~[?:?]
	at org.elasticsearch.xpack.ml.process.ProcessResultsParser$ResultIterator.hasNext(ProcessResultsParser.java:70) [x-pack-ml-8.2.2.jar:8.2.2]
	at org.elasticsearch.xpack.ml.inference.pytorch.process.PyTorchResultProcessor.process(PyTorchResultProcessor.java:93) [x-pack-ml-8.2.2.jar:8.2.2]
	at org.elasticsearch.xpack.ml.inference.deployment.DeploymentManager.lambda$doStartDeployment$2(DeploymentManager.java:148) [x-pack-ml-8.2.2.jar:8.2.2]
	at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:714) [elasticsearch-8.2.2.jar:8.2.2]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?]
	at java.lang.Thread.run(Thread.java:833) [?:?]
Caused by: com.fasterxml.jackson.core.io.JsonEOFException: Unexpected end-of-input: expected close marker for Array (start marker at [Source: (FileInputStream); line: 2, column: 1])
 at [Source: (FileInputStream); line: 3, column: 1]
	at com.fasterxml.jackson.core.base.ParserMinimalBase._reportInvalidEOF(ParserMinimalBase.java:682) ~[?:?]
	at com.fasterxml.jackson.core.base.ParserBase._handleEOF(ParserBase.java:494) ~[?:?]
	at com.fasterxml.jackson.core.base.ParserBase._eofAsNextChar(ParserBase.java:511) ~[?:?]
	at com.fasterxml.jackson.core.json.UTF8StreamJsonParser._skipWSOrEnd2(UTF8StreamJsonParser.java:3111) ~[?:?]
	at com.fasterxml.jackson.core.json.UTF8StreamJsonParser._skipWSOrEnd(UTF8StreamJsonParser.java:3081) ~[?:?]
	at com.fasterxml.jackson.core.json.UTF8StreamJsonParser.nextToken(UTF8StreamJsonParser.java:756) ~[?:?]
	at org.elasticsearch.xcontent.provider.json.JsonXContentParser.nextToken(JsonXContentParser.java:58) ~[?:?]
	... 7 more

And this trace:

[2023-10-05T13:22:24,393][INFO ][o.e.c.s.ClusterApplierService] [v8.2.2-2] added {{v8.2.2-1}{lGvXtOvMSZmPEwEQeeGjBw}{bpIxschHTD-M27zJJvM85w}{127.0.0.1}{127.0.0.1:38157}{cdfhilmrstw}}, term: 4, version: 550, reason: ApplyCommitRequest{term=4, version=550, sourceNode={v8.2.2-0}{WYDNNPxcSHCzUlxU4LJM9A}{X1i8SReGSlO72K9B5b7IMA}{127.0.0.1}{127.0.0.1:45279}{cdfhilmrstw}{ml.allocated_processors_double=32.0, ml.machine_memory=101258194944, upgraded=true, xpack.installed=true, transform.config_version=10.0.0, testattr=test, ml.config_version=11.0.0, ml.max_jvm_size=536870912, ml.allocated_processors=32}}
[2023-10-05T13:22:30,590][WARN ][o.e.t.InboundHandler     ] [v8.2.2-2] Failed to deserialize response from [127.0.0.1/127.0.0.1:45279]
org.elasticsearch.transport.TransportSerializationException: Failed to deserialize response from handler [org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler/UnregisterChildTransportResponseHandler[unregisterChildNode=org.elasticsearch.core.Releasables$$Lambda$5832/0x0000000801b372b8@1efa4a7a, handler=org.elasticsearch.action.support.master.TransportMasterNodeAction$AsyncSingleAction$1@46c891af/org.elasticsearch.action.ActionListener$DelegatingFailureActionListener/org.elasticsearch.action.support.ContextPreservingActionListener/org.elasticsearch.tasks.TaskManager$1{ActionResponseTaskListener[listener=org.elasticsearch.rest.action.admin.cluster.RestClusterGetSettingsAction$1@6e5cd6f3]}{Task{id=7157, type='transport', action='cluster:monitor/state', description='cluster state [metadata, blocks, customs, master timeout [30s]]', parentTask=unset, startTime=1696512150569, startTimeNanos=775610041683}}/org.elasticsearch.xpack.security.action.filter.SecurityActionFilter$$Lambda$5993/0x0000000801ba1bb8@2793b845, action=cluster:monitor/state]]
	at org.elasticsearch.transport.InboundHandler.handleResponse(InboundHandler.java:341) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.InboundHandler.messageReceived(InboundHandler.java:142) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.InboundHandler.inboundMessage(InboundHandler.java:94) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.TcpTransport.inboundMessage(TcpTransport.java:795) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.InboundPipeline.forwardFragments(InboundPipeline.java:149) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.InboundPipeline.doHandleBytes(InboundPipeline.java:121) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.InboundPipeline.handleBytes(InboundPipeline.java:86) [elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.netty4.Netty4MessageInboundHandler.channelRead(Netty4MessageInboundHandler.java:63) [transport-netty4-8.2.2.jar:8.2.2]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.logging.LoggingHandler.channelRead(LoggingHandler.java:280) [netty-handler-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.codec.MessageToMessageDecoder.channelRead(MessageToMessageDecoder.java:103) [netty-codec-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.ssl.SslHandler.unwrap(SslHandler.java:1372) [netty-handler-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.ssl.SslHandler.decodeJdkCompatible(SslHandler.java:1235) [netty-handler-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.ssl.SslHandler.decode(SslHandler.java:1284) [netty-handler-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.codec.ByteToMessageDecoder.decodeRemovalReentryProtection(ByteToMessageDecoder.java:510) [netty-codec-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.codec.ByteToMessageDecoder.callDecode(ByteToMessageDecoder.java:449) [netty-codec-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.handler.codec.ByteToMessageDecoder.channelRead(ByteToMessageDecoder.java:279) [netty-codec-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.DefaultChannelPipeline$HeadContext.channelRead(DefaultChannelPipeline.java:1410) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.DefaultChannelPipeline.fireChannelRead(DefaultChannelPipeline.java:919) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:166) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:722) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.nio.NioEventLoop.processSelectedKeysPlain(NioEventLoop.java:623) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:586) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:496) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:986) [netty-common-4.1.74.Final.jar:4.1.74.Final]
	at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74) [netty-common-4.1.74.Final.jar:4.1.74.Final]
	at java.lang.Thread.run(Thread.java:833) [?:?]
Caused by: java.lang.IllegalArgumentException: Unknown NamedWriteable [org.elasticsearch.cluster.metadata.Metadata$Custom][trained_model_assignment]
	at org.elasticsearch.common.io.stream.NamedWriteableRegistry.getReader(NamedWriteableRegistry.java:112) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.common.io.stream.NamedWriteableAwareStreamInput.readNamedWriteable(NamedWriteableAwareStreamInput.java:36) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.common.io.stream.NamedWriteableAwareStreamInput.readNamedWriteable(NamedWriteableAwareStreamInput.java:28) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.cluster.metadata.Metadata.readFrom(Metadata.java:1167) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.cluster.ClusterState.readFrom(ClusterState.java:756) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.action.admin.cluster.state.ClusterStateResponse.lambda$new$0(ClusterStateResponse.java:33) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.common.io.stream.StreamInput.readOptionalWriteable(StreamInput.java:973) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.action.admin.cluster.state.ClusterStateResponse.<init>(ClusterStateResponse.java:33) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.action.ActionListenerResponseHandler.read(ActionListenerResponseHandler.java:58) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.action.ActionListenerResponseHandler.read(ActionListenerResponseHandler.java:25) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.TransportService$UnregisterChildTransportResponseHandler.read(TransportService.java:1619) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.TransportService$UnregisterChildTransportResponseHandler.read(TransportService.java:1594) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler.read(TransportService.java:1327) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler.read(TransportService.java:1314) ~[elasticsearch-8.2.2.jar:8.2.2]
	at org.elasticsearch.transport.InboundHandler.handleResponse(InboundHandler.java:338) ~[elasticsearch-8.2.2.jar:8.2.2]
	... 39 more

1948.zip

@droberts195
Contributor

The earliest failure I can find that's similar to this is https://gradle-enterprise.elastic.co/s/in6lvjac7niak from 3rd October.

That would put it around the time of #100143. That change caused another problem, in #100180, which was fixed in #100388.

This issue was opened before #100388 was merged. I cannot see any failures of IndexingIT since #100388 was merged.

@davidkyle please can you check if this failure would also be fixed by #100388? If it would then #100285 can probably be closed too, as I think that's the same problem, just raised against a different test (as 3 tests fail together).

@masseyke
Member

masseyke commented Oct 11, 2023

I'm wondering if the TrainedModelAssignmentMetadata serialization problem is also causing #100379 (once it happens there, all the tests after it fail for a while). Is this not just a matter of needing to check the transport version when serializing/deserializing that? I see it was changed in 8.3.0 (#85503), and this was an 8.0.0 cluster.
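
To make the suggestion concrete, a version check at serialization time could look roughly like the sketch below. This is a self-contained toy, not the Elasticsearch code: the class `VersionGatedName`, the integer version ids, and the old name `trained_model_allocation` are assumptions for illustration only. The idea is that the sender picks the name the receiver is able to look up.

```java
// Hypothetical stand-in for version-gated naming; not the Elasticsearch API.
final class VersionGatedName {
    // Assumed numeric ids, chosen only so that they order correctly.
    static final int V_8_2_0 = 80200;
    static final int V_8_3_0 = 80300; // release in which the rename landed (#85503)

    static final String OLD_NAME = "trained_model_allocation"; // assumed pre-rename name
    static final String NEW_NAME = "trained_model_assignment";

    // Choose the writeable name based on the version of the node that will read it.
    static String writeableNameFor(int receiverVersion) {
        return receiverVersion >= V_8_3_0 ? NEW_NAME : OLD_NAME;
    }

    public static void main(String[] args) {
        System.out.println(writeableNameFor(V_8_2_0));
        System.out.println(writeableNameFor(V_8_3_0));
    }
}
```

With a check like this, a node older than the rename would never be sent a name it has no reader for, provided the sender's view of the receiver's version is correct.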

@masseyke
Member

Still no failures since the day #100388 was merged. I'll close this and we can reopen it if we start seeing this again.

@davidkyle
Member

I have a suspicion that the test failure is actually due to the muting of another test in the same test suite that was failing and fixed by #100388; this might explain why the failure appears to be fixed by #100388.

The failure is caused by the ML model deployment code updating the cluster state with a new named writeable; this should only happen once all the nodes in the cluster have been upgraded. The unknown named writeable is a fatal error for the 3rd node, causing IndexingIT to time out waiting for the cluster to have 3 nodes.
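
As a rough illustration of why the old node cannot recover, here is a toy registry that fails the same way as the stack trace above. It is a simplified sketch, not the real `NamedWriteableRegistry`: the class `ToyNamedWriteableRegistry`, its `register` method, and the old name `trained_model_allocation` are assumptions for illustration.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

// Toy lookup table keyed by category and name; not the Elasticsearch implementation.
final class ToyNamedWriteableRegistry {
    static final String CATEGORY = "org.elasticsearch.cluster.metadata.Metadata$Custom";

    private final Map<String, Map<String, Supplier<Object>>> readers = new HashMap<>();

    void register(String category, String name, Supplier<Object> reader) {
        readers.computeIfAbsent(category, c -> new HashMap<>()).put(name, reader);
    }

    // Returns the reader for the name, or throws if this node never registered it.
    Supplier<Object> getReader(String category, String name) {
        Map<String, Supplier<Object>> byName = readers.get(category);
        Supplier<Object> reader = byName == null ? null : byName.get(name);
        if (reader == null) {
            throw new IllegalArgumentException("Unknown NamedWriteable [" + category + "][" + name + "]");
        }
        return reader;
    }

    public static void main(String[] args) {
        // An old node only knows the pre-rename name (assumed here).
        ToyNamedWriteableRegistry oldNode = new ToyNamedWriteableRegistry();
        oldNode.register(CATEGORY, "trained_model_allocation", () -> "assignments read under the old name");
        try {
            oldNode.getReader(CATEGORY, "trained_model_assignment");
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

The old node has no reader registered under the new name, so it cannot deserialize any cluster state that contains it.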

I've pushed some logging in #100800 and re-muted MLModelDeploymentsUpgradeIT::testTrainedModelDeployment to test this theory.

@droberts195
Contributor

droberts195 commented Oct 14, 2023

There's still a problem with trained_model_assignment metadata.

This is a failure from today that shows it: https://gradle-enterprise.elastic.co/s/36366ajvxm45i

In the server-side logs, v8.2.0-2/logs/v8.2.0.log shows this:

[2023-10-14T16:23:51,980][WARN ][o.e.t.TcpTransport       ] [v8.2.0-2] exception caught on transport layer [Netty4TcpChannel{localAddress=/127.0.0.1:33964, remoteAddress=127.0.0.1/127.0.0.1:41075, profile=default}], closing connection
java.lang.Exception: java.lang.AssertionError: java.lang.IllegalArgumentException: Unknown NamedWriteable [org.elasticsearch.cluster.metadata.Metadata$Custom][trained_model_assignment]
        at org.elasticsearch.transport.netty4.Netty4MessageInboundHandler.exceptionCaught(Netty4MessageInboundHandler.java:75) [transport-netty4-8.2.0.jar:8.2.0]
        at io.netty.channel.AbstractChannelHandlerContext.invokeExceptionCaught(AbstractChannelHandlerContext.java:302) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
        at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:381) [netty-transport-4.1.74.Final.jar:4.1.74.Final]
...

The error happens on the node that's still on 8.2 after the other two nodes in the cluster have been upgraded to 8.12.

I am pretty sure that something has been done in the last few weeks that invalidates the assumptions of #88289:

This is OK as the object serialization handles its BWC conditions when serializing over the wire.

I am wondering if something has changed in the negotiation of transport versions that means the cluster briefly thinks all the nodes are on 8.12 even though one of them is on 8.2. This would lead the following test to be true:

if (currentState.getMinTransportVersion().onOrAfter(RENAME_ALLOCATION_TO_ASSIGNMENT_TRANSPORT_VERSION)) {
logger.trace("putting custom new name");

We could test the theory by temporarily changing that trace to an info and appending the min transport version to the message.
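
A minimal sketch of what that diagnostic would reveal, under stated assumptions: the class `MinVersionGate`, the integer version ids, and the message format are hypothetical, and the real check lives in the ML code quoted above. The point is that the gate depends entirely on which nodes are included when the minimum is computed.

```java
import java.util.Collection;
import java.util.List;

// Toy model of a min-version gate; names and ids are assumptions for illustration.
final class MinVersionGate {
    static final int RENAME_VERSION = 80300; // assumed id for the 8.3.0 rename

    // Minimum version across the nodes the cluster state currently knows about.
    static int minVersion(Collection<Integer> nodeVersions) {
        return nodeVersions.stream()
            .mapToInt(Integer::intValue)
            .min()
            .orElseThrow(() -> new IllegalArgumentException("no nodes"));
    }

    // Builds the diagnostic message with the min version appended.
    static String describe(Collection<Integer> nodeVersions) {
        int min = minVersion(nodeVersions);
        String which = min >= RENAME_VERSION ? "new" : "old";
        return "putting custom " + which + " name, min transport version [" + min + "]";
    }

    public static void main(String[] args) {
        // Correct view of a 2/3 upgraded cluster: the 8.2 node holds the minimum down.
        System.out.println(describe(List.of(81200, 81200, 80200)));
        // Incomplete view that omits the old node: the gate wrongly opens.
        System.out.println(describe(List.of(81200, 81200)));
    }
}
```

If the logged minimum ever showed an upgraded version while an 8.2 node was still in the cluster, that would support the theory.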

@davidkyle
Member

The assumption was that the TrainedModelAssignmentClusterService was accidentally updating the cluster state with the new named writeable in a mixed cluster, but none of the logging added in #100800 showed up in the latest failure. MLModelDeploymentsUpgradeIT::testTrainedModelDeployment is muted and the only other model deployment is stopped at the 1/3 upgraded stage of the test. There are no trained model assignments, so TrainedModelAssignmentClusterService should not be making any changes.

#100886 looks likely to be the cause although I cannot find a convincing explanation as to why. It does seem related to having no model deployments at the 2/3 stage and hence empty trained_model_assignment metadata.

@dakrone dakrone added :ml Machine learning and removed :Data Management/Other labels Oct 27, 2023
@elasticsearchmachine elasticsearchmachine added blocker Team:ML Meta label for the ML team and removed Team:Data Management Meta label for data/management team labels Oct 27, 2023
@elasticsearchmachine
Collaborator

Pinging @elastic/ml-core (Team:ML)

@volodk85
Contributor

@volodk85 volodk85 reopened this Dec 12, 2023
@droberts195
Contributor

@volodk85 please could you open a separate issue for the new failure.

The new failure is in the :qa:rolling-upgrade suite. The original failure was in the :x-pack:qa:rolling-upgrade suite. The failing test classes here are completely different even though they're both called IndexingIT.

This issue is already pretty confusing and complex as the problem turned out to be in ML even though the failing test was an indexing test. In this new failure ML is almost certainly not the problem as the test is not an X-Pack test.

@mark-vieira
Contributor Author

The new failure is in the :qa:rolling-upgrade suite. The original failure was in the :x-pack:qa:rolling-upgrade suite. The failing test classes here are completely different even though they're both called IndexingIT.

We have far too many overloaded test suite names. This is just one of them. Sure, the package name is unique, but it's easy to conflate these. I wonder if we should go through and give these test suites unique names, even if that means losing some test history for a time.

@DaveCTurner
Contributor

I don't think @volodk85 ever opened a separate issue here (sorry if I just can't find it) so I opened #105452 and will close this one.
