Failing test: X-Pack Security API Integration Tests (Session Concurrent Limit).x-pack/test/security_api_integration/tests/session_concurrent_limit/cleanup·ts - security APIs - Session Concurrent Limit Session Concurrent Limit cleanup should properly clean up sessions that exceeded concurrent session limit even for multiple providers #149091

Closed
kibanamachine opened this issue Jan 18, 2023 · 42 comments · Fixed by #148985, #173828, #174748 or #183409
Labels
failed-test A test failure on a tracked branch, potentially flaky-test Team:Security Team focused on: Auth, Users, Roles, Spaces, Audit Logging, and more!

Comments

@kibanamachine
Contributor

kibanamachine commented Jan 18, 2023

A test failed on a tracked branch

Error: expected 6 to equal 4
    at Assertion.assert (expect.js:100:11)
    at Assertion.apply (expect.js:227:8)
    at Assertion.be (expect.js:69:22)
    at Context.<anonymous> (cleanup.ts:214:54)
    at processTicksAndRejections (node:internal/process/task_queues:95:5)
    at Object.apply (wrap_function.js:73:16)

First failure: CI Build - main

@kibanamachine kibanamachine added the failed-test A test failure on a tracked branch, potentially flaky-test label Jan 18, 2023
@botelastic botelastic bot added the needs-team Issues missing a team label label Jan 18, 2023
@kibanamachine kibanamachine added the Team:Security Team focused on: Auth, Users, Roles, Spaces, Audit Logging, and more! label Jan 18, 2023
@elasticmachine
Contributor

Pinging @elastic/kibana-security (Team:Security)

@botelastic botelastic bot removed the needs-team Issues missing a team label label Jan 18, 2023
@azasypkin azasypkin self-assigned this Jan 18, 2023
@kibanamachine
Contributor Author

New failure: CI Build - main

@mistic
Member

mistic commented Jan 18, 2023

Skipped.

main: 74d9321

@azasypkin
Member

Duplicate of #149090

@azasypkin azasypkin marked this as a duplicate of #149090 Jan 18, 2023
@azasypkin azasypkin closed this as not planned (won't fix, can't repro, duplicate, stale) Jan 18, 2023
wayneseymour pushed a commit to wayneseymour/kibana that referenced this issue Jan 19, 2023
mikecote added a commit that referenced this issue Jan 26, 2023
Resolves #148914
Resolves #149090
Resolves #149091
Resolves #149092

In this PR, I'm making the following Task Manager bulk APIs retry
whenever conflicts are encountered: `bulkEnable`, `bulkDisable`, and
`bulkUpdateSchedules`.

To accomplish this, the following had to be done:
- Revert the original PR (#147808) because the retries didn't load the updated documents whenever version conflicts were encountered, and the approach had to be redesigned.
- Create a `retryableBulkUpdate` function that can be reused among the bulk APIs (a rough sketch of the pattern follows this list).
- Fix a bug in `task_store.ts` where the `version` field wasn't passed through properly (no type safety for some reason).
- Remove `entity` from being returned on bulk update errors. This helped reuse the same response structure when objects weren't found.
- Create a `bulkGet` API on the task store so we get the latest documents prior to an ES refresh happening.
- Create a single mock task function that mocks Task Manager tasks for unit-test purposes. This was necessary because other places were doing `as unknown as BulkUpdateTaskResult` and escaping type safety.
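
For reference, here is a minimal sketch of the retry-on-conflict pattern described above. It is not the actual Task Manager code: the `TaskDoc` shape and the `bulkGet`/`bulkUpdate` callbacks are illustrative stand-ins for the real task store APIs.

```ts
// Illustrative sketch only: TaskDoc and the bulkGet/bulkUpdate callbacks
// stand in for the real Task Manager store APIs.
interface TaskDoc {
  id: string;
  version?: string;
  enabled: boolean;
}

interface BulkUpdateResult {
  task?: TaskDoc;
  error?: { id: string; statusCode: number; message: string };
}

interface RetryableBulkUpdateOpts {
  ids: string[];
  // Re-reads the latest copies of the documents so each retry uses fresh versions.
  bulkGet: (ids: string[]) => Promise<TaskDoc[]>;
  // Applies the mutation (e.g. enable/disable/reschedule) to a batch of tasks.
  bulkUpdate: (tasks: TaskDoc[]) => Promise<BulkUpdateResult[]>;
  maxAttempts?: number;
}

// Retries bulk updates, but only for documents that failed with a 409 version
// conflict; all other results are returned to the caller unchanged.
export async function retryableBulkUpdate({
  ids,
  bulkGet,
  bulkUpdate,
  maxAttempts = 3,
}: RetryableBulkUpdateOpts): Promise<BulkUpdateResult[]> {
  const resultsById = new Map<string, BulkUpdateResult>();
  let pendingIds = ids;

  for (let attempt = 0; attempt < maxAttempts && pendingIds.length > 0; attempt++) {
    // Fetch the documents again before each attempt so we carry the latest version.
    const tasks = await bulkGet(pendingIds);
    const results = await bulkUpdate(tasks);

    pendingIds = [];
    for (const result of results) {
      const id = result.task?.id ?? result.error!.id;
      const isConflict = result.error?.statusCode === 409;
      if (isConflict && attempt < maxAttempts - 1) {
        pendingIds.push(id); // version conflict: retry with a fresh copy
      } else {
        resultsById.set(id, result);
      }
    }
  }

  // Preserve the order of the requested ids in the combined result.
  return ids.map((id) => resultsById.get(id)!);
}
```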

Flaky test runs:
- [Framework]
https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/1776
- [Kibana Security]
https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/1786

Co-authored-by: kibanamachine <[email protected]>
kqualters-elastic pushed a commit to kqualters-elastic/kibana that referenced this issue Feb 6, 2023
@kibanamachine kibanamachine reopened this May 12, 2023
@kibanamachine
Contributor Author

New failure: CI Build - 8.8

@kibanamachine
Contributor Author

New failure: CI Build - 8.8

@jeramysoucy
Contributor

Ran another flaky test runner just to be sure, but this looks tied to a series of CI failures on Friday.

CoenWarmer pushed a commit to CoenWarmer/kibana that referenced this issue Feb 15, 2024
…ssion limit for users (elastic#174748)

## Summary

Closes elastic#149091 

This PR addresses the potential issue of a session not being found in
the session index by introducing a timeout before attempting to write
the next one. Passing these [changes through
FTR](https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/4854)
makes the test pass 100% of the time with 400 test runs.
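
For context, the kind of per-session wait the PR describes might look roughly like the sketch below. This is an illustration under assumptions, not the actual cleanup.ts code: the `waitForSessionCount` helper, its signature, and the minimal `es`/`retry` interfaces are all stand-ins for the FTR services used by the test.

```ts
// Illustrative sketch only: wait until the session index reflects the expected
// number of sessions before the test writes the next one.
import expect from '@kbn/expect';

interface EsCountClient {
  count(params: { index: string }): Promise<{ count: number }>;
}

interface RetryService {
  tryForTime<T>(timeoutMs: number, fn: () => Promise<T>): Promise<T>;
}

async function waitForSessionCount(es: EsCountClient, retry: RetryService, expected: number) {
  await retry.tryForTime(20_000, async () => {
    const { count } = await es.count({ index: '.kibana_security_session_1' });
    // Keep retrying until the previously written session document is searchable.
    expect(count).to.be(expected);
  });
}
```
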
CoenWarmer pushed a commit to CoenWarmer/kibana that referenced this issue Feb 15, 2024
## Summary

This PR is for troubleshooting
elastic#149091

It duplicates the timeout check per session from the `...legacy
sessions` test (see elastic#174748) for
the `...multiple providers` test.

Note: we are not seeing the additional 'Failed to write a new
session' log in any of the recent failures.

Could not reproduce the issue with a flaky test runner:
https://buildkite.com/elastic/kibana-flaky-test-suite-runner/builds/4949
@kibanamachine
Contributor Author

New failure: CI Build - 8.13

@kibanamachine kibanamachine reopened this Mar 8, 2024
@kibanamachine
Contributor Author

New failure: CI Build - 8.12

@kibanamachine
Contributor Author

New failure: CI Build - main

@kibanamachine
Contributor Author

New failure: CI Build - 8.13

@kibanamachine
Contributor Author

New failure: CI Build - main

@kibanamachine
Contributor Author

New failure: CI Build - main

@kibanamachine
Contributor Author

New failure: CI Build - main

@kibanamachine
Contributor Author

New failure: kibana-on-merge - 8.13

@kibanamachine
Contributor Author

New failure: kibana-on-merge - 8.13

@kibanamachine
Contributor Author

New failure: kibana-elasticsearch-snapshot-verify - 8.13

@kibanamachine
Contributor Author

New failure: kibana-on-merge - main

@legrego
Member

legrego commented May 28, 2024

latest failure:

└-> should properly clean up sessions that exceeded concurrent session limit even for multiple providers
  └-> "before each" hook: global before each for "should properly clean up sessions that exceeded concurrent session limit even for multiple providers"
  └-> "before each" hook for "should properly clean up sessions that exceeded concurrent session limit even for multiple providers"
  └- ✖ fail: security APIs - Session Concurrent Limit Session Concurrent Limit cleanup should properly clean up sessions that exceeded concurrent session limit even for multiple providers
  │      Error: retry.tryForTime reached timeout 20000 ms
  │ Error: expected 5 to equal 6
  │     at Assertion.assert (expect.js:100:11)
  │     at Assertion.apply (expect.js:227:8)
  │     at Assertion.be (expect.js:69:22)
  │     at cleanup.ts:235:56
  │     at processTicksAndRejections (node:internal/process/task_queues:95:5)
  │     at runAttempt (retry_for_success.ts:29:15)
  │     at retryForSuccess (retry_for_success.ts:98:21)
  │     at RetryService.tryForTime (retry.ts:37:12)
  │     at Context.<anonymous> (cleanup.ts:234:7)
  │     at Object.apply (wrap_function.js:73:16)
  │       at onFailure (retry_for_success.ts:17:9)
  │       at retryForSuccess (retry_for_success.ts:84:7)
  │       at RetryService.tryForTime (retry.ts:37:12)
  │       at Context.<anonymous> (cleanup.ts:234:7)
  │       at Object.apply (wrap_function.js:73:16)

@azasypkin azasypkin self-assigned this May 28, 2024
@kibanamachine
Contributor Author

New failure: kibana-on-merge - main

@kibanamachine
Contributor Author

New failure: kibana-on-merge - main

@elena-shostak
Contributor

elena-shostak commented Aug 12, 2024

Came across an interesting thing: the cleanupInterval is set to 5h in the FTR config:

--xpack.security.session.cleanupInterval=5h
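
For reference, such an override typically lives in the suite's FTR config. A minimal sketch, assuming the standard Kibana FTR config provider shape; the base config path below is a placeholder, not the actual file:

```ts
// Illustrative sketch of an FTR config overriding the session cleanup interval;
// the base config path is a placeholder, not the actual file.
import { FtrConfigProviderContext } from '@kbn/test';

export default async function ({ readConfigFile }: FtrConfigProviderContext) {
  const baseConfig = await readConfigFile(require.resolve('../../api_integration/config.ts'));

  return {
    ...baseConfig.getAll(),
    kbnTestServer: {
      ...baseConfig.get('kbnTestServer'),
      serverArgs: [
        ...baseConfig.get('kbnTestServer.serverArgs'),
        // Push the periodic cleanup far out so that only the explicit
        // `POST /session/_run_cleanup` calls made by the tests trigger it.
        '--xpack.security.session.cleanupInterval=5h',
      ],
    },
  };
}
```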

The log shows that we invoke the cleanup task in the very first test and get a 500:

[00:00:03]         │ debg Existing sessions: {"total":{"value":3,"relation":"eq"},"max_score":1,"hits":[{"_index":".kibana_security_session_1","_id":"gyCzdu4JDUUi03Cd2G+daZuJQKTEjvCidLFgsm6sgA4=","_score":1,"_source":{"provider":{"type":"basic","name":"basic1"},"idleTimeoutExpiration":1720528171722,"lifespanExpiration":1723116571722,"createdAt":1720524571722,"usernameHash":"e9ab99ee1daa1aa2b5cac38d446ac31f555b0a0bd0cd7a7335a3a2a065635e64","content":"M1k1w9v840N6u1EZkfwVVjzwLk9udSESi9jwRJ1znBPz0EO4RAliKE53gVyL99Zr1HpUAIIstiRtTbPIF0cVjoHmp/Z+kHeGxNAB2FbFecBjo7zPpxL6VnOr/7E3qNevj/8/sBObC4rW7l3kkDFV7AgSqWY9S64JazbqmpnH8mG18wE36L/hH4P0gTBmYCOTK35u83ajRoGFrr1Mbtkm8GEowxCKuRgkOAie2CO6btZ0KUGVHv6CiF4P9WKZnGIf2fyOS7YG7rFx98oWAs4OZhRm+/1CDU+4Q5x4L8e4Z3/fvSiah+15+1rHUzG3PNE9RQBtaGfFpxr7mo8KNCDTzvg="}},{"_index":".kibana_security_session_1","_id":"ED/Teu0G+MXg5zFFw63f4IGhJYNDQiLTku9RFKU73ZQ=","_score":1,"_source":{"provider":{"type":"basic","name":"basic1"},"idleTimeoutExpiration":1720528172475,"lifespanExpiration":1723116572475,"createdAt":1720524572475,"usernameHash":"e9ab99ee1daa1aa2b5cac38d446ac31f555b0a0bd0cd7a7335a3a2a065635e64","content":"SVDq4yk0ZZV76/00timLNRKDOnVQrZLYlsPxBX+gbHR8F/36smopXI+tS4MXgqQL4fAmVmrScmA69r/O4y5B3bHNPY1yqTYx9Zomel6bfZ3dsjQScHCrwKOLVUob4+Hn92FvT289OUMncks1OwQg5S5bET06Nd4jYhd7C5kKUzsH+vHDJV7mgMmBqRuf8m1rNUUge3XO4ra19Uil8Ou6qDGJXJRPbYOZWHoq2ey+Nr6h5j8zUPfr/wo5PiTSWdNw1PJG/MXOQiBHRi4hbVTw2ZIFIlxi1ARh+/D2aV5KbziAmxeCHu9o/1g6YJrUAxGNI9Qtd9Po5SY53sGmZ4mBF6U="}},{"_index":".kibana_security_session_1","_id":"/zR3wzCyms2bRib56OUNOn6WSHMnKbM2VjNtyBZrpJw=","_score":1,"_source":{"provider":{"type":"basic","name":"basic1"},"idleTimeoutExpiration":1720528173016,"lifespanExpiration":1723116573016,"createdAt":1720524573016,"usernameHash":"e9ab99ee1daa1aa2b5cac38d446ac31f555b0a0bd0cd7a7335a3a2a065635e64","content":"txDKnicnMjQu/ZP42i+W9BT5sM81jTL8VeHXYLTYfyWiQV7fPB98IJKQeWHPfeEjweGwjvHOfU/zooZANLYv81IeY5X4JlOL3O51aPTdf6a5iaSwJTi2DN3ASiOhi7OcLqabMs8csbAbzP6w9aN6O1PmMekPmdiLt/aQlTNlLWJPMuJ4drdTU6FJ+MFc4aOa4X1MfFX/brBv+AQ/fcr5Fq4oB/7EJPSoj3b8hjUeYaZ1DTu6sB381HfpXkg3OfihRdioUnTGjwaSD20HwT9c+roI9GZhmqewijXDoW+xwLyfv5JYtkxzP+zlktdPbGBZI2NUmxChUUsrMNDAbyEQ7aM="}}]}.
[00:00:03]         │ proc [kibana] [2024-07-09T11:29:33.098+00:00][ERROR][http] 500 Server Error {"http":{"response":{"status_code":500},"request":{"method":"post","path":"/session/_run_cleanup"}},"error":{"message":"Failed to run task \"session_cleanup\" as it is currently running"},"service":{"node":{"roles":["background_tasks","ui"]}}}
[00:00:03]         │ debg --- retry.tryForTime error: expected 200 "OK", got 500 "Internal Server Error"

That means the cleanup job itself was already running, which shouldn't be the case, because we set the interval to 5h beforehand, and this is the very first test in our test suite and the first time we invoke session/_run_cleanup.

So if the cleanup job is running on some different interval, it might corrupt sessions from the following tests even before we invoke session/_run_cleanup, which sometimes leads to flaky behaviour. (Two test suites running on the same node and overriding the interval?)

cc @azasypkin Perhaps you can shed some light on this; I don't have that much context around the FTR setup in general.

@kibanamachine
Contributor Author

New failure: kibana-elasticsearch-snapshot-verify - 8.15

@azasypkin
Member

New failure: kibana-elasticsearch-snapshot-verify - 8.15

Same reason as described in #149091 (comment)

@legrego legrego closed this as completed Oct 1, 2024