Optimize identify anomaly periods algorithm #11

kavehshahedi · 2025-02-27T21:40:31Z

What it does

The original implementation created and processed sliding windows one at a time, which led to hour-long processing times on million-row datasets. The new implementation keeps the detection logic identical but computes all window means at once with a vectorized cumulative sum, which greatly reduces computation time.
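For reviewers unfamiliar with the technique, here is a minimal sketch of the cumulative sum idea. It is not the code from this PR: the function names `window_means_loop` and `window_means_cumsum` and the sample data are hypothetical, and it assumes a plain NumPy array of values and a fixed window size.

```python
import numpy as np


def window_means_loop(values, window):
    """Baseline: build each sliding window and average it, one at a time."""
    values = np.asarray(values, dtype=float)
    return np.array([values[i:i + window].mean()
                     for i in range(len(values) - window + 1)])


def window_means_cumsum(values, window):
    """Compute every sliding-window mean in one pass using a cumulative sum.

    Hypothetical helper for illustration only, not the project's API.
    """
    values = np.asarray(values, dtype=float)
    if window <= 0 or window > len(values):
        return np.empty(0)
    # Prepend 0 so that csum[k] is the sum of the first k values.
    csum = np.concatenate(([0.0], np.cumsum(values)))
    # Sum of values[i:i + window] is csum[i + window] - csum[i].
    return (csum[window:] - csum[:-window]) / window


data = np.array([1.0, 2.0, 3.0, 4.0, 10.0, 4.0, 3.0])
means = window_means_cumsum(data, 3)
```

Each window sum becomes a single subtraction of two prefix sums, so the total work is linear in the number of rows rather than proportional to rows times window size. One caveat with this sketch: on very long series the prefix sums can accumulate floating-point rounding, so results may differ from the loop version in the last few digits.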

This PR also aims to fix the excessive calculation time reported in a previous pull request.

[UPDATE] Support data fetching density from trace server #10

How to test

Initialize the AnomalyDetection module with your custom outputs (e.g., CPU Usage, Memory Usage). On a very large dataset (e.g., millions of data points), you should now observe a significant performance improvement when identifying anomalies.

Follow-ups

N/A

Review checklist

As an author, I have thoroughly tested my changes and carefully followed the instructions in this template

Previously, we created each sliding window one by one, which took forever on large datasets. Now we use a cumulative sum approach that gives identical results but runs much faster on extremely large datasets.

Signed-off-by: Kaveh Shahedi <kaveh.shahedi@ericsson.com>

bhufmann

Thanks for this contribution. It improves performance significantly.

kavehshahedi requested a review from bhufmann February 27, 2025 21:40

kavehshahedi mentioned this pull request Feb 28, 2025

[UPDATE] Support data fetching density from trace server #10

Merged

1 task

bhufmann approved these changes Feb 28, 2025


kavehshahedi merged commit 67fe0e1 into eclipse-tmll:main Feb 28, 2025
4 checks passed

kavehshahedi deleted the window-optimization branch March 6, 2025 17:03

2 participants
