
cached cloudwatch log group tags #540

Merged
merged 19 commits into master on Mar 17, 2022
Conversation

jvanbrie
Contributor

@jvanbrie jvanbrie commented Mar 10, 2022

What does this PR do?

This PR separates out our caching logic and creates a new cache specifically for CloudWatch log group tags.

Motivation

The caching logic was built around the Lambda custom tags use case, which isn't a great fit for future caching use cases. In addition, we've gotten requests for support for CloudWatch log group tags.

Testing Guidelines

Lambdas are running this new code just fine. We can see the S3 caches updating as well.

Additional Notes

Types of changes

  • Bug fix
  • New feature
  • Breaking change
  • Misc (docs, refactoring, dependency upgrade, etc.)

Check all that apply

  • This PR's description is comprehensive
  • This PR contains breaking changes that are documented in the description
  • This PR introduces new APIs or parameters that are documented and unlikely to change in the foreseeable future
  • This PR impacts documentation, and it has been updated (or a ticket has been logged)
  • This PR's changes are covered by the automated tests
  • This PR collects user input/sensitive content into Datadog
  • This PR passes the integration tests (ask a Datadog member to run the tests)
  • This PR passes the unit tests
  • This PR passes the installation tests (ask a Datadog member to run the tests)

@jvanbrie jvanbrie changed the title cached log tags cached cloudwatch log group tags Mar 14, 2022
@jvanbrie jvanbrie marked this pull request as ready for review March 14, 2022 14:39
@dylanburati dylanburati left a comment

This looks good to me.

    DD_S3_BUCKET_NAME, self.CACHE_LOCK_FILENAME
)
try:
    file_content = cache_lock_object.get()
Contributor

I'm not sure I fully understand how this S3 file is being used as a lock. My thought would be that there would be something to block on acquiring the lock, but I'm not seeing that.

Contributor Author

To acquire the lock we create a lock s3 file, and then to remove the lock we delete it. If the file exists we know that the lock is active and there is a lambda running that's in the process of updating the tags. If the lock is active and we didn't grab it, then we don't need to update the tags in the current running lambda process as they'll be updated in the process that grabbed the lock.
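The create-to-acquire, delete-to-release scheme described above can be sketched as follows. This is a minimal illustration only: an in-memory dict stands in for the S3 bucket, and the class and method names (`S3CacheLock`, `acquire`, `release`) are hypothetical, not the PR's actual identifiers. The real forwarder would use boto3 object put/get/delete calls against the `DD_S3_BUCKET_NAME` bucket.

```python
# Minimal sketch of a non-blocking lock backed by the existence of a file.
# A plain dict simulates the S3 bucket; names here are illustrative.
class S3CacheLock:
    LOCK_KEY = "cache.lock"

    def __init__(self, bucket):
        self.bucket = bucket  # dict simulating an S3 bucket

    def acquire(self):
        # Non-blocking: if the lock object already exists, another Lambda
        # invocation is refreshing the cache, so this one skips the refresh.
        if self.LOCK_KEY in self.bucket:
            return False
        self.bucket[self.LOCK_KEY] = b"locked"
        return True

    def release(self):
        # Deleting the lock object releases the lock.
        self.bucket.pop(self.LOCK_KEY, None)
```

The key design point is that `acquire` never blocks: an invocation that fails to grab the lock simply serves stale tags and moves on, trusting the lock holder to refresh the cache.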

@jvanbrie jvanbrie mentioned this pull request Mar 14, 2022
"""
new_tags = {}
for log_group in self.tags_by_id.keys():
new_tags[log_group] = get_log_group_tags(log_group)
Contributor

Shouldn't we return False (the first var) if there was an exception here, similar to LambdaCustomTagsCache?

I think if there was an exception from get_log_group_tags, we'd overwrite as empty tags which we don't want to do

Contributor Author

Here we're making multiple API calls to refill the cache, so I don't want to return False and skip writing to the cache if just one of those API calls fails. I suppose if all the API calls fail we could return False here and skip writing to the S3 cache.
If an API call fails, we can take the value we have stored in the local cache to avoid overwriting tags on a failed call.
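The fallback described above can be sketched as a pure function; this is illustrative only, assuming a fetch callable in the shape of `get_log_group_tags` that may raise on API failure (the names `refresh_tags` and `fetch` are hypothetical).

```python
# Sketch: refill the cache, but keep the previously cached value for any
# log group whose API call fails, instead of overwriting it with nothing.
def refresh_tags(cached, fetch):
    new_tags = {}
    for log_group, old_tags in cached.items():
        try:
            new_tags[log_group] = fetch(log_group)
        except Exception:
            # Fall back to the local cache on a failed API call.
            new_tags[log_group] = old_tags
    return new_tags
```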

log_group_tags = self.tags_by_id.get(log_group, None)
if log_group_tags is None:
    log_group_tags = get_log_group_tags(log_group)
    self.tags_by_id[log_group] = log_group_tags
Contributor

Is this needed? I think we're updating self.tags_by_id in the _refresh function

Contributor Author

@jvanbrie jvanbrie Mar 15, 2022

Right, we'll get here if we encounter a log group that isn't in self.tags_by_id, i.e. a log group we haven't gotten logs from before. In that case we grab tags for it and add it temporarily to self.tags_by_id; it'll get added to the S3 cache when we refresh. This lets us get tags for it right away rather than waiting for a refresh, or refreshing all tags whenever we get one new log group.
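The on-demand path described above can be sketched as follows (function and parameter names are illustrative, not the PR's actual identifiers): a log group missing from the local cache gets its tags fetched immediately and held locally until the next S3 refresh persists them.

```python
# Sketch: serve tags from the local cache, fetching on demand for a log
# group we haven't seen before; the entry is kept locally and persisted
# to the S3 cache on the next scheduled refresh.
def get_tags(tags_by_id, log_group, fetch):
    tags = tags_by_id.get(log_group)
    if tags is None:
        tags = fetch(log_group)
        tags_by_id[log_group] = tags  # held locally until the next refresh
    return tags
```

This avoids the two bad extremes: waiting for the next refresh to tag logs from a new log group, or refreshing every log group's tags each time a single new one appears.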

@magnetikonline
Contributor

Just to echo my comments in #531 (comment) - it seems that the existing DD_FETCH_LAMBDA_TAGS argument will be the only way to opt out of this feature?

@jvanbrie
Contributor Author

jvanbrie commented Mar 16, 2022

Yep, all tag collection will be controlled by that argument.
Edit: Just made a change to separate out control over both tag fetches.

Contributor

@hghotra hghotra left a comment

LGTM 👍🏼

@magnetikonline
Contributor

Edit: Just made a change to separate out control over both tag fetches.

Wonderful - thanks @jvanbrie - reviewed that commit, makes this whole PR opt-in 👍

jvanbrie and others added 3 commits March 17, 2022 13:42
Co-authored-by: Peter Mescalchin <peter@magnetikonline.com>
@jvanbrie jvanbrie merged commit 48581da into master Mar 17, 2022
@jvanbrie jvanbrie deleted the jon.vanbriesen/cached_log_tags branch March 17, 2022 18:13
@magnetikonline
Contributor

@jvanbrie did you cut the release right? I can't see any of your changes between the latest release and the previous one?

aws-dd-forwarder-3.42.0...aws-dd-forwarder-3.43.0

Maybe I'm just having a slow day here? 😄.

@jvanbrie
Contributor Author

Something is definitely fishy here. Looks like the changes didn't make it into that release. I'll cut a new release tomorrow and see if that has the changes.

@jvanbrie
Contributor Author

Thanks for bringing this up, just released a new version and it grabbed the changes this time. aws-dd-forwarder-3.43.0...aws-dd-forwarder-3.44.0

@magnetikonline
Contributor

Thanks for bringing this up, just released a new version and it grabbed the changes this time.

No dramas @jvanbrie - agreed - can definitely see the changes in there now!
