Implement filtering mechanism in the Logging service #57547

mshustov · 2020-02-13T10:28:04Z

Blocker for #13241
Once we move request/response logging to the new platform, we need to provide a way to censor sensitive data in the logs (e.g. authorization or cookie headers).
Evaluate how much work it would be to support a filtering mechanism compatible with elasticsearch logging settings https://logging.apache.org/log4j/2.x/manual/filters.html

The text was updated successfully, but these errors were encountered:

elasticmachine · 2020-02-13T10:28:07Z

Pinging @elastic/kibana-platform (Team:Platform)

pgayvallet · 2020-02-14T07:56:15Z

In l4j, that kind of filter is usually based on informations present in the MDC. We would not be able to rely on such mechanism. Do we have anything better than applying regexp for that kind of filter then?
Do we want to filter (remove) the whole log message when it contains sensitive data, or do we just want to remove/obstruct the sensitive data in the log message but still log it.

joshdover · 2020-02-18T14:42:01Z

Do we want to filter (remove) the whole log message when it contains sensitive data, or do we just want to remove/obstruct the sensitive data in the log message but still log it.

I strongly prefer we just filter the sensitive data out but still log the message.

mshustov · 2020-02-20T11:59:00Z

In l4j, that kind of filter is usually based on informations present in the MDC. We would not be able to relies on such mechanism. Do we have anything better than applying regexp for that kind of filter then?

LP relies on Hapi log output for request/response and filters data in JSON before formatting them. We will control the output format for request/response logging when #13241 lands. We can log additional data as MetaData and apply the filter to metadata in JSON format.

Do we want to filter (remove) the whole log message when it contains sensitive data, or do we just want to remove/obstruct the sensitive data in the log message but still log it.

I strongly prefer we just filter the sensitive data out but still log the message.

The current implementation allows only 2 operations:

remove a property
censor a string property (the full string or via regexp)

Example https://github.com/elastic/kibana/blob/8e9a8a84dccfa7965ce8a22362885e6cdef8b51f/src/legacy/server/logging/apply_filters_to_keys.js

lukeelmers · 2021-01-11T04:55:09Z

A few initial thoughts:

The current implementation allows only 2 operations:

remove a property

censor a string property (the full string or via regexp)

As @restrry outlines above, legacy platform isn't providing too much for us here: with the current filters config, we can basically just remove a property, or censor via regex.

The concept of filters in log4j actually sounds a bit different from the way we've used "filters" in LP logging; they basically serve to determine whether a log entry should be included in its entirety, or thrown out:

each filter returns an ACCEPT, DENY or NEUTRAL value for a provided log record
it allows you to configure filters at multiple levels (context, loggers, appenders, appender references)
there are lots of different types of filters besides regex, allowing you to do everything from rate-limiting to executing custom scripts

However, AFAICT what you can't do with log4j filters alone is modify existing log messages. For that they provide a RewriteAppender, which is sort of a proxy appender that modifies a log entry based on configuration before passing it to a "destination" appender.

RewriteAppenders and Filters do have some overlap (you can optionally provide a Filter in a RewriteAppender configuration), but if our primary goal is feature parity with LP, I don't think we necessarily need to introduce the concept of Filters at all -- that feels like an entirely different feature. Rather, we'd need to create our own RewriteAppender.

This would let us continue to do things like "censoring" a log message by redacting particular headers, performing string replacements in a log message, adding new data to a message, or upgrading/downgrading a log's level.

I think the main question would be how to make the config for this as simple as possible. The logic for the actions that can be performed by a RewriteAppender reside in a RewritePolicy. In our case, it seems some type of "MetaRewritePolicy" may be all we need at first.

The config could potentially look something like this:

logging:
  appenders:
    file:
      kind: file
      path: ./kibana.log
      layout:
        kind: json
    censor:
      kind: rewrite
      appenders: [file] # the destination appender where this is sent after modification
      policy:
        kind: meta # name of the policy
        mode: update # or "add" or "remove" etc
        # Need to think about naming here. log4j calls this KeyValuePair,
        # but in some cases ("remove") you may not need a value.
        property:
          key: "headers.authorization" # path within the log meta object
          value: "[redacted]"

  root:
    appenders: [censor, default]
    level: debug

The legacy approach seems simpler from a configuration standpoint, but is also less powerful and of course not as aligned with log4j 2. Would be interested to get some feedback on what feels like a logical first step to take.

cc @restrry @joshdover @pgayvallet

joshdover · 2021-01-11T16:18:57Z

In terms of keeping this in line with log4j, I think a rewrite appender makes sense 👍 To simplify things a bit, we could omit the kind: meta and mode: update keys from the rewrite policy and only introduce them if we need these features in the future. For the properties, maybe this shape makes more sense:

properties:
  - key: "headers.authorization"
    value: "[redacted]"

That way multiple properties could be on the same policy, and we could still easily support just key for the remove use case.

That said, I'm wondering if we need to support a configurable interface at all. We don't currently document the logging.filter configuration, so I don't believe we have to explicitly support it (?). We could possibly get away with hard-coding this logic in the BaseLogger class for the known keys (eg header.authorization) and only adding a true rewrite appender once we really need it. It would be great if we had some telemetry on the usage of this, but it doesn't appear that we do 🙁

lukeelmers · 2021-01-12T14:43:24Z

We don't currently document the logging.filter configuration, so I don't believe we have to explicitly support it (?). We could possibly get away with hard-coding this logic in the BaseLogger class for the known keys (eg header.authorization) and only adding a true rewrite appender once we really need it.

Yeah this would certainly save some effort, although my takeaway from the discussion in #13241 was that we still want to provide the ability to do this.

FWIW I don't think it would be enormous task to create the simplest iteration of a rewrite appender as you describe above (with a policy that will only change properties in the meta). However if we don't need that functionality at all, we can easily just exclude the authorization/cookie headers in the course of #13241.

mshustov · 2021-01-12T16:52:48Z

I don't think it would be enormous task to create the simplest iteration of a rewrite appender as you describe above (with a policy that will only change properties in the meta)

Probably we can implement this but keep it private as an escape hatch for debug purposes #13241 (comment)?

That way multiple properties could be on the same policy, and we could still easily support just key for the remove use case.

How would you add censored fields back? Add removal operation only if specified in the config?

lukeelmers · 2021-01-12T17:45:38Z

How would you add censored fields back? Add removal operation only if specified in the config?

One option could be to remove properties if null is specified, otherwise do a replacement, which means you wouldn't necessarily need a "mode" initially (unless other policies were added later):

logging:
  appenders:
    censor:
      kind: rewrite
      appenders: [file] # the destination appender where this is sent after modification
      policy:
        kind: meta # name of the policy
        properties:
          - key: "headers.authorization" # path within the log meta object
            value: "[redacted]"
          - key: "headers.cookie"
            value: null # removes this property entirely

lukeelmers · 2021-01-12T18:10:46Z

We don't currently document the logging.filter configuration, so I don't believe we have to explicitly support it (?)

To follow up on this: the cloud team has confirmed that the logging.filter config isn't exposed to users and also isn't used anywhere internally.

So if we decided not to address this at all and simply exclude Authorization and Cookie headers, the only people who would be affected are folks who have discovered this undocumented setting and have been relying on it. (And we could add a config deprecation to warn them it is going away)

Alternatively we could implement RewriteAppender for debugging purposes as @restrry suggests, and just leave it undocumented for the time being.

WDYT @restrry @joshdover? Have we seen any examples where having access to these headers would be useful from a debugging/support perspective?

mshustov · 2021-01-13T08:10:29Z

@lukeelmers Security team might need it to debug problems with login. I'd suggest you timebox RewriteAppender implementation and move forward without it if you face any problems.

lukeelmers · 2021-02-23T22:57:45Z

This will be closed by #91492 with one important caveat: All http response logs will redact authorization, cookie, and set-cookie headers by default, and this can not be overridden with the new RewriteAppender.

If we decide to pick up #92082 as a future enhancement, we'll be able to lift this restriction at that time and folks will be able to explicitly disable the censoring of those headers should they choose to do so.

With the exception of the 3 headers listed above, the new appender will allow users to remove & modify any path in the LogMeta from any logger.

mshustov added blocker Team:Core Core services & architecture: plugins, logging, config, saved objects, http, ES client, i18n, etc Feature:New Platform labels Feb 13, 2020

This was referenced Feb 13, 2020

[8.0] Remove @kbn/legacy-logging #50660

Closed

Logging service uses new config format #41956

Closed

mshustov added the Feature:Logging label Feb 21, 2020

mshustov mentioned this issue Feb 21, 2020

New platform Logging service improvements #58261

Closed

11 tasks

mshustov removed the Team:Core Core services & architecture: plugins, logging, config, saved objects, http, ES client, i18n, etc label Feb 26, 2020

joshdover mentioned this issue Mar 17, 2020

[Meta] Logging Projects #60391

Closed

30 tasks

mshustov mentioned this issue Jun 25, 2020

[Audit Logging] Add AuditTrail service #69278

Merged

joshdover mentioned this issue Nov 25, 2020

[meta] Core Team 8.0 Projects #84380

Closed

33 tasks

lukeelmers self-assigned this Dec 16, 2020

lukeelmers mentioned this issue Dec 16, 2020

Log HTTP requests, responses #13241

Closed

3 tasks

lukeelmers mentioned this issue Jan 20, 2021

[core.logging] Add response logs to the KP logging system. #87939

Merged

lukeelmers mentioned this issue Feb 16, 2021

[core.logging] Add RewriteAppender for filtering LogMeta. #91492

Merged

lukeelmers closed this as completed in #91492 Feb 24, 2021

rudolf mentioned this issue Nov 20, 2023

Add ability to configure RewritePolicy that can rewrite any fields #171523

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement filtering mechanism in the Logging service #57547

Implement filtering mechanism in the Logging service #57547

mshustov commented Feb 13, 2020

elasticmachine commented Feb 13, 2020

pgayvallet commented Feb 14, 2020 •

edited

Loading

joshdover commented Feb 18, 2020

mshustov commented Feb 20, 2020 •

edited

Loading

lukeelmers commented Jan 11, 2021 •

edited

Loading

joshdover commented Jan 11, 2021

lukeelmers commented Jan 12, 2021

mshustov commented Jan 12, 2021 •

edited

Loading

lukeelmers commented Jan 12, 2021

lukeelmers commented Jan 12, 2021

mshustov commented Jan 13, 2021

lukeelmers commented Feb 23, 2021

Implement filtering mechanism in the Logging service #57547

Implement filtering mechanism in the Logging service #57547

Comments

mshustov commented Feb 13, 2020

elasticmachine commented Feb 13, 2020

pgayvallet commented Feb 14, 2020 • edited Loading

joshdover commented Feb 18, 2020

mshustov commented Feb 20, 2020 • edited Loading

lukeelmers commented Jan 11, 2021 • edited Loading

joshdover commented Jan 11, 2021

lukeelmers commented Jan 12, 2021

mshustov commented Jan 12, 2021 • edited Loading

lukeelmers commented Jan 12, 2021

lukeelmers commented Jan 12, 2021

mshustov commented Jan 13, 2021

lukeelmers commented Feb 23, 2021

pgayvallet commented Feb 14, 2020 •

edited

Loading

mshustov commented Feb 20, 2020 •

edited

Loading

lukeelmers commented Jan 11, 2021 •

edited

Loading

mshustov commented Jan 12, 2021 •

edited

Loading