Logger middleware: support gzipped responses from graphite-web #1693

fkaleo · 2020-02-27T14:16:48Z

graphite-web can (and in my testing does) gzip compress its responses. When it errors and we want to log that error, metrictank needs to unzip the body.

Caveat: detecting gzip compression is done based on the Content-Encoding header which sometimes is set too early by the gziper middleware before compression has happened. We handle that case by logging the uncompressed error when decompression fails.

woodsaj · 2020-02-27T15:01:09Z

which sometimes is set too early by the gziper middleware before compression has happened

This sounds like the order the gziper and log middlerware handlers are being loaded is wrong.

the gzipper middleware sets the content-type header only when w.WriteHeader(int) is called. Then when w.Write([]byte) the data is compressed and passed through to the underlying responseWriter.

So it sounds like the following is happening.

Gzip middlerware wraps ResponseWriter as gzipResponseWriter
Logging middleware wraps gzipResponseWriter as loggedResponseWriter

When the response results in an error:

loggedResponseWriter.WriteHeader(500) is called
gzipResponseWriter.WriteHeader(500) is called
content-type is set to gzip
ResponseWriter.WriteHeader(500) is called
loggedResponseWriter.Write(body) is called
body is logged, but body has not yet been compressed
gzipResponseWriter.Write(body) is called
body is compressed and ResponseWriter.Write(compressedBody) is called.

to fix this, i think that r.Use(middleware.Logger())

metrictank/api/routes.go

Line 21 in 43d0f40

r.Use(middleware.Logger())

needs to be called before any of the the other middleware handlers.

This will ensure that logs are last thin to happen before response data is pushed to the client.

Dieterbe · 2020-02-27T17:49:14Z

might make more sense to make awoods the reviewer of this. I'm gonna be OOO for about a week and he knows this code much better.

woodsaj · 2020-02-27T20:32:21Z

see my comments in PR #1694 #1694 (review)

woodsaj · 2020-02-27T20:39:18Z

api/middleware/logger.go

 func (rw *LoggingResponseWriter) Write(b []byte) (int, error) {
 if rw.ResponseWriter.Status() >= 400 {
- rw.errBody = make([]byte, len(b))
- copy(rw.errBody, b)
+ if rw.Header().Get("Content-Encoding") == "gzip" {


This shouldnt be done here. We should just store the compressed payload in rw.errBody and do the decompress when we write the log message.

The client wont get their response until we call the rw.responseWriter(b), so we should call it as soon as possible.

The log message is not written until after the request has been completely sent to the client, so spending the time to decompress the data their wont impact request latency (yes, failed responses are likely only going to have small payloads and so it probably wont matter where we do the decompress)

Fixed in ff4b0e5

woodsaj · 2020-02-27T20:42:02Z

api/middleware/logger.go

+ // This happens when the gziper middleware is enabled because it
+ // sets the 'Content-Encoding' header to 'gzip' before it actually
+ // compresses the body.
+ rw.errBody = make([]byte, len(b))


rw.Write(b) might be called multiple times. This will result in the errBody only containing the end of the data.
If response resulted in an Error and the payload is really large, we probably dont want all of it, but we will want the first part of it.

Fixed with ff4b0e5 by buffering up all the entire response. For now I think we can store it all. Having it all might be necessary in the gzip case.

…d after gziper

… needed

fkaleo · 2020-02-27T22:13:43Z

which sometimes is set too early by the gziper middleware before compression has happened

This sounds like the order the gziper and log middlerware handlers are being loaded is wrong.

the gzipper middleware sets the content-type header only when w.WriteHeader(int) is called. Then when w.Write([]byte) the data is compressed and passed through to the underlying responseWriter.

So it sounds like the following is happening.

Gzip middlerware wraps ResponseWriter as gzipResponseWriter
Logging middleware wraps gzipResponseWriter as loggedResponseWriter

When the response results in an error:
* loggedResponseWriter.WriteHeader(500) is called

* gzipResponseWriter.WriteHeader(500) is called

* content-type is set to gzip

* ResponseWriter.WriteHeader(500) is called

* loggedResponseWriter.Write(body) is called

* body is logged, but body has not yet been compressed

* gzipResponseWriter.Write(body) is called

* body is compressed and ResponseWriter.Write(compressedBody) is called.
to fix this, i think that r.Use(middleware.Logger())

metrictank/api/routes.go

Line 21 in 43d0f40

r.Use(middleware.Logger())

needs to be called before any of the the other middleware handlers.

This will ensure that logs are last thin to happen before response data is pushed to the client.

Reordered the middlewares to follow your advice in 8356c6f.
Logging is wrapping first (good for timing) and called after gziper which ensures it receives a gzipped response if the Content-Encoding header is set to gzip.

woodsaj · 2020-02-28T09:33:52Z

I think we should make a few more improvements here, though they are slightly out of scope.

both the logging and tracing middleware are capturing the responseBody if the request status >= 400. we dont need two copies of the body.
I really think we should set a cap on the number of bytes to capture. if the something goes wrong and a really large response is written we are going to cause problems for our logging and tracing platforms if we try and write 100's of KB or larger.

to address these i suggest

merge the tracing and logging middleware into one, or use a shared responseWriter wrapper.
for a shared responseWriter you can just check if current responseWriter is already a 'CaptureErrorResponseWriter' or not to see if you need to updated it. eg

  var wrappedRW *CaptureErrorResponseWriter
  if wrappedRW, ok := ctx.Resp.(*CaptureErrorResponseWriter); !ok {
      wrappedRW = &CaptureErrorResponseWriter{ResponseWriter: ctx.Resp
  }
  ....
  errBody := wrappedRW.ErrorBody()

only put up to 1k of data into the captured errBody, and then additionally if we need to decompress it, only return the first 1k of data after decompression. this will work fine with gzip. It will decompress what it can then return an error. So if you get decoded data back, just ignore the error.

fkaleo · 2020-02-28T10:21:58Z

I think we should make a few more improvements here, though they are slightly out of scope.

1. both the logging and tracing middleware are capturing the responseBody if the request status >= 400.  we dont need two copies of the body.

2. I really think we should set a cap on the number of bytes to capture.  if the something goes wrong and a really large response is written we are going to cause problems for our logging and tracing platforms if we try and write 100's of KB or larger.

to address these i suggest

1. merge the tracing and logging middleware into one, or use a shared responseWriter wrapper.
   for a shared responseWriter you can just check if current responseWriter is already a 'CaptureErrorResponseWriter' or not to see if you need to updated it.  eg

  var wrappedRW *CaptureErrorResponseWriter
  if wrappedRW, ok := ctx.Resp.(*CaptureErrorResponseWriter); !ok {
      wrappedRW = &CaptureErrorResponseWriter{ResponseWriter: ctx.Resp
  }
  ....
  errBody := wrappedRW.ErrorBody()

1. only put up to 1k of data into the captured errBody, and then additionally if we need to decompress it, only return the first 1k of data after decompression.  this will work fine with gzip. It will decompress what it can then return an error.  So if you get decoded data back, just ignore the error.

Fair points. Can I do that in a separate PR?

woodsaj · 2020-02-28T10:23:04Z

Fair points. Can I do that in a separate PR?

Yes

Logger middleware: support gzipped responses from graphite-web

43d0f40

fkaleo requested a review from Dieterbe February 27, 2020 14:19

woodsaj reviewed Feb 27, 2020

View reviewed changes

fkaleo removed the request for review from Dieterbe February 27, 2020 21:09

fkaleo mentioned this pull request Feb 27, 2020

Push renderer to last position so that logger middleware properly captures all errors #1694

Closed

fkaleo added 2 commits February 27, 2020 23:05

Reorderded middleware so that logging is first (for timing) and calle…

8356c6f

…d after gziper

Logging/TracingResponseWriter to buffer Writes and decompress gzip if…

ff4b0e5

… needed

woodsaj approved these changes Feb 28, 2020

View reviewed changes

fkaleo merged commit 5dbbbc3 into master Feb 28, 2020

fkaleo deleted the logger_gzip_support branch February 28, 2020 10:23

fkaleo mentioned this pull request Feb 28, 2020

Logger/tracer middleware: store responseBody once and cap its size #1697

Closed

fkaleo added this to the sprint-8 milestone Feb 28, 2020

fkaleo self-assigned this Feb 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Logger middleware: support gzipped responses from graphite-web #1693

Logger middleware: support gzipped responses from graphite-web #1693

fkaleo commented Feb 27, 2020

woodsaj commented Feb 27, 2020

Dieterbe commented Feb 27, 2020

woodsaj commented Feb 27, 2020

woodsaj Feb 27, 2020

fkaleo Feb 27, 2020

woodsaj Feb 27, 2020

fkaleo Feb 27, 2020

fkaleo commented Feb 27, 2020

woodsaj commented Feb 28, 2020

fkaleo commented Feb 28, 2020

woodsaj commented Feb 28, 2020

Logger middleware: support gzipped responses from graphite-web #1693

Logger middleware: support gzipped responses from graphite-web #1693

Conversation

fkaleo commented Feb 27, 2020

woodsaj commented Feb 27, 2020

Dieterbe commented Feb 27, 2020

woodsaj commented Feb 27, 2020

woodsaj Feb 27, 2020

Choose a reason for hiding this comment

fkaleo Feb 27, 2020

Choose a reason for hiding this comment

woodsaj Feb 27, 2020

Choose a reason for hiding this comment

fkaleo Feb 27, 2020

Choose a reason for hiding this comment

fkaleo commented Feb 27, 2020

woodsaj commented Feb 28, 2020

fkaleo commented Feb 28, 2020

woodsaj commented Feb 28, 2020