Skip to content

AWS for Fluent Bit 2.23.0

Compare
Choose a tag to compare
@zhonghui12 zhonghui12 released this 11 Mar 01:27
· 424 commits to mainline since this release

2.23.0

This release includes:

  • An Amazon Linux 2 Base
  • Fluent Bit 1.8.13
  • Amazon CloudWatch Logs for Fluent Bit 1.7.0
  • Amazon Kinesis Streams for Fluent Bit 1.9.0
  • Amazon Kinesis Firehose for Fluent Bit 1.6.1

Compared to 2.22.0 this release adds:

  • Feature - Add timeout config for AWS SDK Go HTTP calls kinesis:178
  • Enhancement - Migrate AWS plugins to performant 2.25.0-mbedtls base64 implementation fluentbit:4422
  • Bug - Fix message loss issue using concurrency feature with 0 retries kinesis:179

Compared to 2.22.0 this release deletes the following fix which has been covered in new enhancement:

  • Bug - Downgrade mbedtls to 2.24.0 to fix the performance regression issue in mbedtls 2.26.0 fluentbit:4110

Same as 2.22.0, this release includes the following fix for AWS customers that we are working on getting accepted upstream:


We’ve run the new released image in our ECS load testing framework and here is the result. This testing result provides benchmarks of aws-for-fluent-bit under different input load.

plugin 20Mb/s 25Mb/s 30Mb/s
kinesis_firehose Log Loss
Log Duplication 0%(1064)
kinesis_streams Log Loss
Log Duplication
s3 Log Loss
Log Duplication
plugin 1Mb/s 2Mb/s 3Mb/s
cloudwatch_logs Log Loss 0%(332) 0%(2678)
Log Duplication 6%(71497) 2%(39273)

Note:

  • The green check ✅ in the table means no log loss or no log duplication.
  • Number in parentheses means the number of records out of total records. For example, 0%(1064) under 30Mb/s throughput means 1064 duplicate records out of 18M input records by which log duplication percentage is 0%.
  • CloudWatch has own throughput limit for single log stream. Based on our tests, it starts to appear throttling issue after input load > 1Mb/s.
  • Log loss is the percentage of data lost and log duplication is the percentage of duplicate logs received at the destination. Your results may differ because they can be influenced by many factors like different configs and environment settings. Log duplication is exclusively caused by partially succeeded batches that were retried, which means it is random.