# Using a benchmark target's return value in a column #784
Hi @Warpten, I am aware of this limitation. The problem is that we run the benchmark in a separate process, so passing any data from one to another is not trivial. I know that @KrzysztofCwalina has faced this issue in ML.NET.

@Warpten @KrzysztofCwalina, would it be enough if I added a mechanism to return a string and print it in a dedicated column? Something like:

```csharp
[ExtraData]
public string Size() => Serializer(sth).Size.ToString();
```

The method would be executed just once.
Maybe exposing a method's result as an […]:

```csharp
return (benchmark.Target.Result as IList).Count;
```

Or even more complex things like:

```csharp
// Suppose this is a variation of MeanColumn
return /* mean time here */ / (benchmark.Target.Result as IList).Count;
```

This is however assuming that […]
But how should I then serialize it and pass it from one process to another without introducing any dependencies on BenchmarkDotNet?
I think the string is fine; we could implement serialisation on top of that ourselves for advanced cases.
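To make the "serialisation on top of a string" idea concrete, here is a minimal sketch, not BenchmarkDotNet API: `System.Text.Json`, the class, and the metric names are all my assumptions. It packs several metrics into the single string channel on the benchmark side and unpacks them on the host side:

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

public static class StringMetricsDemo
{
    // Benchmark-process side: pack arbitrary metrics into the single
    // string that the proposed [ExtraData]-style hook would return.
    public static string PackMetrics(Dictionary<string, double> metrics) =>
        JsonSerializer.Serialize(metrics);

    // Host-process side: unpack the string back into structured data
    // that a custom column could consume.
    public static Dictionary<string, double> UnpackMetrics(string payload) =>
        JsonSerializer.Deserialize<Dictionary<string, double>>(payload);

    public static void Main()
    {
        string packed = PackMetrics(new Dictionary<string, double>
        {
            ["SizeBytes"] = 1024,
            ["ItemCount"] = 3
        });
        Console.WriteLine(UnpackMetrics(packed)["SizeBytes"]); // prints 1024
    }
}
```

The string stays a plain opaque value as far as the framework is concerned; only the user's column needs to know it is JSON.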
Coming back to this, I didn't know at the time that BenchmarkDotNet was spawning subprocesses (and I also only skimmed your answer, Adam; sorry about that!). So I guess strings are the best way out.
Well, something like this is possible even now. Look at:

```csharp
using System.IO;
using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Configs;
using Comparisons.SQLiteVSDoublets.Model;
using Comparisons.SQLiteVSDoublets.SQLite;
using Comparisons.SQLiteVSDoublets.Doublets;

namespace Comparisons.SQLiteVSDoublets
{
    [ClrJob, CoreJob]
    [MemoryDiagnoser]
    [WarmupCount(2)]
    [IterationCount(1)]
    [Config(typeof(Config))]
    public class Benchmarks
    {
        private class Config : ManualConfig
        {
            public Config() => Add(new SizeAfterCreationColumn());
        }

        [Params(1000, 10000, 100000)]
        public int N;

        private SQLiteTestRun _sqliteTestRun;
        private DoubletsTestRun _doubletsTestRun;

        [GlobalSetup]
        public void Setup()
        {
            BlogPosts.GenerateData(N);
            _sqliteTestRun = new SQLiteTestRun("test.db");
            _doubletsTestRun = new DoubletsTestRun("test.links");
        }

        [Benchmark]
        public void SQLite() => _sqliteTestRun.Run();

        [IterationCleanup(Target = "SQLite")]
        public void SQLiteOutput() => File.WriteAllText($"disk-size.sqlite.{N}.txt", _sqliteTestRun.Results.DbSizeAfterCreation.ToString());

        [Benchmark]
        public void Doublets() => _doubletsTestRun.Run();

        [IterationCleanup(Target = "Doublets")]
        public void DoubletsOutput() => File.WriteAllText($"disk-size.doublets.{N}.txt", _doubletsTestRun.Results.DbSizeAfterCreation.ToString());
    }
}
```

You will also need a custom column implementation to fit data from files into the report:

```csharp
using System;
using System.IO;
using System.Linq;
using BenchmarkDotNet.Columns;
using BenchmarkDotNet.Reports;
using BenchmarkDotNet.Running;

namespace Comparisons.SQLiteVSDoublets
{
    public class SizeAfterCreationColumn : IColumn
    {
        public string Id => nameof(SizeAfterCreationColumn);
        public string ColumnName => "SizeAfterCreation";
        public string Legend => "Allocated memory on disk after all records are created (1KB = 1024B)";
        public UnitType UnitType => UnitType.Size;
        public bool AlwaysShow => true;
        public ColumnCategory Category => ColumnCategory.Metric;
        public int PriorityInCategory => 0;
        public bool IsNumeric => true;
        public bool IsAvailable(Summary summary) => true;
        public bool IsDefault(Summary summary, BenchmarkCase benchmarkCase) => false;
        public string GetValue(Summary summary, BenchmarkCase benchmarkCase) => GetValue(summary, benchmarkCase, SummaryStyle.Default);

        public string GetValue(Summary summary, BenchmarkCase benchmarkCase, SummaryStyle style)
        {
            var benchmarkName = benchmarkCase.Descriptor.WorkloadMethod.Name.ToLower();
            var parameter = benchmarkCase.Parameters.Items.FirstOrDefault(x => x.Name == "N");
            if (parameter == null)
            {
                return "no parameter";
            }
            var N = Convert.ToInt32(parameter.Value);
            var filename = $"disk-size.{benchmarkName}.{N}.txt";
            return File.Exists(filename) ? File.ReadAllText(filename) : "no file";
        }

        public override string ToString() => ColumnName;
    }
}
```

But there is a problem with that approach: it works only if […]. The complete example: https://github.com/linksplatform/Comparisons.SQLiteVSDoublets
Being able to pass a simple string would be a great start for this functionality. Being able to pass a […]. Really, I don't care how I get the data from the benchmark into the table; as long as I can show it to the user in a more convenient way than having to search the logs for it, I am happy.

A nice stretch goal might be something in line with @Konard's example: allow the benchmark to emit events that are serialized to a file and can then be processed by a custom column (e.g. to measure averages or count "interesting" events). But maybe this deviates too much from BenchmarkDotNet's core vision?
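As a rough illustration of the event-emission stretch goal: the benchmark could append one value per event to a log file, and a custom column could aggregate it afterwards. A hedged sketch of the aggregation side (the class, file format, and names are made up; this is not a BenchmarkDotNet API):

```csharp
using System;
using System.IO;
using System.Linq;

public static class EventLogAggregator
{
    // Host side: read one numeric event per line, as the benchmark process
    // would have appended them, and reduce to count and average for a column.
    public static (int Count, double Average) Aggregate(string path)
    {
        double[] values = File.ReadAllLines(path).Select(double.Parse).ToArray();
        return (values.Length, values.Average());
    }

    public static void Main()
    {
        string path = Path.GetTempFileName();
        File.WriteAllLines(path, new[] { "10", "20", "30" });
        var (count, average) = Aggregate(path);
        Console.WriteLine($"{count} events, average {average}"); // prints "3 events, average 20"
        File.Delete(path);
    }
}
```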
I'd love to see this. This would be very useful for networking benchmarks to show stats about the socket.
Reposting here since I didn't realize it was a duplicate of #1730. (Same idea as @sandersaares.) I was thinking it could be done by utilizing a method returning a `Dictionary<string, string>`:

```csharp
public class Benchmark
{
    [Benchmark]
    public void MyBenchmark()
    {
        // ...
    }

    [Benchmark]
    public void MyBenchmark2()
    {
        // ...
    }

    [CustomTableResults(Target = nameof(MyBenchmark))]
    public Dictionary<string, string> GetCustomResults()
    {
        return new Dictionary<string, string>()
        {
            ["Example Result"] = "Custom result for " + nameof(MyBenchmark)
        };
    }
}
```

It would output like this: *(example table not preserved)*

Results could be calculated inside the method after the benchmark runs, or calculated in […]. There would be no need to muck about with files with this method (though it lacks the post-processing over all the calculated results that the current columns have).
We could also have a custom post-processor:

```csharp
[CustomTableResults(Target = nameof(MyBenchmark))]
public Dictionary<string, string> GetCustomResults()
{
    return new Dictionary<string, string>()
    {
        ["Example Result"] = "Custom result for " + nameof(MyBenchmark)
    };
}

[CustomTableResultsPostProcess]
public static void PostProcessCustomResults(IReadOnlyDictionary<string, string[]> customResults)
{
    foreach (var customColumn in customResults)
    {
        for (int i = 0; i < customColumn.Value.Length; ++i)
        {
            customColumn.Value[i] += " post-processed";
        }
    }
}
```
Is there any expectation to add this capability to BenchmarkDotNet?
With #2092 we made it possible to serialize any data to bytes and send it from the benchmark to the host process over an anonymous pipe, which finally makes it possible to implement this issue without a lot of hassle. Some hints for the contributor:

```csharp
[AdditionalMetrics(Target = nameof(MyBenchmark))]
public IReadOnlyDictionary<string, string> GetCustomResults()
    => new Dictionary<string, string>()
    {
        ["Compressed Size"] = _output.Length.ToString()
    };
```

The disadvantage is that we would need to introduce yet another attribute and implement the support for in-process and out-of-process toolchains. The alternative would be to extend `GlobalCleanup`:

```csharp
[GlobalCleanup(Target = nameof(MyBenchmark))]
public void GlobalCleanup(IHost host)
{
    Dictionary<string, string> metrics = new()
    {
        ["Compressed Size"] = _output.Length.ToString()
    };
    host.ReportMetrics(metrics);
    _someField.Dispose(); // what typical cleanup does
}
```

Another alternative, which is easier to implement but which we have never done in BDN, is exposing a new public static property and making `GlobalCleanup` use it:

```csharp
// Engine
Host.Instance = _host;
GlobalCleanup();
Host.Instance = null;

// GlobalCleanup:
Host.Instance.ReportMetrics(metrics);
```
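The anonymous-pipe hint ultimately comes down to agreeing on a byte layout for the metrics. A hedged sketch of what the payload serialization could look like (this is an assumption for illustration, not the actual #2092 wire format; the class name is mine):

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Text;

public static class MetricsPipePayload
{
    // Benchmark-process side: serialize metrics to the bytes that would be
    // written to the anonymous pipe (count, then name/value string pairs).
    public static byte[] Serialize(IReadOnlyDictionary<string, string> metrics)
    {
        using var stream = new MemoryStream();
        using (var writer = new BinaryWriter(stream, Encoding.UTF8, leaveOpen: true))
        {
            writer.Write(metrics.Count);
            foreach (var (name, value) in metrics)
            {
                writer.Write(name);
                writer.Write(value);
            }
        }
        return stream.ToArray();
    }

    // Host-process side: read the metrics back from the received bytes.
    public static Dictionary<string, string> Deserialize(byte[] payload)
    {
        using var reader = new BinaryReader(new MemoryStream(payload));
        var result = new Dictionary<string, string>();
        int count = reader.ReadInt32();
        for (int i = 0; i < count; i++)
            result[reader.ReadString()] = reader.ReadString();
        return result;
    }

    public static void Main()
    {
        byte[] payload = Serialize(new Dictionary<string, string> { ["Compressed Size"] = "1024" });
        Console.WriteLine(Deserialize(payload)["Compressed Size"]); // prints 1024
    }
}
```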
It seems dangerous to encourage users to use a public static property like `Host.Instance`. It may take more work, but I think the new attribute approach is the safest.
Hi All, […]
The issue is still up for grabs; I've provided hints for a potential contributor in the comment above: #784 (comment). Until somebody grabs it, implements it, and sends a PR, there will be no updates.
I might take a look at implementing this soon. Upon looking at your idea again, I kind of like this. But what would you think about exposing a new interface instead of `IHost`?

```csharp
public interface IMetricReporter
{
    void ReportMetrics(IEnumerable<CustomMetric> metrics);
}

public class CustomMetric
{
    public CustomMetric(string name, string value) { }
    public CustomMetric(string name, double value, UnitType unitType, string numberFormat = "0.##") { }
}

[GlobalCleanup(Target = nameof(MyBenchmark))]
public void GlobalCleanup(IMetricReporter metricReporter)
{
    var metrics = new[]
    {
        new CustomMetric("Compressed Size", _output.Length, UnitType.Dimensionless)
    };
    metricReporter.ReportMetrics(metrics);
}
```

This would allow us to extend functionality through the […].
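For the numeric `CustomMetric` overload above, the `numberFormat` argument would presumably be applied along these lines. This is a sketch of the rendering, not actual BenchmarkDotNet code; the demo class is mine:

```csharp
using System;
using System.Globalization;

public static class CustomMetricFormatDemo
{
    // Render a metric value with the suggested default numberFormat "0.##":
    // up to two fractional digits, trailing zeros trimmed.
    public static string Format(double value, string numberFormat = "0.##") =>
        value.ToString(numberFormat, CultureInfo.InvariantCulture);

    public static void Main()
    {
        Console.WriteLine(Format(1536.0 / 1024.0)); // prints 1.5
        Console.WriteLine(Format(3.14159));         // prints 3.14
    }
}
```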
I realized that […]. Also, this involves updating the code-gen, and I already have a lot of open PRs touching that, and I don't want to add more before they're merged, so I'm holding off on starting the implementation for now.
Interim solution for presenting custom table results: https://gist.github.com/stonstad/43e027ecc580612c5abd5e1fcf1e30d8
I'm writing a deserialization library that is basically taking a file stream and producing a `List<T>`: *(benchmark listing not preserved)*. And it goes on for a couple of versions.

However, the execution times don't make much sense, since they are a measurement over the total number of entries per file, which changes per version (I have no control over the data). It would make a lot more sense to display the average time needed to load one single element of each version.
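The per-element figure asked for here is just the measured mean divided by the element count. A trivial helper to make the intent concrete (the class and names are mine, not a BenchmarkDotNet API):

```csharp
using System;

public static class PerElementMean
{
    // Average time per element: total mean time divided by how many entries
    // the benchmark actually deserialized for that file version.
    public static double PerElementNanoseconds(double meanNanoseconds, int elementCount) =>
        meanNanoseconds / elementCount;

    public static void Main()
    {
        // e.g. a run that took 5,000,000 ns to load 1,000 entries
        Console.WriteLine(PerElementNanoseconds(5_000_000, 1_000)); // prints 5000
    }
}
```

A custom `IColumn` could compute exactly this, if only it had access to the element count produced by the benchmark, which is what this issue asks for.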
I'd love to be able to have an implementation of `IColumn` that can obtain the return value from a benchmark target and work on it. I have a local project that is basically a dumbed-down performance meter, but it has its limitations, namely in regard to results output; producing histograms is a chore.

Is there a way to do that, or am I out of luck? From giving the source a quick glance through ILSpy, I don't see much.