
Report regression test coverage #253

Closed
MichaMans opened this issue Mar 29, 2019 · 12 comments · Fixed by #557 or #563 · May be fixed by #315

Comments

@MichaMans

What is the problem / Suggestion?

  • At the moment we do not provide any information about the testing coverage (as far as I saw it, please correct me if I'm wrong), but that would be a nice feature.

Why do we want to solve it?

  • It would be very helpful to provide some test coverage information. At first, I would start with coverage of the examples (how many examples exist in the tested library compared to how many example tests exist). That would at least tell us whether every provided example is actually run.

How do we want to solve it?

  • by integrating a get_test_example_coverage function that works on the whole library as well as on single packages. With that, everyone is free to use it within their individual Repo/bin/runUnitTests.py script. The result could look like the following:
    [screenshot: proposed coverage output]
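
A minimal sketch of how such a function could work is given below. It assumes that every tested example has a .mos script of the same name under Resources/Scripts/Dymola; the function name mirrors the proposal, but this is not existing BuildingsPy code.

import glob
import os

def get_test_example_coverage(library_root):
    # Illustrative sketch, not the actual BuildingsPy implementation.
    # Collect all models that live in an Examples or Validation package.
    examples = [f for f in glob.glob(os.path.join(library_root, "**", "*.mo"), recursive=True)
                if os.sep + "Examples" + os.sep in f or os.sep + "Validation" + os.sep in f]
    # Collect the regression scripts that the CI actually runs.
    script_dir = os.path.join(library_root, "Resources", "Scripts", "Dymola")
    scripts = {os.path.splitext(os.path.basename(f))[0]
               for f in glob.glob(os.path.join(script_dir, "**", "*.mos"), recursive=True)}
    # Matching on the file name alone is a simplification; a real implementation
    # should compare the full package path to avoid name clashes.
    tested = [f for f in examples if os.path.splitext(os.path.basename(f))[0] in scripts]
    coverage = 100.0 * len(tested) / len(examples) if examples else 100.0
    print("Coverage: {:.2f} %".format(coverage))
    print("You are testing {} out of {} examples.".format(len(tested), len(examples)))
    for f in sorted(set(examples) - set(tested)):
        print("Not tested: " + os.path.relpath(f, library_root))
    return coverage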

@thorade @mwetter @Mathadon what do you think? Any objections, additions?

Maybe related to #245

@thorade
Contributor

thorade commented Mar 29, 2019

Very nice!

@MichaMans
Author

MichaMans commented Mar 29, 2019

I tested it with the following statement in runUnitTests.py

ut.get_test_example_coverage()

This leads to the following result in our GitLab CI (with the coverage regex Coverage:\s+\d\d.\d\d+\s+\%$):
[screenshot: coverage result shown in GitLab CI]

So it's working for our CI workflow. I'll make a PR if you think it's a good addition.
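
For reference, one way to pick that number up in GitLab CI is the job-level coverage keyword; a rough sketch, with the job name and script invocation as placeholders:

# .gitlab-ci.yml (sketch)
regression-tests:
  script:
    - ./bin/runUnitTests.py
  coverage: '/Coverage:\s+\d\d.\d\d+\s+\%$/'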

@mwetter
Member

mwetter commented Mar 29, 2019

@MichaMans : Let's discuss in Aachen how you run your tests and what the workflow/use case is. We run our tests by starting multiple CI jobs on Travis so that we can use multiple instances in parallel, each running a few packages. In this case, coverage would never show 100%, even though collectively we run all tests.
Antoine will also show a significant refactoring of the test reporting in which your requirement may be addressed.

@MichaMans
Author

@mwetter yes, sure, let's discuss in Aachen. We are using a very similar setup, so we can refactor the feature so that it is usable in a general way, also in the Travis CI/coverage approach (there I might need some help). A solution for the parallel setup might be to configure a coverage-only job and use it for the overall coverage result. Antoine's work looks very promising too, but it might not cover the GitLab/Travis coverage-display features.

@mwetter
Member

mwetter commented Nov 6, 2019

@MichaMans : I moved your code to the branch issue253_testexamplescoverage.

However, it does not seem to be doing the right thing. For example,

$ ../bin/runUnitTests.py -s IBPSA.Controls.Discrete
Regression tests are only run for the following package:
  IBPSA.Controls.Discrete
***

Coverage:  7%
***

You are testing :  1  out of  15 total examples in 
Controls


***

The following examples are not tested

/Controls/Continuous/Examples/OffTimer.mo
/Controls/Continuous/Examples/SignalRanker.mo
/Controls/Continuous/Examples/PIDHysteresis.mo
/Controls/Continuous/Examples/LimPIDWithReset.mo
/Controls/Continuous/Examples/PIDHysteresisTimer.mo
/Controls/Continuous/Examples/LimPID.mo
/Controls/Continuous/Examples/NumberOfRequests.mo
/Controls/Continuous/Validation/LimPIDReset.mo
/Controls/Continuous/Validation/OffTimerNonZeroStart.mo
/Controls/SetPoints/Examples/OccupancySchedule.mo
/Controls/SetPoints/Examples/Table.mo
/Controls/SetPoints/Examples/HotWaterTemperatureReset.mo
/Controls/SetPoints/Validation/OccupancyScheduleNegativeStartTime.mo
/Controls/SetPoints/Validation/OccupancySchedulePositiveStartTime.mo
Using 1 of 48 processors to run unit tests for dymola.
Number of models   : 369
          blocks   : 110
          functions: 119
Generated 1 regression tests.

Comparison files output by funnel are stored in the directory 'funnel_comp' of size 0.0 MB.
Run 'report' method of class 'Tester' to access a summary of the comparison results.

Script that runs unit tests had 0 warnings and 0 errors.

I asked it to test IBPSA.Controls.Discrete, so I should not be informed about not having run tests in IBPSA.Controls.Continuous. Also, it is not clear what is meant by not running an example. Each example has a tolerance, a start time and a run script, which are verified and used to identify what an "Example" is. Then, each is run, unless it is excluded in Resources/Scripts/BuildingsPy/conf.json. Wouldn't these entries show what the coverage is (which is already written to the console, just not in percent), or do I misunderstand what you consider an "Example"?

@MichaMans
Author

MichaMans commented Nov 6, 2019

@mwetter Thanks for moving it. I'll have a look at it again. You are right, it does not seem to be working correctly. Just for clarification, this is what it should do:

  • look up all example files within a subpackage or the whole library
  • look up all "test scripts" for the models/examples tested within the CI
  • compare these two numbers, which indicates how many examples are automatically tested compared to the total number of examples available (see the sketch after this list). If there is already a number that indicates this fraction, I missed it.
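
A rough sketch of that comparison, restricted to a single subpackage so that, for example, -s IBPSA.Controls.Discrete only reports on that package; the helper name and paths are assumptions, not existing BuildingsPy code:

import glob
import os

def example_coverage_of_package(library_root, package):
    # Illustrative only: 'package' is given relative to the library root,
    # e.g. "Controls.Discrete".
    sub = package.replace(".", os.sep)

    def n_example_files(root, ext):
        # Count files with the given extension that sit in an Examples or Validation package.
        return sum(1 for f in glob.glob(os.path.join(root, "**", "*" + ext), recursive=True)
                   if os.sep + "Examples" + os.sep in f or os.sep + "Validation" + os.sep in f)

    n_models = n_example_files(os.path.join(library_root, sub), ".mo")
    n_scripts = n_example_files(os.path.join(library_root, "Resources", "Scripts", "Dymola", sub), ".mos")
    coverage = 100.0 * n_scripts / n_models if n_models else 100.0
    print("Coverage: {:.2f} %  ({} of {} examples in {} have a test script)".format(
        coverage, n_scripts, n_models, package))
    return coverage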

Do you generally agree that, if this works correctly, it would be a useful addition to BuildingsPy?

@mwetter
Member

mwetter commented Nov 6, 2019

@MichaMans : I am still struggling a bit with the exact use case: when would you not have 100% "coverage"? I would need to dig in again to see how exactly we recognize a model as an "Example". I think the test would be whether an example is somehow excluded from the tests, either because it is listed in the json file, or because the experiment annotation or .mos script is missing. But wouldn't the latter be considered an error rather than missing coverage?

Also, "coverage" is in my view misleading: If you have for example a MixingVolume, and only one test that tests its use a a dynamic mixing volume, you did not cover the equations that would be used if it were configured as steady-state. We should therefore think if there is a better term.

@MichaMans
Author

@mwetter I see your last point and would generally agree with it. We could definitely discuss and maybe find a "real" coverage test for Modelica.

Regarding your first point: I would agree that for modelica-ibpsa the coverage is maybe always 100% 😃, but it is definitely not for AixLib, and I have no idea what the situation is for the other libraries. Speaking for AixLib: we provide a lot of examples of how models work or are used in, for example, a system context, but these do not yet have a test script and are therefore not tested within the CI. That is why such a feature is useful in our case.

@mwetter
Member

mwetter commented Nov 6, 2019

I see. Then it would be good to flag these examples in your CI testing with some "coverage" metrics.

@thorade
Contributor

thorade commented Nov 7, 2019

Also, "coverage" is in my view misleading: If you have for example a MixingVolume, and only one test that tests its use a a dynamic mixing volume, you did not cover the equations that would be used if it were configured as steady-state. We should therefore think if there is a better term.

I agree, test coverage would usually mean the ratio (tested models and variants) / (all existing models and variants).

@MichaMans
Author

@mwetter and @thorade I agree with you. I'll change the naming and implement a working check for single packages. As a suggestion, I would name it "Example-Models-Coverage"?

@mwetter
Member

mwetter commented Nov 8, 2019

@MichaMans : I would call it "Models-Coverage" as it also includes models in Validation.
