tool to determine measurement error
Related to: resolver-benchmarking#22
Values in the various diffsum report fields differ in "stability", and it is hard to tell with the naked eye whether two consecutive reports indicate a statistically significant change.
We need a tool that determines the "stability" of each report field; these numbers can then be used as input for resolver-benchmarking#22.
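A rough sketch of what such a tool might compute, assuming "stability" is taken to mean the spread of a field across repeated runs against an unchanged setup. The input shape (one flat dict of numeric fields per report), the field names, and the helpers `field_stability` and `is_significant` are all hypothetical and not part of diffsum; the `k`-standard-deviations threshold is only a placeholder for a proper statistical test.

```python
import statistics
from typing import Dict, List


def field_stability(reports: List[Dict[str, float]]) -> Dict[str, Dict[str, float]]:
    """Estimate per-field spread from repeated reports of the same setup.

    Assumption: each report is a flat mapping of field name to number.
    """
    if len(reports) < 2:
        raise ValueError("need at least two reports to estimate spread")

    # Only consider fields that are present in every report.
    fields = set(reports[0])
    for report in reports[1:]:
        fields &= set(report)

    result = {}
    for name in sorted(fields):
        values = [report[name] for report in reports]
        mean = statistics.mean(values)
        result[name] = {
            "mean": mean,
            # Sample standard deviation across the repeated runs.
            "stdev": statistics.stdev(values),
            # Largest observed distance from the mean.
            "max_dev": max(abs(v - mean) for v in values),
        }
    return result


def is_significant(old: float, new: float, stdev: float, k: float = 3.0) -> bool:
    """Flag a change larger than k standard deviations (k is an assumed threshold)."""
    return abs(new - old) > k * stdev


# Hypothetical field names, for illustration only.
baseline = [
    {"upstream_failed": 10, "target_disagrees": 5},
    {"upstream_failed": 12, "target_disagrees": 5},
    {"upstream_failed": 14, "target_disagrees": 5},
]
stability = field_stability(baseline)
```

The resulting per-field `stdev` values are the kind of numbers that could be fed into resolver-benchmarking#22 as tolerances when comparing two consecutive reports.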