100% code coverage is near-meaningless - but is there a good measure to use?

vampatori@feddit.uk · 1 year ago

100% code coverage is near-meaningless - but is there a good measure to use?

SorteKanin@feddit.dk · 1 year ago

But is there any accepted means of formally measuring a system and ensuring that some level of test quality exists?

Formally? No, this is basically impossible by Rice’s Theorem. There is not even a guarantee that if you have 100% test coverage, the program is good (the tests could be flawed).

This is just a natural limitation of turing completeness. You can’t decide these properties while also having full computational power. In order to decide such things, you need a less powerful mode of computation (something not turing complete) that can be analyzed more thoroughly and with more guarantees.

vampatori@feddit.uk · 1 year ago

That makes sense, thank you. Yes, it’s specifically “test quality” I’m looking to measure, as 100% coverage is effectively meaningless if the tests are poor.

SorteKanin@feddit.dk · 1 year ago

Yea I’m afraid the only real way to “measure” that is to read through the tests and the code and make a good ol human value judgement on the state of the code and tests. But it won’t give you a number.