The Truth, the Whole Truth, and Nothing but the Truth: A Pragmatic Guide to Assessing Empirical Evaluations

Blackburn, Stephen M and Diwan, Amer and Hauswirth, Mattias and Sweeney, Peter F and Amaral, Jose Nelson and Brecht, Tim and Bulej, Lubomr and Click, Cliff and Eeckhout, Lieven and Fischmeister, Sebastian and Frampton, Daniel and Hendren, Laurie J and Hind, Michael and Hosking, Antony L and Jones, Richard E. and Kalibera, Tomas and Keynes, Nathan and Nystrom, Nathaniel and Zeller, Andreas (2016) The Truth, the Whole Truth, and Nothing but the Truth: A Pragmatic Guide to Assessing Empirical Evaluations. Transactions on Programming Languages and Systems (TOPLAS), 38 (4). ISSN 0164-0925. E-ISSN 1558-4593. (doi:https://doi.org/10.1145/2983574) (Full text available)

PDF - Author's Accepted Manuscript
Download (434kB) Preview
[img]
Preview

Abstract

An unsound claim can misdirect a field, encouraging the pursuit of unworthy ideas and the abandonment of promising ideas. An inadequate description of a claim can make it difficult to reason about the claim, for example to determine whether the claim is sound. Many practitioners will acknowledge the threat of un- sound claims or inadequate descriptions of claims to their field. We believe that this situation is exacerbated and even encouraged by the lack of a systematic approach to exploring, exposing, and addressing the source of unsound claims and poor exposition. This paper proposes a framework that identifies three sins of reasoning that lead to unsound claims and two sins of exposition that lead to poorly described claims. Sins of exposition obfuscate the objective of determining whether or not a claim is sound, while sins of reasoning lead directly to unsound claims. Our framework provides practitioners with a principled way of critiquing the integrity of their own work and the work of others. We hope that this will help individuals conduct better science and encourage a cultural shift in our research community to identify and promulgate sound claims.

Item Type: Article
Uncontrolled keywords: Performance evaluation (efficiency and effectiveness); experimental evaluation; observation study; experimentation
Subjects: Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming,
Divisions: Faculties > Sciences > School of Computing > Programming Languages and Systems Group
Depositing User: Richard Jones
Date Deposited: 28 Apr 2016 12:39 UTC
Last Modified: 14 Nov 2016 14:08 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/55171 (The current URI for this page, for reference purposes)
Jones, Richard E.: https://orcid.org/0000-0002-8159-0297
  • Depositors only (login required):

Downloads

Downloads per month over past year