Garbage in, Garbage out: A Theory-Driven Approach to Improve Data Handling in Supervised Machine Learning

Boise State University, USA
University of Texas San Antonio, USA
Texas Christian University, USA

Methods to Improve Our Field

ISBN: 978-1-80455-365-7, eISBN: 978-1-80455-364-0

Publication date: 18 January 2023

Abstract

Machine learning (ML) has recently gained momentum as a method for measurement in strategy research. Yet, little guidance exists regarding how to appropriately apply the method for this purpose in our discipline. We address this by offering a guide to the application of ML in strategy research, with a particular emphasis on data handling practices that should improve our ability to accurately measure our constructs of interest using ML techniques. We offer a brief overview of ML methodologies that can be used for measurement before describing key challenges that exist when applying those methods for this purpose in strategy research (i.e., sample sizes, data noise, and construct complexity). We then outline a theory-driven approach to help scholars overcome these challenges and improve data handling and the subsequent application of ML techniques in strategy research. We demonstrate the efficacy of our approach by applying it to create a linguistic measure of CEOs' motivational needs in a sample of S&P 500 firms. We conclude by describing steps scholars can take after creating ML-based measures to continue to improve the application of ML in strategy research.

Citation

Hyde, S.J., Bachura, E. and Harrison, J.S. (2023), "Garbage in, Garbage out: A Theory-Driven Approach to Improve Data Handling in Supervised Machine Learning", Hill, A.D., McKenny, A.F., O'Kane, P. and Paroutis, S. (Eds.) Methods to Improve Our Field (Research Methodology in Strategy and Management, Vol. 14), Emerald Publishing Limited, Leeds, pp. 101-132. https://doi.org/10.1108/S1479-838720220000014006

Publisher: Emerald Publishing Limited

Copyright © 2023 by Emerald Publishing Limited