Comparison of machine learning predictions of subjective poverty in rural China
China Agricultural Economic Review
ISSN: 1756-137X
Article publication date: 9 September 2022
Issue publication date: 2 May 2023
Abstract
Purpose
Despite rising incomes and reduction of extreme poverty, the feeling of being poor remains widespread. Support programs can improve well-being, but they first require identifying who are the households that judge their income is insufficient to meet their basic needs, and what factors are associated with subjective poverty.
Design/methodology/approach
Households report the income level they judge is sufficient to make ends meet. Then, they are classified as being subjectively poor if their own monetary income is inferior to the level they indicated. Second, the study compares the performance of three machine learning algorithms, the random forest, support vector machines and least absolute shrinkage and selection operator (LASSO) regression, applied to a set of socioeconomic variables to predict subjective poverty status.
Findings
The random forest generates 85.29% of correct predictions using a range of income and non-income predictors, closely followed by the other two techniques. For the middle-income group, the LASSO regression outperforms random forest. Subjective poverty is mostly associated with monetary income for low-income households. However, a combination of low income, low endowment (land, consumption assets) and unusual large expenditure (medical, gifts) constitutes the key predictors of feeling poor for the middle-income households.
Practical implications
To reduce the feeling of poverty, policy intervention should continue to focus on increasing incomes. However, improvements in nonincome domains such as health expenditure, education and family demographics can also relieve the feeling of income inadequacy. Methodologically, better performance of either algorithm depends on the data at hand.
Originality/value
For the first time, the authors show that prediction techniques are reliable to identify subjective poverty prevalence, with example from rural China. The analysis offers specific attention to the modest-income households, who may feel poor but not be identified as such by objective poverty lines, and is relevant when policy-makers seek to address the “next step” after ending extreme poverty. Prediction performance and mechanisms for three machine learning algorithms are compared.
Keywords
Acknowledgements
This research was supported by the National Social Science Foundation of China (22CJY037, 20CSH048, 20AZD080, 21ZDA062), the Humanities and Social Sciences Project Funded by the Chinese Ministry of Education (21YJC790110), the Social Science Foundation of Chongqing (2022YC004), and the Innovation Research 2035 Pilot Plan of Southwest University (SWUPilotPlan026).
Citation
Maruejols, L., Wang, H., Zhao, Q., Bai, Y. and Zhang, L. (2023), "Comparison of machine learning predictions of subjective poverty in rural China", China Agricultural Economic Review, Vol. 15 No. 2, pp. 379-399. https://doi.org/10.1108/CAER-03-2022-0051
Publisher
:Emerald Publishing Limited
Copyright © 2022, Emerald Publishing Limited