To read this content please select one of the options below:

Comparison of machine learning predictions of subjective poverty in rural China

Lucie Maruejols (University of Göttingen, Göttingen, Germany)
Hanjie Wang (Southwest University, Chongqing, China)
Qiran Zhao (College of Economics and Management, China Agricultural University, Beijing, China)
Yunli Bai (Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China) (UNEP-International Ecosystem Management Partnership, Beijing, China)
Linxiu Zhang (Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China) (UNEP-International Ecosystem Management Partnership, Beijing, China)

China Agricultural Economic Review

ISSN: 1756-137X

Article publication date: 9 September 2022

Issue publication date: 2 May 2023

632

Abstract

Purpose

Despite rising incomes and reduction of extreme poverty, the feeling of being poor remains widespread. Support programs can improve well-being, but they first require identifying who are the households that judge their income is insufficient to meet their basic needs, and what factors are associated with subjective poverty.

Design/methodology/approach

Households report the income level they judge is sufficient to make ends meet. Then, they are classified as being subjectively poor if their own monetary income is inferior to the level they indicated. Second, the study compares the performance of three machine learning algorithms, the random forest, support vector machines and least absolute shrinkage and selection operator (LASSO) regression, applied to a set of socioeconomic variables to predict subjective poverty status.

Findings

The random forest generates 85.29% of correct predictions using a range of income and non-income predictors, closely followed by the other two techniques. For the middle-income group, the LASSO regression outperforms random forest. Subjective poverty is mostly associated with monetary income for low-income households. However, a combination of low income, low endowment (land, consumption assets) and unusual large expenditure (medical, gifts) constitutes the key predictors of feeling poor for the middle-income households.

Practical implications

To reduce the feeling of poverty, policy intervention should continue to focus on increasing incomes. However, improvements in nonincome domains such as health expenditure, education and family demographics can also relieve the feeling of income inadequacy. Methodologically, better performance of either algorithm depends on the data at hand.

Originality/value

For the first time, the authors show that prediction techniques are reliable to identify subjective poverty prevalence, with example from rural China. The analysis offers specific attention to the modest-income households, who may feel poor but not be identified as such by objective poverty lines, and is relevant when policy-makers seek to address the “next step” after ending extreme poverty. Prediction performance and mechanisms for three machine learning algorithms are compared.

Keywords

Acknowledgements

This research was supported by the National Social Science Foundation of China (22CJY037, 20CSH048, 20AZD080, 21ZDA062), the Humanities and Social Sciences Project Funded by the Chinese Ministry of Education (21YJC790110), the Social Science Foundation of Chongqing (2022YC004), and the Innovation Research 2035 Pilot Plan of Southwest University (SWUPilotPlan026).

Citation

Maruejols, L., Wang, H., Zhao, Q., Bai, Y. and Zhang, L. (2023), "Comparison of machine learning predictions of subjective poverty in rural China", China Agricultural Economic Review, Vol. 15 No. 2, pp. 379-399. https://doi.org/10.1108/CAER-03-2022-0051

Publisher

:

Emerald Publishing Limited

Copyright © 2022, Emerald Publishing Limited

Related articles