- A new hybrid model framework is built to solve the problem of predicting the financial distress of listed Chinese companies using unbalanced data.
- This novel hybrid model framework is developed based on logistic regression and backpropagation neural networks combined with a security level SMOTE.
- We get 19 important characteristics for the prediction of financial distress, which cover six categories, including solvency, profitability, leverage, operational capacity, growth capacity and leaders.
When predicting financial hardship, an unbalanced business dataset can lead to overfitting of the majority class and lead to poor classifier performance. The problem of classification with unbalanced data is therefore a realistic and critical problem. In this paper, a new hybrid model framework is constructed to solve the problem of predicting the financial distress of listed Chinese companies using unbalanced data. This framework is developed based on logistic regression and backpropagation neural networks combined with the safe level synthetic minority oversampling technique. We validate the model on a dataset of Chinese listed companies and compare the proposed model with seven benchmark models. The results confirm that the proposed model has superior performance. In addition, we find 19 important characteristics that significantly influence the financial distress of Chinese listed companies.