Research project investigating AI-driven approaches to financial stability monitoring
Our research team recently presented this work at the Consortium for Computing Sciences in Colleges (CCSC) Central Plains Conference. The presentation highlighted our novel approach to bank failure prediction using machine learning techniques and received valuable feedback from the academic community.
While this research is ongoing, we're excited to share our preliminary findings and methodological approach with the broader computer science and financial analytics communities.
We've developed a composite financial metric called HEALTH (Holistic Evaluation of Asset Liquidity, Transparency, and Holdover stability), which is defined as (Equity - Goodwill) / Total Assets. This ratio serves as a comprehensive proxy for bank financial well-being:
The HEALTH score captures core dimensions of banking performance:
Following Wheelock and Wilson (2000), banks falling below a HEALTH threshold of 0.02 are flagged as at-risk.
Unlike binary failure labels, this continuous target enables both granular risk assessments and earlier detection of deterioration trends. HEALTH is employed in both classification (thresholded) and regression (direct prediction) contexts, allowing for rich interpretability and dynamic modeling flexibility across varying regulatory needs.
Our approach aims to identify at-risk institutions up to six quarters before traditional warning signs appear, providing regulators crucial lead time for intervention.
Our research employs a multi-model comparative framework to identify the most effective techniques for bank failure prediction:
Benchmark statistical and tree-based models providing interpretability and baseline performance:
Temporal deep learning models capturing sequential patterns in financial data over time:
State-of-the-art neural network designs leveraging attention mechanisms and hybrid approaches:
Our preprocessing begins with z-score normalization and log-transforming skewed indicators to ensure distributional stability. We emphasize lower HEALTH ranges via exponential rescaling, improving model sensitivity to early signs of deterioration.
Class imbalance, critical in the rare-event nature of bank failures, is addressed with weighted loss functions and SMOTE augmentation techniques. We use a fixed four-quarter input window (specifically quarters 6-9 prior to the prediction quarter), allowing our models to make predictions six quarters in advance using a full year of historical financial data.
Recursive feature elimination combined with SHAP value analysis helps select high-impact, interpretable features without introducing overfitting or collinearity, resulting in a final set of 24 key indicators. Feature importance analyses highlight liquidity ratios (SHAP 0.23) and capital adequacy metrics (SHAP 0.19) as key predictors.
The ROC curve displayed highlights XGBoost's exceptional performance when enhanced with SMOTE (Synthetic Minority Over-sampling Technique) for bank failure prediction. With an outstanding Area Under Curve (AUC) of 0.963, the visualization demonstrates why XGBoost consistently outperformed other models in project.
The poster shown above is the poster used in the 2025 CCSC Student Poster Contest.
This project is a collaborative effort between:
Under the supervision of Dr. Eric Manley and Dr. Sean Severe at Drake University.
Our findings validate ML architectures as valuable tools for bank failure prediction. The HEALTH framework supports both risk flagging and continuous monitoring. XGBoost delivers superior classification (AUC 0.93) while transformer architectures (AUC 0.892) excel at capturing deterioration patterns.
Liquidity ratios and capital adequacy emerge as critical predictors. ML-enhanced systems can identify at-risk institutions up to six quarters before traditional warning signs, providing regulators crucial lead time for intervention.
Following the CCSC Conference presentation, our team is continuing to refine our methodologies and evaluate different model architectures to identify the most effective approach to bank failure prediction. Full methodological details and final results will be published following the completion of our research.
For more information about this ongoing research project, feel free to contact me at portfolio@colemanpagac.com.