Exploring a Hybrid Deep Learning Approach for Anomaly Detection in Mental Healthcare Provider Billing: what it means for business leaders
A 2025 study shows how pairing Isolation-Forest pseudo-labels with LSTM-Transformer ensembles flags fraudulent mental-health claims earlier, giving insurers and compliance teams an efficient, scalable fraud-screening tool.
1. What the method is
The pipeline creates pseudo-labels with Isolation Forest and autoencoders, then trains an LSTM and a Transformer on these labels. A gradient-boosted meta-learner fuses their outputs to score each billing sequence’s fraud risk.
2. Why the method was developed
Manual fraud labels are scarce, biasing supervised models toward legitimate claims. The hybrid approach enlarges the minority class automatically and exploits both temporal patterns (LSTM) and cross-field interactions (Transformer) to surface new scams.
3. Who should care
- Special-investigation units at health insurers
- Revenue-cycle & compliance managers at provider groups
- Government auditors policing over-billing
- AI product leads building payment-integrity tools
4. How the method works
Cleaned claim tables are windowed into 50-record sequences. Outlier scorers flag the top 0.4 % as anomalies. These pseudo-labels train an LSTM and a Transformer; their logits feed a boosted tree meta-classifier that outputs final fraud probabilities.
5. How it was evaluated
On two Dutch datasets (71 k declarations, 1.59 M operations) with 80 / 20 splits, models were judged by recall, precision, F1, and AUROC. Five-fold validation set hyper-parameters.
6. How it performed
Declaration-level recall hit 96.3 % (F1 0.84). At the operation level, the full hybrid reached 74.4 % recall, a 10-point gain over the best single model, enabling earlier, broader fraud capture. (Source: arXiv 2507.01924, 2025)
← Back to dossier index