Privacy-Aware Explainable AI Framework for Multi-Modal Big Data Analytics in Real-Time Payment Fraud Detection and Pharmaceutical Network Intelligence

Authors

  • João Felipe Ribeiro Machado Alves Independent Researcher, Brazil Author

DOI:

https://doi.org/10.15662/IJRAI.2025.0806815

Keywords:

Explainable AI, differential privacy, real-time analytics, payment fraud detection, pharmaceutical network intelligence, multimodal machine learning, counterfactuals, privacy-preserving ML

Abstract

Financial fraud in real-time payment systems and intelligence extraction from pharmaceutical networks both rely on fast, accurate, and trustworthy machine learning systems that consume increasingly large and heterogeneous data. This paper proposes a unified, privacy-aware, explainable AI (XAI) framework designed for multi-modal big data analytics that simultaneously addresses latency constraints, regulatory privacy requirements, and the need for interpretable decisions in high-stakes domains. The framework integrates streaming data preprocessing, a modular ensemble of modality-specialized encoders (transactional sequences, device telemetry, text logs, and molecular/biological networks), privacy-preserving computations (differential privacy, secure aggregation, and selective homomorphic operations), and a two-tier explanation system combining local exemplars (counterfactuals and feature attributions) with global concept-based explanations. We evaluate the framework in two case studies: (1) real-time payment fraud detection on a synthetic but realistic streaming dataset reflecting imbalanced classes, latency constraints, and adversarial noise; (2) pharmaceutical network intelligence for drug–target interaction prioritization using multi-omics and literature-mined relations. Results show that the proposed architecture achieves competitive detection and prioritization performance while producing explanations that improve analyst trust and comply with privacy budgets. We conclude with deployment considerations, limitations, and a roadmap for future research bridging privacy, interpretability, and multi-modal real-time analytics.

References

1. Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. International Conference on Learning Representations (ICLR).

2. Poornima, G., & Anand, L. (2025). Medical image fusion model using CT and MRI images based on dual scale weighted fusion based residual attention network with encoder-decoder architecture. Biomedical Signal Processing and Control, 108, 107932.

3. Caleb, D. A. M. (2025). AI-Driven Smart Fabric Provisioning: Transforming Network Automation through Intelligent Orchestration and Dynamic Testing. Journal of Computer Science and Technology Studies, 7(3), 783-790.

4. Achari, A. P. S. K., & Sugumar, R. (2024, November). Performance analysis and determination of accuracy using machine learning techniques for naive bayes and random forest. In AIP Conference Proceedings (Vol. 3193, No. 1, p. 020199). AIP Publishing LLC.

5. Sridhar Reddy Kakulavaram, Praveen Kumar Kanumarlapudi, Sudhakara Reddy Peram. (2024). Performance Metrics and Defect Rate Prediction Using Gaussian Process Regression and Multilayer Perceptron. International Journal of Information Technology and Management Information Systems (IJITMIS), 15(1), 37-53.

6. Adari, V. K., Chunduru, V. K., Gonepally, S., Amuda, K. K., & Kumbum, P. K. (2024). Artificial Neural Network in Fibre-Reinforced Polymer Composites using ARAS method. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 7(2), 9801-9806.

7. Vinay Kumar Ch, Srinivas G, Kishor Kumar A, Praveen Kumar K, Vijay Kumar A. (2021). Real-time optical wireless mobile communication with high physical layer reliability Using GRA Method. J Comp Sci Appl Inform Technol. 6(1): 1-7. DOI: 10.15226/2474-9257/6/1/00149

8. Suchitra, R. (2023). Cloud-Native AI model for real-time project risk prediction using transaction analysis and caching strategies. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 6(1), 8006–8013. https://doi.org/10.15662/IJRPETM.2023.0601002

9. Nagarajan, G. (2024). Cloud-Integrated AI Models for Enhanced Financial Compliance and Audit Automation in SAP with Secure Firewall Protection. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 7(1), 9692-9699.

10. Islam, M. S., Shokran, M., & Ferdousi, J. (2024). AI-Powered Business Analytics in Marketing: Unlock Consumer Insights for Competitive Growth in the US Market. Journal of Computer Science and Technology Studies, 6(1), 293-313.

11. Kandula, N. (2025). FALCON 2.0 SNAPPY REPORTS A NOVEL TOPSIS-DRIVEN APPROACH FOR REAL-TIME MULTI-ATTRIBUTE DECISION ANALYSIS. International Journal of Computer Engineering and Technology. https://d1wqtxts1xzle7.cloudfront.net/123658421/IJCET_16_03_025-libre.pdf?1751969013=&response-content-disposition=inline%3B+filename%3DFALCON_2_0_SNAPPY_REPORTS_A_NOVEL_TOPSIS.pdf&Expires=1764704374&Signature=F-ej5AhUV~5MWbQdQfYNUSst601RwN9WDWlsZFU4FH~jDQ2N1dKwm5WQm7pnc1-o~Rj8iCjkl-4RSyPEYVhORjpm0uN5jUapMX0WnN~LdFw4EYZ3vUUAZFSEymAWyUy~LuVck7FfwgF5odMg4joyb2-dfqVX9kVI8s4e0E7W6tXcmKVuR7oEIN9s4uaVuiUAk7jjD-ExXtEYM156cPXKQOu1JoULT85mGb0qJVkp5gpOIbsAVht6UT15DHJA7h4Op9Amlz-hBtMy0Jzz4vC7~TJ08RWWFDzKU-xmuMLWzjnKWvAmvf6yot5Ow~JwC2vkPVTN-dAPAP38YGRTYKAYsA__&Key-Pair-Id=APKAJLOHF5GGSLRBV4ZA

12. Kiran, A., Rubini, P., & Kumar, S. S. (2025). Comprehensive review of privacy, utility and fairness offered by synthetic data. IEEE Access.

13. Arora, Anuj. "Detecting and Mitigating Advanced Persistent Threats in Cybersecurity Systems." Science, Technology and Development, vol. XIV, no. III, Mar. 2025, pp. 103–117.

14. Ratnala, A. K., Inampudi, R. K., & Pichaimani, T. (2024). Evaluating time complexity in distributed big data systems: A case study on the performance of hadoop and apache spark in large-scale data processing. J Artif Intell Res Appl, 4(1), 732-773.

15. Singh, S. K. (2025). Identification of Key Opinion Leaders in Pharmaceuticals Using Network Analysis. Journal Of Multidisciplinary, 5(7), 18-26.

16. Vasugi, T. (2023). AI-empowered neural security framework for protected financial transactions in distributed cloud banking ecosystems. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 6(2), 7941-7950.

17. Mahajan, A. S. (2025). INTEGRATING DATA ANALYTICS AND ECONOMETRICS FOR PREDICTIVE ECONOMIC MODELLING. International Journal of Applied Mathematics, 38(2s), 1450-1462.

18. Vijayaboopathy, V., Mathur, T., & Selvaraj, G. S. (2025). Generative AI Documentation of Dynamic IT Architectures. Newark Journal of Human-Centric AI and Robotics Interaction, 5, 178-214.

19. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778.

20. Hopkins, A. L. (2008). Network pharmacology: the next paradigm in drug discovery. Nature Chemical Biology, 4(11), 682–690.

21. Gopalan, R., Viswanathan, G., Roy, D., & Satheesh, A. (2025). Integrating Multi-Modal Knowledge Sources: A Comprehensive Tool for AS/400 Legacy System Knowledge Transition and Business Process Documentation. International Journal of Emerging Trends in Computer Science and Information Technology, 209-219.

22. McMahan, H. B., Moore, E., Ramage, D., Hampson, S., & Arcas, B. A. Y. (2017). Communication-efficient learning of deep networks from decentralized data. Artificial Intelligence and Statistics (AISTATS).

23. Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why should I trust you?": Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 1135–1144.

24. Thangavelu, K., Muthusamy, P., & Das, D. (2024). Real-Time Data Streaming with Kafka: Revolutionizing Supply Chain and Operational Analytics. Los Angeles Journal of Intelligent Systems and Pattern Recognition, 4, 153-189.

25. Shokri, R., Stronati, M., Song, C., & Shmatikov, V. (2017). Membership inference attacks against machine learning models. IEEE Symposium on Security and Privacy (SP).

26. Soroush, H., et al. (2020). [Placeholder for domain-specific paper on streaming fraud detection]. (Used for conceptual alignment—replace with project-specific dataset citation in a final manuscript.)

27. Althati, C., Rambabu, V. P., & Devan, M. (2023). Big Data Integration in the Insurance Industry: Enhancing Underwriting and Fraud Detection. Journal of Computational Intelligence and Robotics, 3(1), 123-162.

28. Md Manarat Uddin, M., Rahanuma, T., & Sakhawat Hussain, T. (2025). Privacy-Aware Analytics for Managing Patient Data in SMB Healthcare Projects. International Journal of Informatics and Data Science Research, 2(10), 27-57.

29. Sivaraju, P. S. (2023). Thin client and service proxy architectures for X systems in distributed operations. International Journal of Advanced Research in Computer Science & Technology, 6(6), 9510–9515.

30. Kumar, R. K. (2024). Real-time GenAI neural LDDR optimization on secure Apache–SAP HANA cloud for clinical and risk intelligence. IJEETR, 8737–8743. https://doi.org/10.15662/IJEETR.2024.0605006

31. Muthusamy, M. (2024). Cloud-Native AI metrics model for real-time banking project monitoring with integrated safety and SAP quality assurance. International Journal of Research and Applied Innovations (IJRAI), 7(1), 10135–10144. https://doi.org/10.15662/IJRAI.2024.0701005

32. Kusumba, S. (2025). Empowering Federal Efficiency: Building an Integrated Maintenance Management System (Imms) Data Warehouse for Holistic Financial And Operational Intelligence. Journal Of Multidisciplinary, 5(7), 377-384.

33. Sukla, R. R. (2025). The Evolution of AI in Software Quality and Cloud Management: A Framework for Autonomous Systems. Journal of Computer Science and Technology Studies, 7(6), 353-359.

34. Akhtaruzzaman, K., Md Abul Kalam, A., Mohammad Kabir, H., & KM, Z. (2024). Driving US Business Growth with AI-Driven Intelligent Automation: Building Decision-Making Infrastructure to Improve Productivity and Reduce Inefficiencies. American Journal of Engineering, Mechanics and Architecture, 2(11), 171-198. http://eprints.umsida.ac.id/16412/1/171-198%2BDriving%2BU.S.%2BBusiness%2BGrowth%2Bwith%2BAI-Driven%2BIntelligent%2BAutomation.pdf

35. Kiran, A., & Kumar, S. A methodology and an empirical analysis to determine the most suitable synthetic data generator. IEEE Access 12, 12209–12228 (2024).

36. Balaji, K. V., & Sugumar, R. (2023, December). Harnessing the Power of Machine Learning for Diabetes Risk Assessment: A Promising Approach. In 2023 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI) (pp. 1-6). IEEE.

37. Archana, R., & Anand, L. (2025). Residual u-net with Self-Attention based deep convolutional adaptive capsule network for liver cancer segmentation and classification. Biomedical Signal Processing and Control, 105, 107665.

38. Ramakrishna, S. (2022). AI-augmented cloud performance metrics with integrated caching and transaction analytics for superior project monitoring and quality assurance. International Journal of Engineering & Extended Technologies Research (IJEETR), 4(6), 5647–5655. https://doi.org/10.15662/IJEETR.2022.0406005

39. Zaharia, M., et al. (2013). Discretized streaming and micro-batch processing (Spark Streaming). Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing.

Downloads

Published

2025-12-01

How to Cite

Privacy-Aware Explainable AI Framework for Multi-Modal Big Data Analytics in Real-Time Payment Fraud Detection and Pharmaceutical Network Intelligence. (2025). International Journal of Research and Applied Innovations, 8(Special Issue 1), 86-95. https://doi.org/10.15662/IJRAI.2025.0806815