Deep Learning-Based Multimodal System for Early Detection Of Livestock Diseases Using Image and Audio Fusion

Authors

  • Vedantika Mane UG Student, Department of Computer Science & Engineering, Karmaveer Bhaurao Patil College of Engineering, Satara, Maharashtra, India Author
  • Rutuja Toraskar Assistent Profesor, Department of Computer Science & Engineerig, Karmaveer Bhaurao Patil College of Engineering, Satara, Maharashtra, India Author
  • Omprakash Todkari UG Student, Department of Computer Science & Engineering, Karmaveer Bhaurao Patil College of Engineering, Satara, Maharashtra, India Author
  • Aniruddha Kshirsagar UG Student, Department of Computer Science & Engineering, Karmaveer Bhaurao Patil College of Engineering, Satara, Maharashtra, India Author

DOI:

https://doi.org/10.47392/IRJAEM.2026.0329

Keywords:

Deep Learning, CNN, BiLSTM, Multimodal Learning, Animal Disease Detection, Computer Vision, Audio Analysis, Artificial Intelligence

Abstract

Livestock health monitoring is one of the most crucial yet underserved areas in agriculture. Timely detection of diseases can prevent economic losses and safeguard food security. Manual observation methods are subjective, time-consuming, and error-prone. This paper proposes a Deep Learning-Based Multimodal Framework that combines image and audio data to detect animal diseases automatically. The proposed system utilizes Convolutional Neural Networks (CNN) for visual analysis of disease symptoms and Bidirectional Long ShortTerm Memory (BiLSTM) networks for acoustic pattern recognition. Feature-level fusion integrates both modalities to improve accuracy and robustness. The system is designed as a costeffective, scalable, and fully software-based solution suitable for deployment on local or cloudbased platforms. Theoretical analysis suggests that the proposed framework can achieve high accuracy in classifying diseases using only vision and sound modalities. 

Downloads

Download data is not yet available.

Downloads

Published

2026-06-09