Classify Bird Species Audio by Augment Convolutional Neural Network


Jasim H. A., Ahmed S. R., Ibrahim A. A., Duru A. D.

4th International Congress on Human-Computer Interaction, Optimization and Robotic Applications, HORA 2022, Ankara, Türkiye, 9 - 11 Haziran 2022 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/hora55278.2022.9799968
  • Basıldığı Şehir: Ankara
  • Basıldığı Ülke: Türkiye
  • Anahtar Kelimeler: audio, Birds, classification, CNN, identification, librosa, spectrogram, xeno-canto
  • Marmara Üniversitesi Adresli: Evet

Özet

© 2022 IEEE.Using convolutional neural networks, this thesis aims to create a system for fully automated identification of bird species based on spectrogram images. Spectrogram analysis is more difficult when trying to make an advance identification of a bird species. On a publicly available dataset of 8000 audio examples, we've begun by analyzing the challenges of bird species detection, segmentation, and classification to achieve our goal. It has been determined also that deep learning-based technique CNN with Fully convolutional learning calls for easier results because it eliminates the possible future modelling error caused by an imprecise knowledge of bird species and works well on coding in cohesion with the spectral analysis kernel using the librosa library. We have concluded. After obtaining the dataset from the open-source repository, it is then processed locally. For training, testing, and validation we used a subset of the dataset of 8000 sound samples. We offered a method relying on a CNN reset learned that proved to be very quick and optimum because it was first needing the spectrogram analytic kernel to learn what to class in bird species, and then it gets the system trained on features extracted. In a novel 9-step implementation, a bird species spectrogram can be detected from an audio sample. There was a loss of less than 0.0063, and the conditioning workouts accuracy is 0.9895 for the system, 0.9 as precision, and training and validation use 50 epochs in system.