Análise de dados de sons de bezerros - Clique aqui para versão PT-BR
Analysis of bovine sound data, project titled "Standardization of dairy calves' vocalization," developed in partnership with the University of São Paulo (EESC-USP) and the University of São Paulo (ESALQ-USP), aiming to search for patterns in the vocalizations of the calves using Machine Learning techniques.
In animal science, the importance of animal vocalization is well known; however, little is explored regarding the composition of each sound and its relation to animal welfare. Therefore, we seek patterns by analyzing features extracted from sound data, using specialized libraries for audio analysis and processing (Librosa).
The audios were recorded on a farm with a controlled environment, using professional audio recording equipment.
Database with scarce data, Data Augmentation was applied to the audios in order to increase the robustness of the audios and the accuracy of the Machine Learning model to search for patterns.
The audio database contains animal vocalizations, organized into vocalizations from July and November, following a naming convention:
"animalID_breastfeeding_DayMonthYear_"
Concepts of Animal Science:
pm --> Pre-suckling
dm --> Post-suckling
- 01_dm_manha_100722.wav --> Animal 01 Post-weaning on July 10, 2022
Furthermore, the database contains the dataframes with the 11 features extracted from the audios, separated by each animal group, date, and condition (pm or dm).
Data augmentation techniques were applied, and the features were extracted considering the augmented sound data, which are labeled with "aumentado" (meaning "augmented" in PT-BR) at the end of the folder name. The Data Augmentation techniques were adapted to work with sound and have the same effect as when applied to images. Therefore, for the sounds, we applied the following techniques:
- Addition of random noise
- Change of pitch
- Time shifting
- Jupyter Notebook (Python 3)
- Librosa
- Pandas
- Mosqito
- SciPy
- Scikit-learn
- Seaborn
- Matplotlib
🔧Access requirements.txt for installation
- Mean FFTs (Fast Fourier Transform)
- Max FFT’s (Fast Fourier Transform)
- Loudness
- Envelopes - five histograms of data
- Roughness
- Sharpness
- Spectrogram
With that, 11 features will be extracted from the audios using audio analysis and processing libraries. The features will be processed through classification algorithms to select the best features for the model.
- PhD. Maíra Martins da Silva (EESC-USP)
- PhD. Iran José Oliveira da Silva
- M.Sc Karen Airosa Machado de Azevedo