- Neural Networks with Keras Cookbook
- V Kishore Ayyadevara
- 2021-07-02 12:46:33
Classifying common audio
In the previous sections, we learned how to build models on structured datasets and on unstructured text data.
In this section, we will perform a classification exercise where the input is raw audio.
Our strategy is to extract features from the input audio so that each audio signal is represented as a vector with a fixed number of features.
There are multiple ways of extracting features from audio; for this exercise, we will extract the Mel Frequency Cepstral Coefficients (MFCCs) of each audio file.
Once we have extracted the features, we will perform the classification in much the same way as we built a model for MNIST dataset classification, where hidden layers connected the input and output layers.
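A model of that shape could be sketched in Keras as follows. This is only an illustration under stated assumptions: the 40 input features, the single hidden layer of 256 units, the optimizer, and the random placeholder data are not taken from this section, and the ten output units correspond to the ten-class dataset used in this section.

```python
import numpy as np
from tensorflow import keras

NUM_FEATURES = 40   # assumed length of each audio feature vector
NUM_CLASSES = 10    # number of output classes in the audio dataset


def build_model(num_features=NUM_FEATURES, num_classes=NUM_CLASSES):
    """Build a small fully connected classifier (illustrative layer sizes)."""
    model = keras.Sequential([
        keras.Input(shape=(num_features,)),
        keras.layers.Dense(256, activation="relu"),            # hidden layer
        keras.layers.Dense(num_classes, activation="softmax"),  # class probabilities
    ])
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",  # integer class labels
        metrics=["accuracy"],
    )
    return model


model = build_model()

# Random placeholder data, used only to show the expected array shapes.
rng = np.random.default_rng(0)
features = rng.normal(size=(8, NUM_FEATURES)).astype("float32")
labels = rng.integers(0, NUM_CLASSES, size=8)

model.fit(features, labels, epochs=1, verbose=0)
probabilities = model.predict(features, verbose=0)
print(probabilities.shape)
```

Each row of `probabilities` holds one score per class, and the predicted class for a clip is the index of the largest score in its row.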
In the following section, we will perform classification on an audio dataset that has ten possible output classes.