- Neural Networks with Keras Cookbook
- V Kishore Ayyadevara
- 2021-07-02 12:46:33
Classifying common audio
The previous sections covered modeling on structured datasets and on unstructured text data.
In this section, we will perform a classification exercise where the input is raw audio.
Our strategy is to extract features from the input audio so that each audio signal is represented as a vector with a fixed number of features.
There are multiple ways of extracting features from audio; for this exercise, we will extract the Mel Frequency Cepstral Coefficients (MFCCs) of each audio file.
Once the features are extracted, we will perform classification in much the same way as we built the model for MNIST classification, with hidden layers connecting the input and output layers.
In the following section, we will classify an audio dataset that has ten possible output classes.
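The classifier described above might look roughly like the following minimal sketch: a small fully connected Keras network that maps a fixed-length feature vector to ten class probabilities. The input size of 40, the hidden-layer widths, the optimizer, and the random placeholder data are all assumptions made for illustration, not values from the text.

```python
import numpy as np
from tensorflow import keras

n_features = 40  # assumed length of each audio feature vector
n_classes = 10   # ten possible output classes

# Hidden layers connect the input to a softmax output layer.
# Layer widths are illustrative choices.
model = keras.Sequential([
    keras.Input(shape=(n_features,)),
    keras.layers.Dense(256, activation="relu"),
    keras.layers.Dense(128, activation="relu"),
    keras.layers.Dense(n_classes, activation="softmax"),
])
model.compile(
    optimizer="adam",
    loss="sparse_categorical_crossentropy",  # integer class labels
    metrics=["accuracy"],
)

# Random placeholder data standing in for real MFCC vectors and labels.
rng = np.random.default_rng(0)
x = rng.normal(size=(32, n_features)).astype("float32")
y = rng.integers(0, n_classes, size=32)

model.fit(x, y, epochs=1, batch_size=8, verbose=0)
probs = model.predict(x[:4], verbose=0)
print(probs.shape)
```

Each row of `probs` holds one probability per class, and the predicted class is the index of the largest entry. With random inputs the predictions are meaningless; the point is only the shape of the pipeline.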