官术网_书友最值得收藏!

Classifying common audio

In the previous sections, we have understood the strategy to perform modeling on a structured dataset and also on unstructured text data.

In this section, we will be learning about performing a classification exercise where the input is raw audio.

The strategy we will be adopting is that we will be extracting features from the input audio, where each audio signal is represented as a vector of a fixed number of features.

There are multiple ways of extracting features from an audio—however, for this exercise, we will be extracting the Mel Frequency Cepstral Coefficients (MFCC) corresponding to the audio file.

Once we extract the features, we shall perform the classification exercise in a way that is very similar to how we built a model for MNIST dataset classification—where we had hidden layers connecting the input and output layers.

In the following section, we will be performing classification on top of an audio dataset where there are ten possible classes of output.

主站蜘蛛池模板: 玉林市| 四子王旗| 垦利县| 霍州市| 凉城县| 信丰县| 酒泉市| 黄石市| 莎车县| 卢龙县| 丰原市| 吴旗县| 锦屏县| 嵩明县| 中方县| 高安市| 扎兰屯市| 长武县| 三明市| 南木林县| 金山区| 云龙县| 建昌县| 湘阴县| 昔阳县| 宜春市| 遂平县| 寿光市| 绥德县| 东源县| 柳州市| 黄陵县| 永年县| 涞水县| 正安县| 岑巩县| 永登县| 东宁县| 泊头市| 会昌县| 松江区|