官术网_书友最值得收藏!

Speech

Adding one of the Speech APIs allows your application to hear and speak to your users. The APIs can filter noise and identify speakers. Based on the recognized intent, they can drive further actions in your application.

The speech domain contains three APIs that are outlined in the following sections.

Bing Speech

Adding the Bing Speech API to your application allows you to convert speech to text and vice versa. You can convert spoken audio to text either by utilizing a microphone or other sources in real time or by converting audio from files. The API also offers speech intent recognition, which is trained by the Language Understanding Intelligent Service (LUIS) to understand the intent.

Speaker recognition

The speaker recognition API gives your application the ability to know who is talking. By using this API, you can verify that the person that is speaking is who they claim to be. You can also determine who an unknown speaker is based on a group of selected speakers.

Translator speech API

The translator speech API is a cloud-based automatic translation service for spoken audio. Using this API, you can add end-to-end translation across web apps, mobile apps, and desktop applications. Depending on your use cases, it can provide you with partial translations, full translations, and transcripts of the translations cover all speech-related APIs in Chapter 5, Speak with Your Application.

主站蜘蛛池模板: 通化县| 乌苏市| 金塔县| 安多县| 湾仔区| 海南省| 万荣县| 浏阳市| 通河县| 西乌| 本溪市| 博客| 德钦县| 水富县| 萍乡市| 沛县| 榕江县| 慈溪市| 镇安县| 彰化县| 防城港市| 东港市| 曲靖市| 汾阳市| 荔波县| 宁安市| 南乐县| 宝鸡市| 罗城| 上杭县| 武鸣县| 电白县| 安溪县| 海宁市| 昭觉县| 科尔| 固原市| 临汾市| 商南县| 萝北县| 乌兰察布市|