官术网_书友最值得收藏!

Selecting the best N-grams

The number of different N-grams grows exponentially in N. Even for a fixed tiny N, such as N=3, there are 256x256x256=16,777,216 possible N-grams. This means that the number of N-grams features is impracticably large. Consequently, we must select a smaller subset of N-grams that will be of most value to our classifiers. In this section, we show three different methods for selecting the topmost informative N-grams.

主站蜘蛛池模板: 合川市| 东台市| 济源市| 鄂托克前旗| 兴文县| 堆龙德庆县| 夏津县| 霸州市| 辽中县| 醴陵市| 乐山市| 宜丰县| 修水县| 河源市| 龙口市| 鸡西市| 三河市| 江华| 英超| 盐池县| 云南省| 南乐县| 辉南县| 海晏县| 黎川县| 延安市| 阿鲁科尔沁旗| 铜山县| 龙川县| 长治市| 盐亭县| 龙井市| 营山县| 石林| 平江县| 庐江县| 星座| 南昌县| 安西县| 卢龙县| 谷城县|