官术网_书友最值得收藏!

Selecting the best N-grams

The number of different N-grams grows exponentially in N. Even for a fixed tiny N, such as N=3, there are 256x256x256=16,777,216 possible N-grams. This means that the number of N-grams features is impracticably large. Consequently, we must select a smaller subset of N-grams that will be of most value to our classifiers. In this section, we show three different methods for selecting the topmost informative N-grams.

主站蜘蛛池模板: 开封市| 东源县| 灌南县| 岑巩县| 白河县| 凤凰县| 行唐县| 逊克县| 麻阳| 新津县| 清新县| 阜新市| 德昌县| 灵璧县| 茂名市| 扎囊县| 巴林左旗| 临城县| 嘉善县| 瑞昌市| 板桥市| 嘉义市| 名山县| 武功县| 正定县| 辽宁省| 哈密市| 陆川县| 招远市| 尤溪县| 鹤岗市| 东丰县| 洱源县| 新巴尔虎左旗| 巫山县| 兴仁县| 华宁县| 康马县| 定安县| 安阳市| 收藏|