官术网_书友最值得收藏!

Introduction

In this chapter, you will learn two very important recipes. The first recipe demonstrates how you can index your data, and the second recipe, which is very closely connected to the first recipe, demonstrates how you can search through your indexed data.

For both indexing and searching, we will be using Apache Lucene. Apache Lucene is a free, opensource Java software library used heavily for information retrieval. It is supported by the Apache Software Foundation and is released under the Apache Software License.

Many different modern search platforms, such as Apache Solr and ElasticSearch, or crawling platforms, such as Apache Nutch, use Apache Lucene in the backend for data indexing and searching. Therefore, any data scientist who learns those search platforms will benefit from the two basic recipes in this chapter.

主站蜘蛛池模板: 微博| 柞水县| 临邑县| 灌南县| 习水县| 杭州市| 湘西| 台前县| 公主岭市| 琼海市| 斗六市| 汕头市| 玛沁县| 拜泉县| 舞钢市| 独山县| 运城市| 嘉兴市| 武宣县| 岗巴县| 廊坊市| 卢湾区| 蒙阴县| 岳阳市| 贡嘎县| 商水县| 禹州市| 安徽省| 富阳市| 盐城市| 凌源市| 巴马| 贡嘎县| 巧家县| 桐梓县| 天门市| 冀州市| 昌江| 油尖旺区| 大理市| 景泰县|