Design and Implementation of a Lucene-Based Book Search Engine: foreign-language reference literature. Content summary:

People new to Lucene often mistake it for a ready-to-use application like a file-search program, a web crawler, or a web site search engine. That isn’t what Lucene is: Lucene is a software library, a toolkit if you will, not a full-featured search application. It concerns itself with text indexing and searching, and it does those things very well. Lucene lets your application deal with business rules specific to its problem domain while hiding the complexity of indexing and searching implementation behind a simple-to-use API. You can think of Lucene as a layer that applications sit on top of, as depicted in the figure.

A number of full-featured search applications have been built on top of Lucene. If you’re looking for something prebuilt or a framework for crawling, document handling, and searching, consult the Lucene Wiki “powered by” page ( lucene/PoweredBy) for many options: Zilverline, SearchBlox, Nutch, LARM, and jSearch, to name a few. Case studies of both Nutch and SearchBlox are included in chapter 10.

What Lucene can do for you

Lucene allows you to add indexing and searching capabilities to your applications (these functions are described in a separate section). Lucene can index and make searchable any data that can be converted to a textual format. As you can see in the figure, Lucene doesn’t care about the source of the data, its format, or even its language, as long as you can convert it to text. This means you can use Lucene to index and search data stored in files: web pages on remote web servers, documents stored in local file systems, simple text files, Microsoft Word documents, HTML or PDF files, or any other format from which you can extract textual information.

Similarly, with Lucene’s help you can index data stored in your databases, giving your users full-text search capabilities that many databases don’t provide. Once you integrate Lucene, users of your applications can make searches such as +George +Rice -eat -pudding, Apple -pie +Tiger, animal:monkey AND food:banana, and so on.
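To make the idea of indexing text and then querying it with required and prohibited terms more concrete, here is a minimal sketch in plain Python. It is not Lucene and does not use Lucene's API: the names ToyIndex, tokenize, and parse_query are hypothetical, the sample documents are invented, and the matching rules (a leading + marks a required term, a leading - marks a prohibited term, unmarked terms are optional) are only a simplified approximation of how such queries are commonly interpreted. There is no scoring, no field support, and no analysis beyond lowercasing.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def parse_query(query):
    """Split a query into required (+), prohibited (-), and optional terms."""
    required, prohibited, optional = [], [], []
    for token in query.split():
        if token.startswith("+"):
            bucket, word = required, token[1:]
        elif token.startswith("-"):
            bucket, word = prohibited, token[1:]
        else:
            bucket, word = optional, token
        bucket.extend(tokenize(word))
    return required, prohibited, optional


class ToyIndex:
    """A tiny in-memory inverted index: term -> set of document ids."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        # The index only ever sees text; where the text came from
        # (file, web page, database row) is the caller's concern.
        for term in tokenize(text):
            self.postings[term].add(doc_id)

    def search(self, query):
        required, prohibited, optional = parse_query(query)
        if required:
            # Every required term must be present.
            hits = set.intersection(
                *(self.postings.get(t, set()) for t in required)
            )
        else:
            # Without required terms, any optional term is enough.
            hits = set().union(
                *(self.postings.get(t, set()) for t in optional)
            )
        for term in prohibited:
            hits -= self.postings.get(term, set())
        return sorted(hits)


# Invented sample documents, already converted to plain text.
index = ToyIndex()
index.add(1, "George Rice likes to eat rice pudding")
index.add(2, "George Rice wrote a book about search")
index.add(3, "Apple pie with a Tiger on the box")
index.add(4, "Apple and Tiger are product names")

print(index.search("+George +Rice -eat -pudding"))  # [2]
print(index.search("Apple -pie +Tiger"))            # [4]
```

In this sketch the application decides what text to feed in and what to do with the returned document ids, while the index hides how terms are stored and matched. That division of labor is the point being illustrated, not the particular data structure.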
With Lucene, you can index and search messages, mailing-list archives, instant messenger chats, your Wiki pages ... the list goes on.

History of Lucene

Lucene was originally written by Doug Cutting; it was initially available for download from its home at the SourceForge web site. It joined the Apache Software Foundation’s Jakarta family of high-quality open source Java products in September 2001. With each release since then, the project has enjoyed increased visibility, attracting more users and developers. As of July 2004, the latest Lucene version has been released, with a bug-fix release in early October. Table shows Lu…