Blog

Tika Text Extraction: Introduction & Example

Tika is a content extraction framework that builds on the best-of-breed open source content extraction libraries like Apache...

Getting Started with Lucene Setup

Apache Lucene is a fast, full-featured, full-text search library used in a large number of production environments. In this article,...

Solr Search Relevance Testing

Many people focus purely on the speed of search, often neglecting the quality of the results produced by the system....

Google chooses Solr for Public Service Website Search

One of the things I've long envied at Google is their 20% rule, by which they allow developers to carve...

Lucid Gaze for Lucene

Java developers have long been familiar with the power of the Apache Lucene open source search library, and have made...

Searching rich format documents stored in a DBMS

By Jonck van der Kogel. Introduction: As companies gather more and more data, the ability to search this data is...

Trends: Know your relevance

"Control, exploration, flexibility, tunability," were the answers expounded by representatives of Microsoft, Endeca, and Vivisimo. Relevance is in the eye...

ASF Interview with Apache Lucene creator Doug Cutting

https://www.youtube.com/v/XyDQAY9dwsQ

Training: Up and to the right

As the Great Recession tests all of our economic patience, many people I know, myself included, have gotten into the...

What Is the SpanQuery?

SpanQuerys allow for nested, positional restrictions when matching documents in Lucene. SpanQuerys are much like PhraseQuerys or MultiPhraseQuerys in that...

Publishing Case Study: McClatchy Interactive Employs Open Source Search

Highlights: Dramatically increased index speed improves timeliness and relevance of user searches. Reduced the number of required servers by 85%...

Library/Catalog Case Study: Europeana – bringing European culture online

Case Study: Chartered by the European Commission in 2007, the overarching goal of Europeana is to create an online environment...