Terrier 3.0

Flexible search engine for everyone
The Terrier software is a flexible, efficient and effective search engine.

The application is readily deployable on large-scale collections of documents. Terrier implements state-of-the-art indexing and retrieval functionalities, and provides an ideal platform for the rapid development and evaluation of large-scale retrieval applications.

Main features:

  • Efficient:
  • Terrier can index large corpora of documents, and provides multiple indexing strategies, such as multi-pass, single-pass and large-scale MapReduce indexing.
  • Effective:
  • State-of-the-art retrieval approaches are provided, such as Divergence From Randomness, BM25F, as well as term dependence proximity models such as Markov Random Fields.
  • Flexible:
  • Terrier is ideal for performing information retrieval experiments. It can index and perform batch retrieval experiments for all known TREC test collections. Tools to evaluate experiments results are also included.
  • Multi-lingual:
  • Terrier uses UTF internally, and can support corporas written in languages other than English.
  • Extensible:
  • Terrier follows a plugin architecture, and is easy to extend to develop new retrieval techniques.
  • Interactive:
  • View search results in a handy desktop search application, or online from a JSP web interface.

last updated on:
June 17th, 2011, 10:08 GMT
file size:
14.3 MB
price:
FREE!
developed by:
The Terrier Team
license type:
Mozilla Public License 
operating system(s):
Windows All
category:
C: \ Internet \ Search engine tools/submiting

FREE!

In a hurry? Add it to your Download Basket!

user rating 3

3.7/5
 

0/5

Rate it!
4 Screenshots
Terrier - The main window of the application can be used to search for terms in your indexed locations.Terrier - The index tab found in the program displays the number of indexed documents.Terrier - You can add or remove folders from the program's index from this dedicated window.Terrier - Index your TREC collections using this included window found in the Terrier software.
What's New in This Release:
  • Major update:
  • Support for indexing WARC collections; improved index structure layout; improved MapReduce mode indexing; refined, scalable structure access at retrieval time; moved all code to terrier.org namespace; added field-based and proximity term dependence models; added HTTP-based retrieval interface; added many JUnit tests. All indices must be rebuilt.
  • Indexing:
read full changelog

Add your review!

SUBMIT