Xapian

{{Short description|Information retrieval library}}

{{Notability|Product|date=February 2022}}

{{Infobox software

| name = Xapian

| logo =

| screenshot =

| caption =

| developer = xapian.org

| latest release version = {{wikidata|property|edit|P348}}

| latest release date = {{Start date and age|{{wikidata|qualifier|P348|P577}}}}

| programming language = C++

| operating system = Cross-platform

| platform =

| genre = Search and index API.

| license = GNU General Public License

| website = {{official URL}}

}}

Xapian is a free and open-source probabilistic information retrieval library, released under the GNU General Public License (GPL).{{cite web |url=http://xapian.org/features |title=The Xapian Project : Features |access-date=2018-03-31}} It is a full-text search engine library for programmers.

It is written in C++, with bindings to allow use from Perl, Python (2 and 3), PHP (5 and 7), Java, Tcl, C#, Ruby, Lua, Erlang, Node.js and R.{{cite web |url=https://xapian.org/ |title=The Xapian Project |access-date=2020-09-02}}[https://github.com/amandaJayanetti/RXapian#readme RXapian] Xapian is highly portable and runs on Linux, OS X, FreeBSD, NetBSD, OpenBSD, Solaris, HP-UX, AIX, Windows, OS/2{{cite web |url=http://freshmeat.net/projects/xapian/releases/273896 |title=Version 1.0.6 of Xapian and Omega |website=freshmeat.net|access-date=2018-03-31 |archive-url=https://web.archive.org/web/20110612182915/http://freshmeat.net/projects/xapian/releases/273896 |archive-date=2011-06-12}} and Hurd,[http://buildd.debian-ports.org/fetch.php?&pkg=xapian-core&ver=1.0.12-2&arch=hurd-i386&stamp=1242995814&file=log&as=raw Debian built success report for GNU Hurd]{{dead link|date=July 2016 |bot=InternetArchiveBot |fix-attempted=yes }}{{cite web |url=https://packages.debian.org/search?suite=all&arch=hurd-i386&searchon=names&keywords=xapian |title=Debian -- Package Search Results -- xapian |access-date=2018-03-31}} as well as Tru64.{{Citation needed|date=March 2018}} Xapian grew out of the Muscat search engine, written by Dr. Martin F. Porter at the University of Cambridge.{{cite web |url=https://xapian.org/history |title=The Xapian Project: History|access-date=2022-04-10}} The first official release of Xapian was version 0.5.0 on September 20, 2002.{{cite web |url=https://xapian.org/docs/xapian-core-1.4.19/NEWS |title=Xapian Core NEWS |access-date=2022-04-10}}

Xapian allows developers to add advanced indexing and search facilities to their own applications.

Organisations and projects using Xapian include the Library of the University of Cologne, Debian, Die Zeit, MoinMoin, and One Laptop per Child.{{cite web |url=http://xapian.org/users |title=Xapian Users |access-date=2018-03-31}}

Features<ref name=features/>

  • Supports Unicode 9.0 (including codepoints beyond the BMP) and stores indexed text in UTF-8.
  • Transactions: if database update fails in the middle of a transaction, the database is guaranteed to remain in a consistent state.
  • Simultaneous search and update, with new documents being immediately visible.
  • Support for large databases: Xapian has been proven to scale to hundreds of millions of documents.
  • Accurate probabilistic ranking: more relevant documents are listed first.
  • Phrase and proximity searching.
  • Relevance feedback, which improves ranking and can expand a query, find related documents, categorise documents etc.
  • Structured Boolean queries, e.g. "race AND condition NOT horse"
  • Wildcard search, e.g. "wiki*"
  • Spelling correction
  • Synonyms
  • Omega, a packaged solution for adding a search engine to a web site or intranet. Omega can easily be extended and adapted to fit changing requirements.

GUI front-ends

See also

{{Portal|Free and open-source software}}

References

{{Reflist}}