Skip to end of metadata
Go to start of metadata

1.11. How do I make the archives searchable

The archives (unless made explicitly private) are indexable and searchable by standard web search tools (for example you could encourage google to index your archive pages and then do searches with google restricting results to your domain).

  eg Search google for, "clever technique site:your.domain.dom"

But you should note that Google does not index all your posts, especially the recent posts which have not yet been crawled and indexed yet. An alternative is to use a third-party mailing list archive service which usually index your messages up to the minute. These services are usually free, although some may run Google ads. Below is a list of providers:

Mail-archive: http://www.mail-archive.com/ Home of the official Mailman-* mailing list searchable archives, they also run Google ads.

The following sites have set up unofficial gateways of the Mailman-* mailing lists in the past, but were asked to terminate their unapproved activities, or investigation into their unapproved activities is ongoing:

Gmane: http://www.gmane.org/ Gmane is an older USENET gateway/archive service, some lists may have delay in indexing
Nabble: http://www.nabble.com/ Nabble has instant indexing up to the minute, clean UI.

Go to the sites, fill out a form to request your lists to get added. Then you will have a searchable archive within a day or two. They may also offer import of historical mailing list archives.

Other commonly used and recommended external search tools include:

Swish: http://www.swish-e.org
MnoGoSearch: http://www.mnogosearch.org/
HT:Dig: http://www.htdig.org/

For instructions on how to incorporate Namazu as the search engine for Mailman, see 4.08 How can I add Namazu as a search engine for my Mailman list archives?.

For instructions on how to incorporate Swish-e as the search engine for Mailman, see http://wpkg.org/Integrating_Mailman_with_a_Swish-e_search_engine
There is also some useful instruction at this page: http://infothecary.org/jordan/mailman.html

As an alternative, there are some patches for Mailman that integrate it with HT:Dig and work with both pipermail, the Mailman internal archiver, and/or with the MHonArc external archive page generator when that is also integrated with Mailman. The patches maintain list archive privacy while providing per-list indexing and searching of the archive pages. The patch adds a search box to each list's archive TOC page.

The patches to handle this are on the Mailman Launchpad bugs pages, with ids:-

  [ #266553 ] Archive indexer control to improve indexing
  [ #266554 ] Integration of Mailman & htdig for archive search
  [ #558089 ] Mailman/pipermail/MHonArc integration patch

The Launchpad URLs for the patches are given below but if you have any difficulty retrieving them from Launchpad then they are also posted on the following site where there is more detail about each patch and its use:

http://www.openinfo.co.uk/

https://bugs.launchpad.net/mailman/+bug/266553
https://bugs.launchpad.net/mailman/+bug/266554
https://bugs.launchpad.net/mailman/+bug/558089

Instructions for applying these patches (you need the copy of the Mailman source tree you installed from) are given on the referenced pages.

Current versions of the indexing and htdig integration patches combined into a single patch are available at http://www.msapiro.net/mm/ (mirrored at http://fog.ccsf.cc.ca.us/~msapiro/mm/)

Converted from the Mailman FAQ Wizard

This is one of many Frequently Asked Questions.

Labels
  • None