Sergey Matveev's repositories - public-inbox.git/commit
search: support alt-ID for mapping legacy serial numbers
author Eric Wong <e@80x24.org>
Thu, 11 Aug 2016 00:23:48 +0000 (00:23 +0000)
committer Eric Wong <e@80x24.org>
Thu, 11 Aug 2016 02:28:40 +0000 (02:28 +0000)
commit 58a5bb3e18901237b1ca34ef8f03f696be27d305
tree 1725220ed46271629d71b3c5af3b8b2b328755a1
parent c357e8699d99e20e1033e13bd1e2faa9856fb456

For some existing mailing list archives, messages are identified
by serial number (such as NNTP article numbers in gmane).  Those
links may become inaccessible (as is the current case for
gmane), so ensure users can still search based on old serial
numbers.

Now, I run the following periodically to get article numbers
from gmane (while news.gmane.org remains):

    NNTPSERVER=news.gmane.org
    export NNTPSERVER
    GROUP=gmane.comp.version-control.git
    perl -I lib scripts/xhdr-num2mid $GROUP --msgmap=/path/to/gmane.sqlite3

(I might integrate this further with public-inbox-* scripts one day).

My ~/.public-inbox/config has an added "altid" snippet and now
looks like this:

    [publicinbox "git"]
        address = git@vger.kernel.org
        mainrepo = /path/to/git.vger.git
        newsgroup = inbox.comp.version-control.git

        ; relative pathnames expand to $mainrepo/public-inbox/$file
        altid = serial:gmane:file=gmane.sqlite3
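
The altid value above appears to pack three colon-separated fields:
a type ("serial"), a search prefix ("gmane"), and a file= parameter.
As a rough illustration of how such a value could be split and how
the relative-path rule from the config comment applies, here is a
minimal shell sketch.  The resolve_altid helper is hypothetical and
is not part of public-inbox; the real parsing lives in the Perl
modules touched by this commit and may differ.

```shell
#!/bin/sh
# Sketch only: split an altid spec of the form TYPE:PREFIX:file=PATH and
# resolve a relative PATH against $mainrepo/public-inbox/.
# resolve_altid is a hypothetical helper, not a public-inbox command.
resolve_altid() {
    spec=$1
    mainrepo=$2

    type=${spec%%:*}        # text before the first colon
    rest=${spec#*:}         # everything after the first colon
    prefix=${rest%%:*}      # text between the first and second colon
    param=${rest#*:}        # remainder, expected to be file=PATH

    case $param in
        file=*) file=${param#file=} ;;
        *) echo "unsupported altid spec: $spec" >&2; return 1 ;;
    esac

    case $file in
        /*) ;;                                      # absolute: keep as-is
        *) file=$mainrepo/public-inbox/$file ;;     # relative: expand
    esac

    printf '%s %s %s\n' "$type" "$prefix" "$file"
}
```

Calling resolve_altid with the spec from the config and the mainrepo
path prints the type, the prefix, and the expanded database path on
one line.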

And run "public-inbox-index --reindex /path/to/git.vger.git"
periodically.
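
The two periodic steps (fetching article numbers, then reindexing)
could be chained in a small wrapper suitable for cron.  This is only
a sketch under assumptions: refresh_gmane_altid and the RUN dry-run
variable are hypothetical names, the first argument is assumed to be
a public-inbox source checkout containing lib/ and scripts/, and the
map is assumed to live at $mainrepo/public-inbox/gmane.sqlite3 so
that it matches the relative altid path in the config.

```shell
#!/bin/sh
# Hypothetical wrapper around the two commands from the commit message.
# Set RUN=echo to print the commands instead of executing them.
refresh_gmane_altid() {
    srcdir=$1       # public-inbox source checkout (assumption)
    mainrepo=$2     # e.g. /path/to/git.vger.git
    group=$3        # e.g. gmane.comp.version-control.git
    map=$mainrepo/public-inbox/gmane.sqlite3

    # Note: this also sets NNTPSERVER in the calling shell.
    NNTPSERVER=news.gmane.org
    export NNTPSERVER

    # Step 1: update the serial number -> Message-ID map.
    # Step 2: reindex only if step 1 succeeded.
    (cd "$srcdir" &&
        ${RUN:-} perl -I lib scripts/xhdr-num2mid "$group" --msgmap="$map") &&
    ${RUN:-} public-inbox-index --reindex "$mainrepo"
}
```

Running it with RUN=echo first is a cheap way to check the expanded
paths before letting it contact news.gmane.org.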

This ought to allow searches for "gmane:12345" to work on
Xapian-enabled instances.

Disclaimer: while public-inbox supports NNTP and stable article
serial numbers, use of those for public links is discouraged
since it encourages centralization.
MANIFEST
lib/PublicInbox/AltId.pm [new file with mode: 0644]
lib/PublicInbox/Config.pm
lib/PublicInbox/Inbox.pm
lib/PublicInbox/Msgmap.pm
lib/PublicInbox/Search.pm
lib/PublicInbox/SearchIdx.pm
script/public-inbox-index
scripts/xhdr-num2mid
t/altid.t [new file with mode: 0644]