1 Wed Dec 16 00:42:12 GMT 2009 Olly Betts <olly@survex.com>
3 * stemming/german/output.txt,stemming/german2/output.txt: Update
4 expected output from the german and german2 stemmers to match the
7 Tue Sep 02 04:16:57 GMT 2008 Olly Betts <olly@survex.com>
9 * stemming/romanian/: Split out the romanian stemming test data into
10 the vanilla files taken from Snowball and supplemental files with
11 our extra data. Our voc.txt and output.txt files now exactly match
13 * stemming/README: Update to document the new scheme.
15 Tue Sep 02 02:59:00 GMT 2008 Olly Betts <olly@survex.com>
17 * stemming/: Rename all the stemming data files to match those in the
18 Snowball sources, to make maintenance easier.
20 Tue Sep 02 02:46:36 GMT 2008 Olly Betts <olly@survex.com>
22 * README: Reword. Drop author attribution, as it's not relevant who
23 wrote this two line note.
25 Thu Oct 18 18:11:07 BST 2007 Olly Betts <olly@survex.com>
27 * stemming/turkish.st,stemming/turkish.voc: Append new Snowball
29 * stemming/README: Note that most data here is from Snowball, but
32 Thu Mar 29 17:55:54 BST 2007 Richard Boulton <richard@lemurconsulting.com>
34 * stemming/romanian.st: Oops - also remove corresponding word from the
35 expected output file, or it won't work.
37 Thu Mar 29 17:49:44 BST 2007 Richard Boulton <richard@lemurconsulting.com>
39 * stemming/romanian.voc: Remove sample word which had a capital
40 letter, since stemtest doesn't cope with these.
42 Tue Mar 27 14:49:28 BST 2007 Olly Betts <olly@survex.com>
44 * stemming/romanian2.st,stemming/romanian2.voc: Remove romanian2
45 vocab and output since the corresponding stemmer has been removed.
47 Tue Mar 27 12:08:41 BST 2007 Richard Boulton <richard@lemurconsulting.com>
49 * stemming/: Rename romanian1 vocab list to romanian, and update
50 the output to correspond to that from the new romanian stemmer.
52 Tue Mar 27 04:51:43 BST 2007 Olly Betts <olly@survex.com>
54 * stemming/hungarian.voc: Lowercase terms with uppercase letters.
55 * stemming/: Import new stemming test data from snowball.
57 Mon Feb 26 21:46:41 GMT 2007 Olly Betts <olly@survex.com>
59 * stemming/: Add vocab lists and stemmed equivalents for
60 hungarian, kraaij_pohlmann, and romanian1.
62 Mon Feb 26 20:18:51 GMT 2007 Richard Boulton <richard@lemurconsulting.com>
64 * stemming/: Use new snowball vocab lists and stemmed word lists -
65 changes character set to UTF-8, and updates some of the stemmings
66 to reflect changes in the snowball stemmers.
68 Mon Oct 14 00:04:54 BST 2002 Olly Betts <olly@survex.com>
70 * stemming/: Use Snowball vocab lists instead of the old Xapian ones;
71 removed stopword lists (we'll use the Snowball ones, but they want
72 to be in xapian-core which everyone downloads, not with the stemmer
73 test data which few people need).
75 Sun Oct 13 03:44:58 BST 2002 Olly Betts <olly@survex.com>
77 * stemming/: Fixed layout of stopsource files where the muscat3.6
78 style "e^a" accents have been changed into "é" etc.
80 Sun Oct 13 03:34:09 BST 2002 Olly Betts <olly@survex.com>
82 * stemming/: Updated files so that they are correct for the Snowball
83 stemmers (which use iso-8859-1 accents and produce slightly
86 Fri Apr 12 14:07:40 BST 2002 Olly Betts <olly@survex.com>
88 * Started ChangeLog; updated README files to refer to Xapian and