1 Fri Sep 25 04:39:54 GMT 2015 Olly Betts <olly@survex.com>
3 * docs/overview.rst: SVG extraction is built-in too.
5 Fri Sep 25 04:32:04 GMT 2015 Olly Betts <olly@survex.com>
7 * docs/cgiparams.rst: Document behaviour if xDB is not set.
9 Fri Sep 25 04:31:15 GMT 2015 Olly Betts <olly@survex.com>
11 * docs/cgiparams.rst,query.cc: If xFILTERS is not set, don't force the
12 first page as that's unhelpful if someone fails to set it in their
15 Fri Sep 25 01:22:11 GMT 2015 Olly Betts <olly@survex.com>
17 * configure.ac: Don't provide our own implementation of sleep() under
18 __WIN32__ if there's already one - mingw provides one, and in some
19 situations it seems to clash with ours. Reported to xapian-discuss
22 Wed May 20 09:57:45 GMT 2015 Olly Betts <olly@survex.com>
24 * NEWS: Bump release date for 1.2.21.
26 Fri May 15 05:32:35 GMT 2015 Olly Betts <olly@survex.com>
28 * NEWS: Another update for 1.2.21.
30 Fri May 15 05:16:05 GMT 2015 Olly Betts <olly@survex.com>
32 * Makefile.am,configure.ac: Use -no-install or -no-fast-install when
33 linking test programs which never get installed, which means libtool
34 can often avoid creating a shell script wrapper.
36 Thu May 14 14:52:14 GMT 2015 Olly Betts <olly@survex.com>
38 * NEWS,configure.ac: Update for 1.2.21.
40 Thu May 14 10:17:32 GMT 2015 Olly Betts <olly@survex.com>
44 Thu May 14 05:16:08 GMT 2015 Olly Betts <olly@survex.com>
46 * omindex.cc: Message tweak.
48 Thu May 14 05:15:18 GMT 2015 Olly Betts <olly@survex.com>
50 * outlookmsg2html.in: Fix handling of message/rfc822 subparts.
52 Thu May 14 04:16:34 GMT 2015 Olly Betts <olly@survex.com>
54 * cdb_hash.cc,md5.cc: Remove 'register' as it's deprecated, and
55 likely to just be ignored by any modern compiler anyway.
57 Wed May 06 10:08:33 GMT 2015 Olly Betts <olly@survex.com>
61 Wed May 06 07:26:28 GMT 2015 Olly Betts <olly@survex.com>
63 * omega.cc: Drop compilation date and time from output - they prevent
64 reproducible builds and the version number is sufficient
67 Wed May 06 07:24:27 GMT 2015 Olly Betts <olly@survex.com>
69 * commonhelp.cc,omega.cc,omindex.cc,scriptindex.cc: Add spaces between
70 literal strings and macros which expand to literal strings for C++11
73 Wed May 06 04:56:39 GMT 2015 Olly Betts <olly@survex.com>
75 * Makefile.am: Add common/stringutils.cc to urlenctest_SOURCES, needed
76 now urldecode.h uses C_isxdigit().
78 Wed May 06 04:53:56 GMT 2015 Olly Betts <olly@survex.com>
80 * configfile.cc,htmlparse.cc,myhtmlparse.cc,omega.cc,omindex.cc,
81 query.cc,scriptindex.cc,urldecode.h: Consistently use C_isupper(),
82 C_toupper(), etc as these versions aren't affected by the locale
83 setting, and also allow signed char values (so we don't need to
84 cast the argument to unsigned char).
86 Tue May 05 14:17:16 GMT 2015 Olly Betts <olly@survex.com>
88 * templates/query: When listing matching terms, don't make the commas
91 Tue May 05 12:43:59 GMT 2015 Olly Betts <olly@survex.com>
93 * docs/overview.rst: Document 'E' prefixed boolean terms for filtering
94 by extension (see #668, reported by bramvdh).
96 Tue May 05 12:42:59 GMT 2015 Olly Betts <olly@survex.com>
98 * docs/overview.rst: Whitespace cleanup.
100 Tue May 05 12:41:52 GMT 2015 Olly Betts <olly@survex.com>
102 * templates/xml: Add XML declaration.
104 Tue May 05 12:40:44 GMT 2015 Olly Betts <olly@survex.com>
106 * templates/query: Eliminate blank line before <html>.
108 Tue May 05 12:39:01 GMT 2015 Olly Betts <olly@survex.com>
110 * templates/godmode: Return charset utf-8 in the content-type.
112 Tue May 05 12:29:46 GMT 2015 Olly Betts <olly@survex.com>
114 * docs/encodings.rst: $prettyurl undoes %-encoding of UTF-8 in 1.2.21
117 Tue May 05 12:27:14 GMT 2015 Olly Betts <olly@survex.com>
119 * docs/encodings.rst: Note that one should ensure that Omega gets sent
120 form submissions encoded in UTF-8.
122 Tue May 05 12:23:09 GMT 2015 Olly Betts <olly@survex.com>
124 * docs/encodings.rst: Discuss encodings of filenames (see #550).
126 Tue May 05 12:20:00 GMT 2015 Olly Betts <olly@survex.com>
128 * urldecode.h,urlenctest.cc: $prettyurl now decodes valid UTF-8
129 sequences. Fixes #550 and #644, reported by catkin and terencz.
131 Tue May 05 12:14:06 GMT 2015 Olly Betts <olly@survex.com>
133 * docs/: Add a document about character encoding, as suggested by
134 James Aylett in #550.
136 Tue May 05 12:08:39 GMT 2015 Olly Betts <olly@survex.com>
138 * urldecode.h,urlenctest.cc: Improve decoding done by $prettyurl - we
139 now leave the query and fragment parts of the URL alone and don't
140 decode an escaped "/" (omindex doesn't create URLs with any of
141 these, so we only risk breaking other URLs which have them), and we
142 decode some additional ASCII characters in the path part:
143 []@!$&'()*+.;= (addresses #550 in part)
145 Wed Mar 04 00:29:44 GMT 2015 Olly Betts <olly@survex.com>
147 * NEWS,configure.ac: Update for 1.2.20.
149 Wed Mar 04 00:03:35 GMT 2015 Olly Betts <olly@survex.com>
151 * Backport change from trunk:
152 * configure.ac: Avoid doing link tests with libmagic in configure as
153 they fail on mingw due to not automatically picking up libraries
154 which libmagic itself depends on.
156 Sat Feb 14 10:46:47 GMT 2015 Olly Betts <olly@survex.com>
160 Tue Jan 27 04:37:12 GMT 2015 Olly Betts <olly@survex.com>
162 * docs/overview.rst,omindex.cc: Interpret a command of "false" in
163 "--filter" as meaning to ignore files with that MIME type.
165 Sat Jan 24 12:14:33 GMT 2015 Olly Betts <olly@survex.com>
169 Thu Jan 22 01:44:01 GMT 2015 Olly Betts <olly@survex.com>
171 * omindex.cc: Ignore extensions .msi and .msp, which are Microsoft
172 installer files, but which libmagic sometimes incorrectly identifies
173 as application/msword.
175 Fri Dec 19 03:21:13 GMT 2014 Olly Betts <olly@survex.com>
177 * docs/cgiparams.rst: Improve wording of docs for SORT parameter.
179 Tue Jan 06 21:15:14 GMT 2015 Olly Betts <olly@survex.com>
181 * configure.ac: Use pkg-config in preference to determine flags needed
182 to compile and link with PCRE, as this will just work when
183 cross-compiling (at least under MXE).
185 Sun Dec 21 21:54:48 GMT 2014 Olly Betts <olly@survex.com>
187 * query.cc: Handle [=0 as [=1.
189 Tue Dec 16 03:53:25 GMT 2014 Olly Betts <olly@survex.com>
191 * configure.ac: Define MINGW_HAS_SECURE_API under mingw to get
192 _putenv_s() declared in stdlib.h.
194 Wed Dec 03 04:17:18 GMT 2014 Olly Betts <olly@survex.com>
196 * templates/xml: Update handling of DATE1, DATE2 and DAYSMINUS which
197 were renamed in 0.6.x and the compatibility aliases removed in
200 Wed Dec 03 04:15:51 GMT 2014 Olly Betts <olly@survex.com>
202 * docs/omegascript.rst: Update documentation references to DATE1,
203 DATE2, and DAYSMINUS which were renamed in 0.6.x and the
204 compatibility aliases removed in 1.0.0.
206 Wed Dec 03 02:29:37 GMT 2014 Olly Betts <olly@survex.com>
208 * omindex.cc: Remove unused '#include <fstream>'.
210 Wed Dec 03 02:18:51 GMT 2014 Olly Betts <olly@survex.com>
212 * diritor.h: Fix get_mtime() to return time_t not off_t. In practice,
213 this probably wouldn't have caused issues until at least 2038.
215 Fri Nov 28 11:38:56 GMT 2014 Olly Betts <olly@survex.com>
217 * configure.ac: Move AC_CANONICAL_HOST before first use of $host_os.
218 In practice this wasn't a problem, as LT_INIT implicitly calls
219 AC_CANONICAL_HOST before this point anyway.
221 Wed Nov 26 03:55:13 GMT 2014 Olly Betts <olly@survex.com>
223 * configure.ac: Enable automake option 'subdir-objects' to avoid
224 warning from newer automake.
226 Tue Oct 21 00:33:58 GMT 2014 Olly Betts <olly@survex.com>
228 * NEWS,configure.ac: Update for 1.2.19.
230 Mon Oct 20 10:52:38 GMT 2014 Olly Betts <olly@survex.com>
234 Mon Oct 20 10:44:27 GMT 2014 Olly Betts <olly@survex.com>
236 * Backport change from trunk:
237 * docs/overview.rst: Note that pdftotext is part of poppler as well as
238 xpdf. (Noted by Paul Wise)
240 Sun Jun 22 04:32:15 GMT 2014 Olly Betts <olly@survex.com>
242 * NEWS,configure.ac: Update for 1.2.18.
244 Sat Jun 21 00:50:56 GMT 2014 Olly Betts <olly@survex.com>
246 * Backport change from trunk:
247 * Makefile.am: Don't compile in unixperm.cc - it isn't currently used,
248 and it fails to build with mingw. (fixes #635)
250 Thu Jun 19 13:56:02 GMT 2014 Olly Betts <olly@survex.com>
254 Thu Jun 19 13:11:43 GMT 2014 Olly Betts <olly@survex.com>
256 * omindex.cc,urlencode.cc,urlencode.h: To improve compatibility with
257 existing databases in cases which are OK in practice, only URL
258 encode the starting URL if it contains a character <= 31 or at least
259 one of '#', '%', ':' or '?'.
261 Thu Jun 19 08:44:22 GMT 2014 Olly Betts <olly@survex.com>
263 * Backport change from trunk:
264 * omindex.cc,urlencode.cc,urlencode.h: URL encode starting URL
267 Thu Jun 19 08:27:13 GMT 2014 Olly Betts <olly@survex.com>
269 * Backport change from trunk:
270 * docs/omegascript.rst: Put ``...`` around Xapian C++ class names.
272 Thu Jun 19 08:10:17 GMT 2014 Olly Betts <olly@survex.com>
274 * Backport change from trunk:
275 * diritor.cc: Fix "applications/msword" to "application/msword" in the
276 fallback code for CDF files.
278 Thu Jun 19 07:55:11 GMT 2014 Olly Betts <olly@survex.com>
280 * Backport change from trunk:
281 * diritor.cc: In fallback for CDF files, compare the extension
282 *without* leading dot.
284 Thu Jun 19 07:02:12 GMT 2014 Olly Betts <olly@survex.com>
286 * Backport change from trunk:
287 * diritor.cc: libmagic can return a second string starting "Composite
288 Document File V2 Document" for the mime-type, so just look for that
289 prefix. And newer libmagic returns "application/CDFV2-corrupt" in
290 these cases, so handle that too.
292 Fri May 23 13:43:33 GMT 2014 Olly Betts <olly@survex.com>
294 * Backport change from trunk:
295 * date.cc,scriptindex.cc: Pass std::string by const reference.
297 Fri May 23 13:42:02 GMT 2014 Olly Betts <olly@survex.com>
299 * Backport change from trunk:
300 * query.cc: Removed unused inline function.
302 Fri May 23 13:38:33 GMT 2014 Olly Betts <olly@survex.com>
304 * Backport change from trunk:
305 * omindex.cc: Report the size limit in the message when we skip a file
308 Wed Apr 30 05:44:54 GMT 2014 Olly Betts <olly@survex.com>
312 Mon Apr 14 10:28:51 GMT 2014 Olly Betts <olly@survex.com>
314 * Backport change from trunk:
315 * diritor.cc: Work around libmagic returning a MIME content-type of
316 "Composite Document File V2 Document, No summary info".
318 Mon Apr 14 09:38:51 GMT 2014 Olly Betts <olly@survex.com>
320 * Backport change from trunk:
321 * expand.cc: Fix warning when built with GCC 4.7.2 using -Os.
323 Fri Mar 14 12:12:59 GMT 2014 Olly Betts <olly@survex.com>
325 * Backport change from trunk:
326 * Makefile.am,docs/omegascript.rst,expand.cc,expand.h,omega.cc,
327 query.cc: Add support for setting the query expansion scheme to use.
329 Tue Jan 28 23:50:11 GMT 2014 Olly Betts <olly@survex.com>
331 * NEWS,configure.ac: Update for 1.2.17.
333 Sun Jan 05 22:11:39 GMT 2014 Olly Betts <olly@survex.com>
337 Thu Dec 26 01:21:42 GMT 2013 Olly Betts <olly@survex.com>
339 * Backport change from trunk:
340 * Makefile.am,portability/mkdtemp.cc,portability/mkdtemp.h,omindex.cc:
341 Add header with prototype of mkdtemp() to avoid "no previous
342 declaration" warning on platforms which don't have mkdtemp() as
345 Fri Dec 20 08:00:25 GMT 2013 Olly Betts <olly@survex.com>
349 Fri Dec 20 07:04:32 GMT 2013 Olly Betts <olly@survex.com>
351 * Backport change from trunk:
352 * docs/overview.rst: Add Abiword as an example use of --filter, based
353 on patch from Frank J Bruzzaniti (fixes#383). Update unoconv
354 example to talk about LibreOffice instead of OpenOffice.
356 Tue Dec 03 23:13:48 GMT 2013 Olly Betts <olly@survex.com>
358 * NEWS: Final update for 1.2.16.
360 Tue Dec 03 07:33:40 GMT 2013 Olly Betts <olly@survex.com>
362 * Backport change from trunk:
363 * configure.ac: Define __MSVCRT_VERSION__ to 0x0601 on mingw so we get
364 __ftime64() defined in the headers.
366 Sat Nov 30 20:36:36 GMT 2013 Olly Betts <olly@survex.com>
368 * Makefile.am: Need to ship common/safewinsock2.h now.
370 Fri Nov 29 04:34:18 GMT 2013 Olly Betts <olly@survex.com>
372 * Backport change from trunk:
373 * configure.ac: Sync GCC checks with xapian-core.
375 Fri Nov 29 01:09:01 GMT 2013 Olly Betts <olly@survex.com>
377 * configure.ac: Update for 1.2.16.
379 Thu Oct 31 12:33:31 GMT 2013 Olly Betts <olly@survex.com>
383 Sun Oct 13 23:25:11 GMT 2013 Olly Betts <olly@survex.com>
385 * Backport change from trunk:
386 * omindex.cc: Group-readable files which are owner-readable but not
387 world-readable should still get a "readable by owner" term added.
388 Reported by Emmanuel Garette.
390 Tue Oct 08 01:06:45 GMT 2013 Olly Betts <olly@survex.com>
392 * NEWS: Update from ChangeLog.
394 Sun Oct 06 08:17:57 GMT 2013 Olly Betts <olly@survex.com>
396 * Backport change from trunk:
397 * configure.ac: Sync compiler warning flag machinery against
398 xapian-core. The changes are special handling for clang, passing
399 -fshow-column where supported, and handling for new warning flags
402 Sun Oct 06 08:13:56 GMT 2013 Olly Betts <olly@survex.com>
404 * Backport changes from trunk:
405 * xlsxparse.cc: Handle pre-defined numfmtid codes for dates.
406 * omindex.cc,xlsxparse.cc,xlsxparse.h: Fix detection of cells with a
407 date format to work with xlsx files other than my first example.
408 * omindex.cc,xlsxparse.cc,xlsxparse.h: Decode dates for xlsx files.
410 Sun Oct 06 08:01:59 GMT 2013 Olly Betts <olly@survex.com>
412 * Backport change from trunk:
413 * omindex.cc,docs/overview.rst: Ignore 'adm', 'cur', and 'ico' by
416 Sat Oct 05 06:46:32 GMT 2013 Olly Betts <olly@survex.com>
418 * Backport change from trunk:
419 * weight.cc: Use "" not <> to include local header weight.h.
421 Sat Oct 05 06:44:23 GMT 2013 Olly Betts <olly@survex.com>
423 * Backport change from trunk:
424 * configure.ac,xapian-omega.spec.in: Compress source tarballs with xz
427 Sat Oct 05 06:40:57 GMT 2013 Olly Betts <olly@survex.com>
429 * Backport change from trunk:
430 * diritor.h: Mark DirectoryIterator ctor as 'explicit'.
432 Sat Oct 05 06:37:08 GMT 2013 Olly Betts <olly@survex.com>
434 * Backport change from trunk:
435 * datematchdecider.h: Fix filename in comment at top of file.
437 Sat Oct 05 06:35:19 GMT 2013 Olly Betts <olly@survex.com>
439 * Backport change from trunk:
440 * omindex.cc: Fix off-by-one when finding documents to delete which
441 would sometimes cause omindex to fail to delete documents from the
442 database when they weren't refound during an index update.
444 Sat Oct 05 06:28:56 GMT 2013 Olly Betts <olly@survex.com>
446 * Backport change from trunk:
447 * Makefile.am,README,docs/Makefile.am: SVN -> git.
449 Tue Apr 16 23:41:20 GMT 2013 Olly Betts <olly@survex.com>
451 * NEWS,configure.ac: Update for 1.2.15.
453 Tue Apr 16 10:43:52 GMT 2013 Olly Betts <olly@survex.com>
455 * NEWS: Update from ChangeLog.
457 Wed Mar 27 09:39:12 GMT 2013 Olly Betts <olly@survex.com>
459 * Backport change from trunk:
460 * Makefile.am: Don't link utf8convert.cc code into omega CGI.
462 Thu Mar 14 22:43:50 GMT 2013 Olly Betts <olly@survex.com>
464 * NEWS,configure.ac: Update for 1.2.14.
466 Thu Mar 14 12:34:29 GMT 2013 Olly Betts <olly@survex.com>
468 * Backport changes from trunk:
469 * docs/overview.rst: Update to add com to the list of ignored
471 * omindex.cc: Ignore .com files by default.
473 Thu Mar 07 09:41:25 GMT 2013 Olly Betts <olly@survex.com>
475 * NEWS: Update from ChangeLog.
477 Sat Feb 16 08:20:36 GMT 2013 Olly Betts <olly@survex.com>
479 * Backport change from trunk:
480 * xlsxparse.cc: Correct "max" -> "min" when reserving space for shared
481 strings. This only means we now reserve a more appropriate amount
482 of space to start with.
484 Wed Jan 09 11:15:03 GMT 2013 Olly Betts <olly@survex.com>
486 * NEWS,configure.ac: Update for 1.2.13.
488 Wed Jan 09 10:10:18 GMT 2013 Olly Betts <olly@survex.com>
490 * NEWS: Format new entries.
492 Tue Jan 08 11:09:38 GMT 2013 Olly Betts <olly@survex.com>
494 * Backport change from trunk:
495 * commonhelp.cc: Tweak wording about default to match other options
498 Tue Jan 08 10:58:58 GMT 2013 Olly Betts <olly@survex.com>
500 * Backport change from trunk:
501 * omindex.cc: Note default size limit on files to index is unlimited.
502 Update --help to reflect that --sample-size now accepts the same
503 formats as --max-size).
505 Tue Jan 08 10:41:27 GMT 2013 Olly Betts <olly@survex.com>
507 * Backport change from trunk:
508 * omindex.cc: When generating a sample for a CSV file, limit the
509 reserved size to the CSV file size as sample_size could be set
510 really high by the user.
512 Tue Jan 08 10:31:13 GMT 2013 Olly Betts <olly@survex.com>
514 * Backport change from trunk:
515 * diritor.cc: MAGIC_MIME_TYPE was added in 4.22, so note that in the
516 comment about its conditional use.
518 Tue Jan 08 10:28:07 GMT 2013 Olly Betts <olly@survex.com>
520 * Backport change from trunk:
521 * Makefile.am: In automake, INCLUDES is now deprecated in favour of
522 AM_CPPFLAGS so update to use the latter.
524 Tue Jan 08 10:14:47 GMT 2013 Olly Betts <olly@survex.com>
526 * Backport change from trunk:
527 * md5wrap.cc: If we get a read error while calculating the md5 checksum
528 of a file, fail rather than returning the checksum of the file up to
531 Tue Jan 08 10:13:36 GMT 2013 Olly Betts <olly@survex.com>
533 * Backport change from trunk:
534 * omindex.cc: Calculate the md5 from the loaded file contents when
535 indexing SVG and Atom files. Use a const ref to avoid a string
536 copy of the file contents for HTML and uncompressed ABI word.
538 Wed Nov 21 05:02:18 GMT 2012 Olly Betts <olly@survex.com>
540 * NEWS: Update from ChangeLog.
542 Wed Nov 21 05:00:51 GMT 2012 Olly Betts <olly@survex.com>
544 * Backport change from trunk:
545 * omindex.cc: Fix omindex not to segfault when -F option without a ':'
548 Sat Nov 10 08:08:20 GMT 2012 Olly Betts <olly@survex.com>
550 * NEWS: Update from ChangeLog.
552 Sat Oct 13 12:24:59 GMT 2012 Olly Betts <olly@survex.com>
554 * Backport change from trunk:
555 * Makefile.am,omindex.cc: Replace shell_protect() with
556 append_filename_argument() from common/append_filename_arg.h.
557 Extracting text using external filters now works for filenames
558 containing a newline character.
559 * common/: Update svn:externals to latest version of xapian-core
560 on 1.2 branch to get common/append_filename_arg.h.
562 Wed Jul 18 10:51:39 GMT 2012 Olly Betts <olly@survex.com>
564 * Backport change from trunk:
565 * urldecode.h: Fix to decode escaped character at the end of the
567 * urlenctest.cc: Add regression testcase.
569 Wed Jun 27 12:15:37 GMT 2012 Olly Betts <olly@survex.com>
571 * NEWS,configure.ac: Update for 1.2.12.
573 Tue Jun 26 12:06:15 GMT 2012 Olly Betts <olly@survex.com>
575 * NEWS,configure.ac: Update for 1.2.11.
577 Tue Jun 26 12:04:16 GMT 2012 Olly Betts <olly@survex.com>
579 * Backport change from trunk:
580 * configure.ac: Set link_all_deplibs_CXX=no on solaris, like we
581 already do for xapian-core.
583 Tue Jun 26 08:48:58 GMT 2012 Olly Betts <olly@survex.com>
585 * NEWS: Update from ChangeLog.
587 Fri Jun 22 12:16:30 GMT 2012 Olly Betts <olly@survex.com>
589 * Backport changes from trunk:
590 * xlsxparse.cc: Check for "uniquecount" parameter, not "unqiueCount" as
591 we normalise parameter names to lower case.
592 * omindex.cc: unzip extracts files in the order they are in the
593 archive, not the order they are on the command line, so call unzip
594 twice when the order of extraction matters.
595 * Makefile.am,omindex.cc,opendocparse.cc,opendocparse.h,xmlparse.cc:
596 Improve handling of headers and footers on OpenDocument documents.
598 Fri Jun 22 12:12:56 GMT 2012 Olly Betts <olly@survex.com>
600 * Backport change from trunk:
601 * omindex.cc: Tweak previous change.
603 Fri Jun 22 12:10:34 GMT 2012 Olly Betts <olly@survex.com>
605 * Backport changes from trunk:
606 * omindex.cc,xlsxparse.cc,xlsxparse.h: Rework .xlsx parsing to
607 substitute the shared strings into the positions they are used
608 in, so that the sample actually matches what appears in the
610 * xlsxparse.cc,xlsxparse.h: Subclass XlsxParser directly from
612 * Makefile.am,omindex.cc,xlsxparse.cc,xlsxparse.h: Index calculated
613 numbers from .xlsx files.
615 Fri Jun 22 12:03:06 GMT 2012 Olly Betts <olly@survex.com>
617 * Backport change from trunk:
618 * omindex.cc: pdftotext outputs a formfeed between each page, which
619 messes up our "empty body" check, so trim any trailing formfeeds
621 * omindex.cc: Fix the "trim trailing formfeeds" code not to remove one
623 * omindex.cc: Properly fix the "trim trailing formfeeds" code.
625 Sat Jun 09 11:50:16 GMT 2012 Olly Betts <olly@survex.com>
627 * Backport change from trunk:
628 * Cherry pick changes from Mihai Bivol's GSoC snippets branch:
629 * omindex.cc: Add option for the document sample size.
630 * omindex.cc: Add short option for sample-size
631 * omindex.cc: Make sample-size consistent with max-size
633 Fri Jun 08 12:48:47 GMT 2012 Olly Betts <olly@survex.com>
635 * NEWS: Update from ChangeLog.
637 Mon May 21 07:28:26 GMT 2012 Olly Betts <olly@survex.com>
639 * Backport change from trunk:
640 * htmlparsetest.cc,myhtmlparse.cc,myhtmlparse.h: Change parsing of
641 multiple <body> tags and text outside of <body> to match the
642 behaviour if modern web browsers. (ticket#599)
644 Mon May 21 06:34:43 GMT 2012 Olly Betts <olly@survex.com>
646 * Backport change from trunk:
647 * configure.ac: Set link_all_deplibs_CXX=no on freebsd and openbsd,
648 like we already do for xapian-core.
650 Wed May 09 10:43:41 GMT 2012 Olly Betts <olly@survex.com>
652 * NEWS,configure.ac: Update for 1.2.10.
654 Wed May 09 08:19:21 GMT 2012 Olly Betts <olly@survex.com>
656 * NEWS: Update from ChangeLog.
658 Wed May 09 01:20:25 GMT 2012 Olly Betts <olly@survex.com>
660 * Backport change from trunk:
661 * runfilter.cc: Add cast to rlim_t, required for C++11 compatibility
662 according to new error from GCC 4.7 (reported by Gaurav Arora).
664 Tue May 08 05:23:11 GMT 2012 Olly Betts <olly@survex.com>
666 * Backport changes from trunk:
667 * atomparse.cc: For type="html", use the charset of the XML rather
669 * Makefile.am,atomparse.cc,atomparse.h,overview.rst,omindex.cc: Add
670 support for atom feed files, patch from Mihai Bivol in ticket#595.
671 * Makefile.am,atomparsetest.cc: Add tests for AtomParser.
673 Tue May 08 04:58:57 GMT 2012 Olly Betts <olly@survex.com>
675 * Backport change from trunk:
676 * htmlparse.cc,htmlparsetest.cc: Add support for CDATA to HTML parser.
678 Tue May 08 04:48:36 GMT 2012 Olly Betts <olly@survex.com>
680 * Backport change from trunk:
681 * omindex.cc: Add --max-size option, based on patch from ndaley in
684 Wed Mar 21 11:21:08 GMT 2012 Olly Betts <olly@survex.com>
686 * NEWS: Update from ChangeLog.
688 Tue Mar 13 10:16:19 GMT 2012 Olly Betts <olly@survex.com>
690 * Backport change from trunk:
691 * omindex.cc: If the document with the highest existing docid was
692 updated, we'd previously report it as "added", but now we correctly
693 report it as "updated".
695 Tue Mar 13 10:12:10 GMT 2012 Olly Betts <olly@survex.com>
697 * Backport change from trunk:
698 * omindex.cc: Catch and report std::exception.
700 Thu Mar 08 02:08:53 GMT 2012 Olly Betts <olly@survex.com>
702 * NEWS,configure.ac: Update for 1.2.9.
704 Tue Mar 06 11:13:03 GMT 2012 Olly Betts <olly@survex.com>
706 * Backport changes from trunk:
707 * docs/overview.rst,omindex.cc: More extensions to ignore by default:
709 * docs/overview.rst: Wrap over-long line.
710 * docs/overview.rst,omindex.cc: Add more extensions to the default
711 ignore list: bin dat db jar lnk pyc pyo sqlite sqlite3 sqlite-journal
714 Tue Mar 06 11:10:59 GMT 2012 Olly Betts <olly@survex.com>
716 * Backport change from trunk:
717 * docs/overview.rst,htmlparse.cc,htmlparsetest.cc: Add support for
718 ignoring sections bracketed by <!--UdmComment--> and
719 <!--/UdmComment--> like we already do for <!--htdig_noindex-->.
720 Patch from Raphael Geissert.
722 Wed Feb 29 10:58:43 GMT 2012 Olly Betts <olly@survex.com>
724 * NEWS: Update from ChangeLog.
726 Mon Jan 23 12:48:00 GMT 2012 Olly Betts <olly@survex.com>
728 * Backport change from trunk:
729 * docs/overview.rst: Document that libmagic is used to determine
730 the MIME type if the extension isn't known. Partly addresses
733 Mon Jan 23 12:44:36 GMT 2012 Olly Betts <olly@survex.com>
735 * Backport change from trunk:
736 * docs/overview.rst: We now limit time as well as CPU and memory for
739 Tue Dec 13 12:42:10 GMT 2011 Olly Betts <olly@survex.com>
741 * NEWS: Final update for 1.2.8.
743 Tue Dec 13 12:38:04 GMT 2011 Olly Betts <olly@survex.com>
745 * Backport change from trunk:
746 * docs/omegascript.rst,query.cc,templates/emptydocs,templates/godmode,
747 templates/query,urldecode.h,urlenctest.cc: Add new $prettyurl{}
748 command which undoes RFC3986 URL escaping which doesn't affect
749 semantics in practice. Partly addresses ticket#550.
751 Tue Dec 13 11:19:56 GMT 2011 Olly Betts <olly@survex.com>
753 * NEWS,configure.ac: Update for 1.2.8.
755 Thu Dec 08 08:27:28 GMT 2011 Olly Betts <olly@survex.com>
757 * Backport change from trunk:
758 * omindex.cc: Improve --help output (and man page which is generated
759 from it). Closes bug#572.
761 Thu Dec 08 04:53:40 GMT 2011 Olly Betts <olly@survex.com>
763 * Backport change from trunk:
764 * Makefile.am: Ship new header urldecode.h.
766 Thu Dec 08 04:46:18 GMT 2011 Olly Betts <olly@survex.com>
768 * Backport change from trunk:
769 * Makefile.am,cgiparam.cc,urldecode.h,urlenctest.cc: Add new
770 implementation of URL decoding - the old one didn't handle
771 various corner cases well, and had two cut and pasted variants
772 for handling a input from a C string (GET) or from stdin (POST).
773 Also add a new unit test program to test URL encoding and decoding.
776 Tue Dec 06 13:30:58 GMT 2011 Olly Betts <olly@survex.com>
778 * Backport change from trunk:
779 * scriptindex.cc: If no rules are found in the index script, report an
780 error and give up - this is inevitably the result of a mistake, and
781 adding empty documents to the database isn't helpful.
783 Tue Dec 06 13:28:32 GMT 2011 Olly Betts <olly@survex.com>
785 * Backport change from trunk:
786 * docs/omegascript.rst: Add note to discourage use of percentage
788 * templates/query: Don't show the percentage score in the default
791 Tue Dec 06 13:26:30 GMT 2011 Olly Betts <olly@survex.com>
793 * Backport change from trunk:
794 * configure.ac,runfilter.cc: If we don't get any data from a filter
795 for 5 minutes, give up - it has probably ended up blocked
798 Tue Dec 06 12:51:24 GMT 2011 Olly Betts <olly@survex.com>
800 * Backport changes from trunk:
801 * templates/query: HTML escape topterms.
802 * templates/godmode: HTML escape the contents of document values.
804 Tue Dec 06 12:48:24 GMT 2011 Olly Betts <olly@survex.com>
806 * Backport change from trunk:
807 * scriptindex.cc: MyHtmlParser::parse_html() no longer throws bool to
808 stop parsing early, so we no longer need to catch it.
810 Tue Dec 06 12:45:48 GMT 2011 Olly Betts <olly@survex.com>
812 * Backport change from trunk:
813 * configure.ac: Sync changes from xapian-core: Don't pass -Wshadow for
814 GCC < 4.1; don't pass -Wstrict-null-sentinel for GCC 4.0.x; only
815 enable symbol visibility on platforms where it is supported; remove
816 now superfluous check for GCC >= 3. Also, add FIXME for enabling
817 -Woverloaded-virtual.
819 Tue Dec 06 11:57:58 GMT 2011 Olly Betts <olly@survex.com>
821 * Backport change from trunk:
822 * scriptindex.cc: Add link to
823 http://xapian.org/docs/omega/scriptindex.html to --help output (and
824 so also to the man page which is generated from this).
826 Tue Dec 06 11:30:42 GMT 2011 Olly Betts <olly@survex.com>
828 * Backport change from trunk:
829 * omega.cc: If P had trailing spaces, we would remove all but one -
830 fixed to remove all of them!
832 Thu Sep 01 12:31:59 GMT 2011 Olly Betts <olly@survex.com>
834 * Backport change from trunk:
835 * xapian-omega.spec.in: Package outlookmsg2html helper.
837 Wed Aug 10 06:13:41 GMT 2011 Olly Betts <olly@survex.com>
839 * NEWS: Finalise 1.2.7.
841 Tue Aug 09 09:00:12 GMT 2011 Olly Betts <olly@survex.com>
843 * NEWS,configure.ac: Initial update for 1.2.7.
845 Thu Jun 23 01:01:25 GMT 2011 Olly Betts <olly@survex.com>
847 * Backport change from trunk:
848 * query.cc: Improve $version output from "Xapian - xapian-omega 1.2.6"
849 to "xapian-omega 1.2.6".
850 * docs/omegascript.rst: Update example to match (and use less ancient
853 Thu Jun 23 00:38:18 GMT 2011 Olly Betts <olly@survex.com>
855 * Backport change from trunk:
856 * dbi2omega: Remove uninteresting reference to 0.9.4.
858 Thu Jun 23 00:29:28 GMT 2011 Olly Betts <olly@survex.com>
860 * Backport change from trunk:
861 * hashterm.cc: Avoid unnecessary temporary string object.
863 Thu Jun 23 00:26:13 GMT 2011 Olly Betts <olly@survex.com>
865 * Backport change from trunk:
866 * hashterm.cc: Fix comment typo.
868 Wed Jun 22 16:59:05 GMT 2011 Olly Betts <olly@survex.com>
870 * Backport change from trunk:
871 * xapian-omega.spec.in: We're ABI compatible within a release series
872 so make dependency on xapian-core-libs >= rather than =.
874 Wed Jun 22 16:55:10 GMT 2011 Olly Betts <olly@survex.com>
876 * Backport change from trunk:
877 * scriptindex.cc: Avoid unnecessary temporary string object.
879 Mon Jun 13 08:00:16 GMT 2011 Olly Betts <olly@survex.com>
881 * Backport change from trunk:
882 * docs/termprefixes.rst: Expand to document mapping a user prefix to
883 multiple term prefixes.
885 Mon Jun 13 03:23:47 GMT 2011 Olly Betts <olly@survex.com>
887 * docs/overview.rst: Improve documentation of htdig_noindex.
889 Sun Jun 12 11:52:29 GMT 2011 Olly Betts <olly@survex.com>
891 * NEWS: Final update for 1.2.6.
893 Fri Jun 10 12:02:32 GMT 2011 Olly Betts <olly@survex.com>
895 * NEWS,configure.ac: Update in preparation for 1.2.6.
897 Fri Jun 10 03:28:33 GMT 2011 Olly Betts <olly@survex.com>
899 * templates/inc/anyallexactradio: Remove unused duplicate of
902 Fri Jun 10 03:21:25 GMT 2011 Olly Betts <olly@survex.com>
904 * configure.ac,omindex-config.cc,omindex-config.html: Strip out partly
905 written and long untouched omindex-config utility.
907 Thu Jun 09 14:20:46 GMT 2011 Olly Betts <olly@survex.com>
909 * weight.cc: Fix a compiler warning (I failed to note the compiler
912 Sun May 29 13:00:26 GMT 2011 Olly Betts <olly@survex.com>
914 * templates/query: Make search query input type=search.
916 Sun May 29 12:24:43 GMT 2011 Olly Betts <olly@survex.com>
918 * templates/query: Autofocus the search query input (using HTML
919 autofocus attribute with Javascript fallback for older browsers).
922 Wed May 25 14:33:18 GMT 2011 Olly Betts <olly@survex.com>
924 * docs/omegascript.rst: Correct the documentation of the colours used by
927 Fri May 13 05:50:35 GMT 2011 Olly Betts <olly@survex.com>
929 * docs/overview.rst: Add using unoconv as more complex example of
930 using --filter (ticket#324).
932 Wed Apr 20 07:00:56 GMT 2011 Olly Betts <olly@survex.com>
934 * NEWS: Fix typo; clarify wording.
936 Mon Apr 04 13:58:06 GMT 2011 Olly Betts <olly@survex.com>
938 * NEWS: Update release date.
940 Mon Apr 04 13:53:34 GMT 2011 Olly Betts <olly@survex.com>
942 * templates/xml: Fix syntax error from recent edit.
944 Sun Apr 03 10:54:04 GMT 2011 Olly Betts <olly@survex.com>
946 * NEWS,configure.ac: Update for 1.2.5.
948 Sat Apr 02 14:15:32 GMT 2011 Olly Betts <olly@survex.com>
950 * templates/query: Use $add{$field{modtime}} to ensure it is numeric.
952 Sat Apr 02 14:14:06 GMT 2011 Olly Betts <olly@survex.com>
954 * templates/godmode: More missing escaping.
956 Sat Apr 02 14:07:45 GMT 2011 Olly Betts <olly@survex.com>
958 * templates/xml: Remove double escaping.
960 Sat Apr 02 13:58:44 GMT 2011 Olly Betts <olly@survex.com>
962 * templates/query: More escaping fixes.
964 Sat Apr 02 13:55:03 GMT 2011 Olly Betts <olly@survex.com>
966 * templates/emptydocs,templates/opensearch,templates/xml: More missing
969 Sat Apr 02 12:34:42 GMT 2011 Olly Betts <olly@survex.com>
971 * templates/query: Add missing escaping.
973 Sat Apr 02 11:48:43 GMT 2011 Olly Betts <olly@survex.com>
975 * templates/godmode: Add missing escaping.
977 Sat Apr 02 10:34:58 GMT 2011 Olly Betts <olly@survex.com>
979 * templates/xml: Remove support for undocumented HILITECLASS CGI
980 variable. There's no evidence I can find using Google code search
981 or web search that this has been used anywhere, and it's problematic
984 Sat Mar 26 14:51:36 GMT 2011 Olly Betts <olly@survex.com>
986 * INSTALL: Copy new Multi-Arch section from xapian-core/INSTALL.
987 Replace VPATH section with better equivalent from
990 Wed Mar 23 15:21:41 GMT 2011 Olly Betts <olly@survex.com>
992 * htmlparse.cc,htmlparse.h,htmlparsetest.cc,metaxmlparse.cc,
993 metaxmlparse.h,myhtmlparse.cc,myhtmlparse.h,omindex.cc,svgparse.cc,
994 svgparse.h,xmlparse.cc,xmlparse.h,xpsxmlparse.cc,xpsxmlparse.h:
995 Instead of throwing a bool to abandon parsing, change methods to
996 return bool to signify if they want to continue parsing or not.
997 This is a bit faster (~0.23% for indexing a lot of HTML files).
999 Mon Mar 21 05:48:08 GMT 2011 Olly Betts <olly@survex.com>
1001 * myhtmlparse.cc,myhtmlparse.h,omindex.cc: Add --ignore-exclusions
1002 option, which will index HTML files despite meta robots tags, etc -
1003 omindex is often used in environments where such exclusions aren't
1006 Fri Mar 18 10:24:58 GMT 2011 Olly Betts <olly@survex.com>
1008 * omindex.cc: Just report the mimetype as unknown instead of saying
1009 "unknown Office 2007 MIME subtype".
1011 Fri Mar 18 05:53:21 GMT 2011 Olly Betts <olly@survex.com>
1013 * diritor.h: Avoid using S_IRUSR, etc under __WIN32__.
1015 Fri Mar 18 03:00:16 GMT 2011 Olly Betts <olly@survex.com>
1017 * docs/overview.rst,omindex.cc: Ignore *.css and *.js by default too.
1019 Thu Mar 17 23:34:07 GMT 2011 Olly Betts <olly@survex.com>
1021 * omindex.cc: For skip messages which are only to be shown in verbose
1022 mode, call skip with new SKIP_VERBOSE_ONLY flag. Pass new
1023 SKIP_SHOW_FILENAME flag for skip messages shown before we say what
1024 file we are indexing so we know to show the filename even in verbose
1027 Thu Mar 17 03:47:54 GMT 2011 Olly Betts <olly@survex.com>
1029 * omindex.cc: Restore handling of exceptions from
1030 DirectoryIterator::get_type(), and handle exceptions from
1031 DirectoryIterator::next() which ended up at the top level
1032 before (though they probably never happen, at least on Linux).
1034 Wed Mar 16 06:19:01 GMT 2011 Olly Betts <olly@survex.com>
1036 * omindex.cc: Push all the code associated with indexing a file into
1039 Wed Mar 16 02:55:53 GMT 2011 Olly Betts <olly@survex.com>
1041 * omindex.cc: Push try block around index_file() call into the
1044 Wed Mar 16 02:51:52 GMT 2011 Olly Betts <olly@survex.com>
1046 * omindex.cc: Factor out handling for skipping files, and improve
1047 these messages by consistently reporting the filename.
1049 Tue Mar 15 12:47:12 GMT 2011 Olly Betts <olly@survex.com>
1051 * docs/Makefile.am,docs/index.rst: Add index page which links to all
1052 the other documentation pages.
1054 Tue Mar 15 12:20:30 GMT 2011 Olly Betts <olly@survex.com>
1056 * omindex.cc: Add --empty-docs option to allow documents we extract
1057 no body text from to be indexed (existing behaviour), skipped, or
1058 reported and then indexed.
1060 Fri Mar 04 14:13:47 GMT 2011 Olly Betts <olly@survex.com>
1062 * docs/omegascript.rst: Minor improvements.
1064 Wed Mar 02 11:17:42 GMT 2011 Olly Betts <olly@survex.com>
1068 Wed Mar 02 06:14:41 GMT 2011 Olly Betts <olly@survex.com>
1070 * docs/termprefixes.rst: New standard prefix E for filename extension.
1071 * omindex.cc: Index file extension as E-prefixed term.
1073 Mon Feb 28 13:45:32 GMT 2011 Olly Betts <olly@survex.com>
1075 * omindex.cc: Tell xls2csv not to quote fields and to put spaces
1076 not commas between them. Fixes indexing of numeric fields, and
1077 means we don't need to use our CSV parser to get a sample.
1079 Mon Feb 28 12:10:53 GMT 2011 Olly Betts <olly@survex.com>
1081 * xmlparse.cc: Add whitespace between chunks of text extracted from
1082 Microsoft Office 2007 formats.
1084 Wed Feb 23 12:34:28 GMT 2011 Olly Betts <olly@survex.com>
1086 * templates/xml: Try $field{caption} (which is what omindex sets)
1087 before $field{title} when getting a value for the hit tag's title
1088 attribute - this is consistent with how the query template gets the
1089 title. Add new type attribute which gives $field{type}.
1091 Thu Feb 17 05:19:28 GMT 2011 Olly Betts <olly@survex.com>
1093 * templates/xml: Add DBSize attribute to <result> element.
1095 Wed Feb 16 03:19:57 GMT 2011 Olly Betts <olly@survex.com>
1097 * Makefile.am,omindex.cc,query.cc,urlencode.cc,urlencode.h: Update
1098 URL encoding to follow RFC3986.
1100 Tue Feb 15 03:20:40 GMT 2011 Olly Betts <olly@survex.com>
1102 * omindex.cc: Encode reserved characters in URLs - now links to
1103 files with names containing '#' and '?' will work.
1105 Sun Jan 23 13:27:48 GMT 2011 Olly Betts <olly@survex.com>
1107 * docs/overview.rst,omindex.cc: Later Microsoft Works version produce
1108 .xlr spreadsheet files, which are apparently XL files with a
1109 different extension, so handle them as XL files.
1111 Thu Jan 20 11:07:46 GMT 2011 Olly Betts <olly@survex.com>
1113 * docs/omegascript.rst,omega.cc,query.cc,templates/query: Allow
1114 QueryParser flags to be set from OmegaScript (ticket#418).
1116 Sat Jan 15 11:14:32 GMT 2011 Olly Betts <olly@survex.com>
1118 * NEWS: Update from ChangeLog, 1.0.22 and 1.0.23.
1120 Wed Jan 12 02:21:59 GMT 2011 Olly Betts <olly@survex.com>
1122 * query.cc: Fix double Content-Type header in some error reporting
1123 situations (regression introduced in 1.2.4).
1125 Mon Jan 10 10:00:00 GMT 2011 Olly Betts <olly@survex.com>
1127 * omindex.cc,pkglibbindir.cc,pkglibbindir.h: Fix typo in function name
1128 (get_pkglibdindir() -> get_pkglibbindir()).
1130 Mon Jan 10 09:50:38 GMT 2011 Olly Betts <olly@survex.com>
1132 * diritor.cc,diritor.h: Don't define or try to set euid member of
1133 DirectoryIterator on platforms where we aren't going to use it.
1135 Mon Jan 10 09:15:24 GMT 2011 Olly Betts <olly@survex.com>
1137 * diritor.h: Stub out get_owner() and get_group() for __WIN32__.
1139 Fri Dec 24 10:35:29 GMT 2010 Olly Betts <olly@survex.com>
1141 * NEWS: Update from ChangeLog.
1143 Thu Dec 23 01:53:06 GMT 2010 Olly Betts <olly@survex.com>
1145 * diritor.cc: Fix to work with older libmagic which doesn't have
1146 MAGIC_MIME_TYPE (e.g. on Ubuntu hardy).
1148 Sun Dec 19 12:39:23 GMT 2010 Olly Betts <olly@survex.com>
1150 * NEWS,configure.ac: 1.2.4.
1152 Sun Dec 19 12:37:58 GMT 2010 Olly Betts <olly@survex.com>
1154 * query.cc: Disable permission filtering based on $REMOTE_USER as that
1155 will break some existing installations if users upgrade, which we
1156 don't want. Probably this should be specifiable from OmegaScript
1157 but it's not worth delaying 1.2.4 while we sort this out.
1159 Sun Dec 19 02:46:17 GMT 2010 Olly Betts <olly@survex.com>
1161 * docs/overview.rst,omindex.cc: Change the new name for
1162 "--preserve-unupdated" from "--preserve-removed" to "--no-delete".
1164 Sun Dec 19 02:32:29 GMT 2010 Olly Betts <olly@survex.com>
1166 * query.cc: Fix comment typo.
1168 Fri Dec 17 12:45:47 GMT 2010 Olly Betts <olly@survex.com>
1170 * commonhelp.cc,commonhelp.h,omindex.cc,scriptindex.cc: Swap the
1171 meanings of -v and -V in omindex for consistency with scriptindex
1172 and typical short options for --verbose and --version in other
1173 packages. For backward compatibility, "omindex -v" is handled
1174 specially and still reports the version.
1176 Fri Dec 17 08:31:29 GMT 2010 Olly Betts <olly@survex.com>
1178 * utf8convert.cc: Fix built in converter to handle space in charset
1179 names, which fixes failing utf8converttest when iconv isn't
1182 Fri Dec 17 05:36:36 GMT 2010 Olly Betts <olly@survex.com>
1184 * utf8convert.cc: Rework the fixing up of charset names which iconv()
1185 doesn't understand a little.
1187 Thu Dec 16 06:35:46 GMT 2010 Olly Betts <olly@survex.com>
1189 * loadfile.cc: If fstat() fails, preserve the errno value rather than
1190 letting close() clobber it.
1192 Thu Dec 16 06:31:30 GMT 2010 Olly Betts <olly@survex.com>
1194 * loadfile.cc: Fix file descriptor leak if load_file() is called on
1195 something which isn't a file (found by cppcheck run on the Debian
1196 archive). This case probably couldn't occur in omindex, but could if
1197 you used the LOADFILE action in scriptindex.
1199 Thu Dec 09 10:58:48 GMT 2010 Olly Betts <olly@survex.com>
1201 * docs/omegascript.rst: Replace $simplecommand with $query - a concrete
1202 example is more useful. Improve mark-up.
1203 * docs/termprefixes.rst: Remove mention of pre-0.9.7 use of W prefix.
1205 Thu Nov 18 12:25:50 GMT 2010 Olly Betts <olly@survex.com>
1207 * omega.cc: Fix reversed condition in recent exception reporting fix.
1209 Wed Nov 17 03:46:24 GMT 2010 Olly Betts <olly@survex.com>
1211 * diritor.cc: Add missing magic_cookie argument to calls to
1214 Sat Nov 13 12:17:51 GMT 2010 Olly Betts <olly@survex.com>
1216 * omindex.cc: Build up document data with += for efficiency.
1218 Sat Nov 13 12:08:09 GMT 2010 Olly Betts <olly@survex.com>
1220 * omindex.cc: Index author with A prefix.
1222 Sat Nov 13 12:00:50 GMT 2010 Olly Betts <olly@survex.com>
1224 * omindex.cc: A file extension can't contain a '/'.
1226 Sat Nov 13 11:50:31 GMT 2010 Olly Betts <olly@survex.com>
1228 * omindex.cc: Index the leafname of the file (without any extension) as
1229 if it contained additional keywords.
1231 Sat Nov 13 11:32:09 GMT 2010 Olly Betts <olly@survex.com>
1233 * omindex.cc: If a filter command isn't installed, flag this in the
1234 commands map so we don't try running this command again for any
1235 file with the same mimetype (previously we'd rerun it for a different
1236 extension which gave the same mimetype).
1238 Fri Nov 12 09:11:35 GMT 2010 Olly Betts <olly@survex.com>
1240 * Makefile.am,configure.ac: Add -no-undefined to AM_LDFLAGS on
1241 platforms which need it to dynamically link such as cygwin (need
1242 to do this taken from ticket#282).
1244 Fri Nov 12 03:35:56 GMT 2010 Olly Betts <olly@survex.com>
1246 * omindex.cc: Report MIME type if it's unknown to us. Remove debug
1247 output line. Update comments.
1249 Fri Nov 12 03:32:27 GMT 2010 Olly Betts <olly@survex.com>
1251 * diritor.cc: Report errors from libmagic.
1253 Fri Nov 12 02:58:20 GMT 2010 Olly Betts <olly@survex.com>
1255 * diritor.cc,diritor.h: Fix to compile when libmagic is detected.
1257 Fri Nov 12 01:40:24 GMT 2010 Olly Betts <olly@survex.com>
1259 * diritor.cc: Add missing class qualifier to method definition.
1261 Fri Nov 12 01:25:11 GMT 2010 Olly Betts <olly@survex.com>
1263 * INSTALL: Mention libmagic in install instructions.
1265 Fri Nov 12 01:16:21 GMT 2010 Olly Betts <olly@survex.com>
1267 * Makefile.am,configure.ac,diritor.cc,diritor.h,omindex.cc: Optionally
1268 use libmagic to detect MIME types for files for which we have no
1269 extension mapping, which allows us to handle files with a misleading
1270 extension, and files with no extension. (ticket#114)
1272 Thu Nov 11 23:23:07 GMT 2010 Olly Betts <olly@survex.com>
1274 * omindex.cc: Refactor slightly to handle the unknown extension case
1275 up front, so we lose an indentation level for the known extension
1278 Thu Nov 11 12:25:03 GMT 2010 Olly Betts <olly@survex.com>
1280 * omindex.cc: Add new --filter option to allow the user to specify
1281 new filters without patching omindex.cc.
1282 * docs/overview.rst: Document --filter.
1284 Thu Nov 11 02:51:55 GMT 2010 Olly Betts <olly@survex.com>
1286 * omindex.cc: Factor out handling for external filter programs which
1287 simply return UTF-8 text on stdout.
1289 Mon Nov 08 10:58:46 GMT 2010 Olly Betts <olly@survex.com>
1291 * omindex.cc,svgparse.cc,svgparse.h: Extract author for SVG files.
1293 Mon Nov 08 10:40:09 GMT 2010 Olly Betts <olly@survex.com>
1295 * omindex.cc: Extract metadata from Microsoft Office 2007 file formats.
1297 Mon Nov 08 10:21:13 GMT 2010 Olly Betts <olly@survex.com>
1299 * myhtmlparse.cc,myhtmlparse.h,omindex.cc: Extract author from HTML
1302 Mon Nov 08 09:46:03 GMT 2010 Olly Betts <olly@survex.com>
1304 * omindex.cc: Escape wildcard patterns being passed to unzip - in the
1305 unlikely event that one of these matched files in or under the
1306 current directory, we might fail to extract all the files we wanted
1309 Mon Nov 08 05:03:41 GMT 2010 Olly Betts <olly@survex.com>
1311 * metaxmlparse.cc,metaxmlparse.h,omindex.cc: Extract author from
1312 OpenDocument documents.
1314 Mon Nov 08 03:18:26 GMT 2010 Olly Betts <olly@survex.com>
1316 * omindex.cc: Extract author from PDF metadata.
1318 Mon Nov 08 03:15:17 GMT 2010 Olly Betts <olly@survex.com>
1320 * metaxmlparse.h: Initialise field member variable.
1322 Mon Nov 08 00:28:07 GMT 2010 Olly Betts <olly@survex.com>
1324 * omindex.cc: Index text in headers and footers for .odt and .docx
1327 Thu Nov 04 11:55:58 GMT 2010 Olly Betts <olly@survex.com>
1329 * omega.cc,omega.h,query.cc: If we catch an error early on, make sure
1330 that if it's appropriate, we write out a "Content-Type:" HTTP header
1331 and end the headers.
1333 Thu Nov 04 11:39:10 GMT 2010 Olly Betts <olly@survex.com>
1335 * utf8converttest.cc: Add back in testcases for charset names with
1338 Thu Nov 04 09:01:43 GMT 2010 Olly Betts <olly@survex.com>
1340 * utils.cc: Fix misuse of BUFSIZE which should be sizeof(buf) (issue
1341 reported by compilation with CPPFLAGS=-D_GLIBCXX_DEBUG).
1343 Thu Nov 04 09:01:08 GMT 2010 Richard Boulton <richard@tartarus.org>
1345 * utf8convert.cc,utf8converttest.cc: If iconv can't handle a
1346 charset, check if it's of the form (UTF|UCS)[_ ]?.* and if so,
1347 convert to the official hypenated form. Should fix failure of
1348 utf8converttest on OSX, where it fails due to iconv not
1351 Tue Nov 02 09:48:19 GMT 2010 Olly Betts <olly@survex.com>
1353 * diritor.cc,diritor.h,loadfile.cc,loadfile.h,md5wrap.cc,md5wrap.h,
1354 omindex.cc,scriptindex.cc: Use O_NOATIME if available and either the
1355 file is owned by the current euid, or the current euid is 0 (i.e.
1356 we're running as root). Fixes ticket#222.
1358 Fri Oct 29 14:26:25 GMT 2010 Olly Betts <olly@survex.com>
1360 * omindex.cc: Use the CSV parser to generate a nicer sample for files
1361 of type application/vnd.ms-excel.
1363 Fri Oct 29 09:26:52 GMT 2010 Olly Betts <olly@survex.com>
1365 * Makefile.am: Put $(PCRE_LIBS) in libtransform_la_LIBADD rather than
1366 omega_LDADD (more correct, but probably doesn't actually make any
1369 Thu Oct 28 14:46:11 GMT 2010 Olly Betts <olly@survex.com>
1371 * omindex.cc: Disable more output unless --verbose is specified. Don't
1372 flush the "Indexing" partial message until we get to the potentially
1373 time consuming actions.
1375 Thu Oct 28 13:54:44 GMT 2010 Olly Betts <olly@survex.com>
1377 * docs/overview.rst: Improve mark-up, and tweak wording in a few
1380 Thu Oct 28 13:46:36 GMT 2010 Olly Betts <olly@survex.com>
1382 * docs/overview.rst: Update docs for --duplicates and
1385 Thu Oct 28 13:27:01 GMT 2010 Olly Betts <olly@survex.com>
1387 * omindex.cc: Deprecated "--preserve-nonduplicates" in favour of new
1388 long option "--preserve-removed" which does the same thing, but has
1389 a (hopefully) clearer name. Rename the variable it controls from
1390 preserve_unupdated to delete_removed_documents (with the opposite
1393 Thu Oct 28 12:08:59 GMT 2010 Olly Betts <olly@survex.com>
1395 * configfile.cc: Only append '/' to directory values if they don't
1396 already have a trailing '/'.
1398 Thu Oct 28 11:49:54 GMT 2010 Olly Betts <olly@survex.com>
1400 * runfilter.cc: Make the memory limit for filter processes the size
1401 of physical memory, not 7/8 of this value, which is a little less
1402 arbitrary (ticket#424).
1404 Thu Oct 28 11:47:38 GMT 2010 Olly Betts <olly@survex.com>
1406 * omindex.cc: Under --duplicate=ignore, fix so that old documents which
1407 aren't seen get deleted, which wasn't implemented before (to suppress
1408 this deletion, pass -p as well).
1410 Thu Oct 28 10:38:21 GMT 2010 Olly Betts <olly@survex.com>
1412 * omindex.cc: Track how many documents in the index we haven't seen
1413 in this index run - if this is 0, we don't need to check for docs
1414 to delete at all; otherwise we can at least use it to know when we
1415 have found them all. Use a PostingIterator over all documents to
1416 avoid having to catch exceptions from delete_document() for gaps
1419 Thu Oct 28 04:52:36 GMT 2010 Olly Betts <olly@survex.com>
1421 * omindex.cc: Add quotes around directory name in "Entering directory"
1422 message. Add directory name to "skipping directory" error message.
1424 Thu Oct 28 04:50:37 GMT 2010 Olly Betts <olly@survex.com>
1426 * omindex.cc: Document --verbose in --help. Actually recognise -V.
1428 Thu Oct 28 04:01:31 GMT 2010 Olly Betts <olly@survex.com>
1430 * omindex.cc: Move the directory iteration loop out of the try/catch
1431 block for starting the iteration, which means it's indented by a
1434 Thu Oct 28 03:47:30 GMT 2010 Olly Betts <olly@survex.com>
1436 * omindex.cc: Add --verbose option, and disable the less interesting
1437 output unless it is specified.
1439 Thu Oct 28 03:34:44 GMT 2010 Olly Betts <olly@survex.com>
1441 * omindex.cc: Eliminate the message "Caught unknown exception in
1442 index_directory, rethrowing" as it isn't actually informative.
1444 Thu Oct 28 01:43:44 GMT 2010 Olly Betts <olly@survex.com>
1446 * omindex.cc: Variable dbpath doesn't need to be global.
1448 Thu Oct 28 01:28:10 GMT 2010 Olly Betts <olly@survex.com>
1450 * omindex.cc: The Host and Path terms are the same for every document
1451 in a single invocation of omindex, so calculate them just once up
1454 Thu Oct 28 01:13:36 GMT 2010 Olly Betts <olly@survex.com>
1456 * omindex.cc: Eliminate the leading slash on filenames in output, so
1457 they are now relative filenames on the system. This also simplifies
1458 path building internally.
1460 Wed Oct 27 09:51:51 GMT 2010 Olly Betts <olly@survex.com>
1462 * omindex.cc: Use rpm's --qf option to produce output which is simpler
1465 Wed Oct 27 09:32:22 GMT 2010 Olly Betts <olly@survex.com>
1467 * docs/overview.rst,omindex.cc: Add support for indexing RPM packages
1470 Wed Oct 27 06:07:59 GMT 2010 Olly Betts <olly@survex.com>
1472 * docs/overview.rst,omindex.cc: Add support for indexing Debian package
1473 files (ticket #493).
1475 Wed Oct 27 05:37:02 GMT 2010 Olly Betts <olly@survex.com>
1477 * docs/overview.rst,omindex.cc: Quietly ignore files with mimetype set
1478 to "ignore". The initial list of extensions set to ignore is:
1479 .a .dll .dylib .exe .lib .o .obj .so
1481 Wed Oct 27 02:25:01 GMT 2010 Olly Betts <olly@survex.com>
1483 * omindex.cc: Report get_description() for Xapian exceptions, which
1484 is provides additional information above get_msg().
1486 Wed Oct 27 01:56:08 GMT 2010 Olly Betts <olly@survex.com>
1488 * omindex.cc,query.cc,values.h: Add file size as a value, and set up a
1489 NumberValueRangeProcessor so size: works in the query (has to be in
1492 Wed Oct 27 01:31:25 GMT 2010 Olly Betts <olly@survex.com>
1494 * scriptindex.cc: Report get_description() for Xapian exceptions, which
1495 is provides additional information above get_msg().
1497 Tue Oct 26 12:00:58 GMT 2010 Olly Betts <olly@survex.com>
1499 * docs/overview.rst: Document the new emptydocs template.
1501 Tue Oct 26 11:51:31 GMT 2010 Olly Betts <olly@survex.com>
1503 * docs/omegascript.rst,query.cc: Add new $emptydocs command which
1504 returns a list of documents with doclength zero.
1505 * query.cc: Extend $field to take an optional DOCID argument, rather
1506 than always using the context from $hitlist.
1507 * templates/emptydocs: New template which lists documents with
1510 Thu Oct 21 12:05:23 GMT 2010 Olly Betts <olly@survex.com>
1512 * configure.ac,unixperm.cc: Fix to build on platforms where
1513 getgrouplist() exists but takes int* not gid_t* (e.g. Mac OS X).
1515 Wed Oct 20 10:30:13 GMT 2010 Olly Betts <olly@survex.com>
1517 * omindex.cc,scriptindex.cc: Add boolean terms with add_boolean_term()
1518 so they get wdf of 0 and don't contribute to document length.
1520 Sat Oct 16 06:13:23 GMT 2010 Olly Betts <olly@survex.com>
1522 * configure.ac: Probe for any options needed to enable large file
1523 support. Handling files >= 2GB isn't especially useful, but more
1524 importantly this is needed to allow omindex to index files on filing
1525 systems with 64 bit inodes on some platforms (e.g. 32-bit Linux).
1527 Mon Oct 11 11:11:07 GMT 2010 Olly Betts <olly@survex.com>
1529 * Makefile.am: Drop special case to remove man pages on "make clean"
1532 Wed Sep 29 04:14:21 GMT 2010 Olly Betts <olly@survex.com>
1534 * Makefile.am,configure.ac,query.cc,unixperm.cc,unixperm.h: Pull out
1535 permission checks into a separate file and check Unix user and group
1536 permissions based on environmental variable REMOTE_USER, if set.
1538 Tue Sep 28 08:06:00 GMT 2010 Olly Betts <olly@survex.com>
1540 * Makefile.am: Ship common/realtime.h.
1542 Tue Sep 28 06:32:10 GMT 2010 Olly Betts <olly@survex.com>
1544 * query.cc: Apply permission filters if USER and/or GROUP are set.
1546 Tue Sep 28 06:14:50 GMT 2010 Olly Betts <olly@survex.com>
1548 * ./: Update svn:externals to latest common from xapian-core.
1549 * query.cc: Use RealTime::now() to time running the query. Include
1550 more enquire set-up in the time.
1552 Tue Sep 28 05:26:07 GMT 2010 Olly Betts <olly@survex.com>
1554 * omindex.cc: Index file owner and read permissions, to allow finding
1555 documents with a particular owner, and so searches can be restricted
1556 to documents a user is able to read.
1557 * docs/termprefixes.rst: Document term prefixes used by the above.
1559 Tue Sep 28 05:20:01 GMT 2010 Olly Betts <olly@survex.com>
1561 * diritor.h: Rename get_other_read() to is_other_readable() for
1564 Tue Sep 28 04:16:55 GMT 2010 Olly Betts <olly@survex.com>
1566 * diritor.cc,diritor.h: Rearrange so that the setting of statbuf_valid
1567 gets inlined so the compiler should be able to optimise out
1568 subsequent calls to call_stat().
1570 Tue Sep 28 04:10:28 GMT 2010 Olly Betts <olly@survex.com>
1572 * diritor.h: Add methods to read the owner and group, and to check
1573 who can read the file.
1575 Tue Sep 28 01:39:15 GMT 2010 Olly Betts <olly@survex.com>
1579 Tue Sep 28 01:33:44 GMT 2010 Olly Betts <olly@survex.com>
1581 * NEWS: Fix whitespace oddities.
1583 Tue Sep 28 01:31:46 GMT 2010 Olly Betts <olly@survex.com>
1585 * NEWS: Update from ChangeLog.
1587 Tue Sep 28 01:27:41 GMT 2010 Olly Betts <olly@survex.com>
1589 * omindex.cc: Improve --help for --mime-type option.
1591 Mon Sep 20 06:50:45 GMT 2010 Olly Betts <olly@survex.com>
1593 * omindex.cc,svgparse.cc,svgparse.h: Extract any document title and
1594 keywords from SVG files.
1596 Mon Sep 20 06:49:44 GMT 2010 Olly Betts <olly@survex.com>
1598 * htmlparse.cc: Call closing_tag() for XML empty tag syntax (like
1601 Mon Sep 20 05:30:54 GMT 2010 Olly Betts <olly@survex.com>
1603 * Makefile.am,docs/overview.rst,omindex.cc,svgparse.cc,svgparse.h: Add
1604 support for indexing SVG files.
1606 Tue Sep 07 04:39:59 GMT 2010 Olly Betts <olly@survex.com>
1608 * outlookmsg2html.in: If the required perl modules aren't available,
1609 exit with status 127 which omindex interprets as "filter not
1610 installed" and won't try further .msg files.
1612 Tue Sep 07 02:24:36 GMT 2010 Olly Betts <olly@survex.com>
1614 * Makefile.am,configure.ac,docs/overview.rst,omindex.cc,
1615 outlookmsg2html.in,pkglibbindir.cc,pkglibbindir.h: Add support for
1616 indexing .msg files from Microsoft Outlook. (ticket#334)
1618 Tue Aug 31 06:32:15 GMT 2010 Olly Betts <olly@survex.com>
1620 * omindex.cc: Fix handling of quoting in CSV files to match what's
1623 Tue Aug 31 05:41:13 GMT 2010 Olly Betts <olly@survex.com>
1625 * docs/overview.rst,omindex.cc: The V in CSV is Values not Variable.
1627 Mon Aug 30 14:56:36 GMT 2010 Olly Betts <olly@survex.com>
1629 * docs/overview.rst,omindex.cc: Add support for indexing .csv files.
1631 Sat Aug 28 11:46:22 GMT 2010 Olly Betts <olly@survex.com>
1633 * cdb_find.cc,cdb_init.cc,cgiparam.cc,date.cc,md5.cc,query.cc,utils.cc,
1634 values.h: Fix to compile with Sun C++.
1636 Sat Aug 28 11:36:25 GMT 2010 Olly Betts <olly@survex.com>
1638 * omega.cc: An ESet can't contain empty terms, so there's no need to
1641 Tue Aug 24 05:58:28 GMT 2010 Olly Betts <olly@survex.com>
1643 * NEWS,configure.ac: Update for 1.2.3.
1645 Mon Aug 23 15:08:11 GMT 2010 Olly Betts <olly@survex.com>
1647 * xapian-omega.spec.in: Don't run autoreconf - it's no longer required.
1649 Tue Aug 03 14:11:35 GMT 2010 Olly Betts <olly@survex.com>
1651 * docs/termprefixes.rst: Update "flint and quartz" to "flint and chert"
1652 as quartz is no longer supported. Give exact term length limit for
1655 Sun Jun 27 05:00:39 GMT 2010 Olly Betts <olly@survex.com>
1657 * NEWS,configure.ac: Update for 1.2.2.
1659 Sat Jun 26 15:59:59 GMT 2010 Olly Betts <olly@survex.com>
1661 * NEWS.SKELETON: Add blank line to the end.
1663 Sat Jun 26 15:59:05 GMT 2010 Olly Betts <olly@survex.com>
1665 * NEWS.SKELETON: Add template NEWS entry.
1667 Tue Jun 22 13:55:11 GMT 2010 Olly Betts <olly@survex.com>
1669 * NEWS: Sync with 1.0.21.
1670 * NEWS,configure.ac: Update for 1.2.1.
1672 Sun Jun 13 11:55:40 GMT 2010 Olly Betts <olly@survex.com>
1674 * freemem.cc: Merge in __WIN32__ implementation from perftest in
1677 Fri May 14 01:39:43 GMT 2010 Olly Betts <olly@survex.com>
1679 * freemem.cc: Use "safeunistd.h" instead of <unistd.h>.
1681 Wed Apr 28 13:38:33 GMT 2010 Olly Betts <olly@survex.com>
1683 * NEWS: Sync with 1.0.20.
1685 Wed Apr 28 06:44:56 GMT 2010 Olly Betts <olly@survex.com>
1687 * configure.ac: Tell libtool not to link in deplibs on platforms where
1688 we know they aren't needed.
1689 * configure.ac: On Linux, extract the library search path from ldconfig
1690 which gives us the default entries reliably.
1691 * NEWS,configure.ac: 1.2.0.
1693 Thu Apr 15 04:32:06 GMT 2010 Olly Betts <olly@survex.com>
1695 * NEWS,configure.ac: Update for 1.1.5.
1697 Mon Feb 15 14:00:26 GMT 2010 Olly Betts <olly@survex.com>
1699 * configure.ac: Update for 1.1.4.
1701 Mon Feb 15 13:51:44 GMT 2010 Olly Betts <olly@survex.com>
1703 * NEWS: Add missing notes for 1.1.2 and 1.1.1 including changes from
1704 1.0.14 and 1.0.13 respectively.
1706 Mon Feb 15 13:28:12 GMT 2010 Olly Betts <olly@survex.com>
1708 * NEWS: Update from ChangeLog and 1.0.18.
1710 Mon Feb 08 00:48:44 GMT 2010 Olly Betts <olly@survex.com>
1712 * Makefile.am: Need to ship common/omassert.h.
1714 Sun Feb 07 23:03:45 GMT 2010 Olly Betts <olly@survex.com>
1716 * Makefile.am: Need to ship common/str.h.
1718 Sun Feb 07 21:40:03 GMT 2010 Olly Betts <olly@survex.com>
1720 * Makefile.am,omega.cc,omindex.cc,query.cc,utils.cc,utils.h: Use the
1721 optimised str() routine instead of int_to_string() and
1724 Fri Feb 05 23:29:12 GMT 2010 Olly Betts <olly@survex.com>
1726 * omindex.cc: Increase the wdf boost for the document title from 2 to
1727 5, since 2 isn't really enough.
1729 Thu Feb 04 03:20:02 GMT 2010 Olly Betts <olly@survex.com>
1731 * Makefile.am,configure.ac,runfilter.cc: Use safesyswait.h.
1732 * runfilter.cc: Reformat header to @file doxygen comment. Put
1733 '#include "runfilter.h"' right after <config.h>.
1735 Wed Dec 10 00:15:10 GMT 2009 Olly Betts <olly@survex.com>
1737 * NEWS: Update from ChangeLog.
1739 Wed Dec 09 00:26:19 GMT 2009 Olly Betts <olly@survex.com>
1741 * myhtmlparse.cc: Add missing "using namespace std;".
1743 Wed Dec 09 00:20:38 GMT 2009 Olly Betts <olly@survex.com>
1745 * htmlparse.cc: Make the default charset "utf-8" not "UTF-8" as we
1746 lower case explicitly specified character sets to compare to see
1747 if we need to reparse, so this avoids a reparse when UTF-8 is
1748 explicitly specified as well as the default.
1750 Tue Dec 08 23:56:46 GMT 2009 Olly Betts <olly@survex.com>
1752 * scriptindex.cc: Don't bomb out if indexing is disallowed or we hit
1753 </body> for a document which had an overridden character set.
1756 Wed Nov 18 10:48:47 GMT 2009 Olly Betts <olly@survex.com>
1758 * NEWS,configure.ac: Update for 1.1.3.
1760 Wed Nov 18 02:37:34 GMT 2009 Olly Betts <olly@survex.com>
1762 * NEWS: Update from 1.0.17 and ChangeLog.
1764 Mon Nov 16 09:08:12 GMT 2009 Olly Betts <olly@survex.com>
1766 * utf8converttest.cc: Charset "8859_1" isn't understood by Solaris
1767 libiconv, and isn't likely to be specified on a page, so just
1768 test it for our built-in convertor and GNU libc.
1770 Wed Nov 11 04:52:25 GMT 2009 Olly Betts <olly@survex.com>
1772 * configure.ac: Also check for socketpair with -lxnet if it isn't found
1773 without, which enables resource limits on Solaris, and possibly some
1774 other platforms. Fixes ticket#412.
1776 Wed Nov 04 01:51:41 GMT 2009 Olly Betts <olly@survex.com>
1778 * freemem.cc: On Linux, _SC_AVPHYS_PAGES excludes pages used by the OS
1779 VM cache, so will often return a really low value, so instead use
1780 _SC_PHYS_PAGES. Reported by Rune Kock in Debian bug#548987. Also
1781 explains ticket#358.
1783 Wed Nov 04 00:54:38 GMT 2009 Olly Betts <olly@survex.com>
1785 * common/: Sync with latest version from xapian-core to pick up getopt
1786 fix for Mac OS X 10.6.
1788 Mon Nov 02 09:32:22 GMT 2009 Olly Betts <olly@survex.com>
1790 * omindex.cc: Use delete[] (not delete) for array allocated by new[].
1792 Mon Nov 02 07:08:13 GMT 2009 Olly Betts <olly@survex.com>
1794 * runfilter.cc: Fix likely crash if read() is interrupted by a signal.
1795 Identified by Coverity's Scan.
1797 Mon Nov 02 06:47:01 GMT 2009 Olly Betts <olly@survex.com>
1799 * scriptindex.cc: Extend exception handling to the whole of main.
1800 Xapian::Stem("english") can't actually throw, but that's not obvious
1801 to static analysis tools, and it is more robust to wrap the whole of
1802 main, and reduces indentation.
1804 Mon Nov 02 06:32:41 GMT 2009 Olly Betts <olly@survex.com>
1806 * omindex.cc,scriptindex.cc: Tighten up the type of the error we catch
1807 to detect an unknown stemming language.
1809 Thu Sep 17 12:13:10 GMT 2009 Olly Betts <olly@survex.com>
1811 * NEWS: Update from ChangeLog.
1813 Thu Sep 10 13:33:06 GMT 2009 Olly Betts <olly@survex.com>
1815 * configure.ac: Default to looking for xapian-config-1.1.
1817 Thu Sep 10 06:46:55 GMT 2009 Olly Betts <olly@survex.com>
1819 * NEWS: Sync changes from 1.0.15 and 1.0.16.
1821 Wed Sep 09 13:32:25 GMT 2009 Olly Betts <olly@survex.com>
1823 * omega.cc,query.cc,query.h: Fix cross-site scripting vulnerability in
1824 reporting of exceptions (CVE-2009-2947).
1826 Fri Aug 28 15:30:07 GMT 2009 Richard Boulton <richard@lemurconsulting.com>
1828 * configure.ac: Check for PERL if in maintainer mode, not just when
1829 building documentation, because making the omegascript vim syntax
1832 Wed Aug 26 14:17:06 GMT 2009 Olly Betts <olly@survex.com>
1834 * templates/query: www.xapian.org -> xapian.org.
1836 Tue Aug 25 11:15:38 GMT 2009 Olly Betts <olly@survex.com>
1838 * gen-omegascript-vim: Fix swapped arguments to perl mkdir function.
1840 Tue Aug 25 10:39:29 GMT 2009 Olly Betts <olly@survex.com>
1842 * gen-omegascript-vim: Add GPL licence boilerplate.
1844 Tue Aug 25 10:29:07 GMT 2009 Olly Betts <olly@survex.com>
1846 * gen-omegascript-vim: Need to create "extra" for a VPATH build.
1848 Tue Aug 25 08:39:00 GMT 2009 Olly Betts <olly@survex.com>
1850 * Makefile.am: Fix for VPATH build.
1852 Tue Aug 25 06:38:08 GMT 2009 Olly Betts <olly@survex.com>
1854 * Makefile.am,extra/omegascript.vim,extra/omegascript.vim.in,
1855 gen-omegascript-vim: The list of OmegaScript commands in the vim
1856 mode was rather out of date, and a few commands were misclassified.
1857 Fix both problems and avoid future recurrences by automatically
1858 generating those lists from the command list in query.cc.
1860 Sat Aug 15 11:31:56 GMT 2009 Olly Betts <olly@survex.com>
1862 * NEWS: Update from ChangeLog.
1864 Wed Aug 05 03:50:54 GMT 2009 Olly Betts <olly@survex.com>
1866 * omindex.cc: Implement correct handling of paths when calling
1867 external filter programs on Microsoft Windows.
1869 Thu Jul 23 12:07:24 GMT 2009 Olly Betts <olly@survex.com>
1871 * omindex.cc: Remove pointless fallback code.
1873 Thu Jul 23 12:06:37 GMT 2009 Olly Betts <olly@survex.com>
1875 * templates/inc/toptermsjs: Use double-quotes rather than single quotes
1876 for parameter values on the <script> tag.
1878 Thu Jul 23 11:29:43 GMT 2009 Olly Betts <olly@survex.com>
1880 * docs/omegascript.rst: Document that $date uses UTC. (ticket#314)
1882 Thu Jul 23 11:26:15 GMT 2009 Olly Betts <olly@survex.com>
1884 * templates/query: If JavaScript is available, convert the
1885 $field{modtime} to a string on the client-side so that the timezone
1886 is correct. If JavaScript isn't available, fall back to the existing
1887 behaviour of using UTC. (ticket#314)
1889 Thu Jul 23 04:12:02 GMT 2009 Olly Betts <olly@survex.com>
1891 * NEWS,configure.ac: Update for 1.1.2.
1893 Wed Jul 22 04:33:29 GMT 2009 Olly Betts <olly@survex.com>
1895 * NEWS: Update from ChangeLog and sync with 1.0.13 and 1.0.14.
1897 Tue Jul 07 15:05:09 GMT 2009 Olly Betts <olly@survex.com>
1899 * omindex.cc: Consistently use endl not "\n" at the end of messages so
1900 that output is flushed.
1902 Tue Jul 07 07:29:21 GMT 2009 Olly Betts <olly@survex.com>
1904 * cdb_init.cc,cdb_int.h,cgiparam.cc,configfile.cc,date.cc,
1905 datematchdecider.cc,datematchdecider.h,freemem.cc,htmlparse.cc,
1906 htmlparsetest.cc,md5.cc,md5test.cc,myhtmlparse.cc,omega.cc,
1907 omindex.cc,query.cc,runfilter.cc,scriptindex.cc,strcasecmp.h,
1908 utf8converttest.cc,utils.cc: Update to use C++ forms for ISO C
1909 standard headers (ticket#330).
1911 Mon Jul 06 01:54:35 GMT 2009 Olly Betts <olly@survex.com>
1913 * loadfile.cc: Avoid infinite loop if the file has been truncated
1914 since we read the length, or on Cygwin with the automatic end of
1915 line translation turned on.
1917 Sun Jul 05 13:00:57 GMT 2009 Olly Betts <olly@survex.com>
1919 * htmlparse.cc,htmlparse.h: Make HtmlParser::get_parameter() const
1922 Sun Jul 05 12:59:45 GMT 2009 Olly Betts <olly@survex.com>
1924 * cdb_init.cc: Prefer static_cast<> to C-style cast.
1926 Sat Jun 20 03:31:22 GMT 2009 Olly Betts <olly@survex.com>
1928 * docs/overview.rst: www.xapian.org -> xapian.org
1930 Thu Jun 11 09:45:45 GMT 2009 Olly Betts <olly@survex.com>
1932 * omindex.cc: Extract pptx notesSlides and comments, if present. If
1933 they aren't, unzip returns exit code 11, which we must ignore
1936 Thu Jun 11 07:38:57 GMT 2009 Olly Betts <olly@survex.com>
1938 * docs/overview.rst,omindex.cc: Handle the "macroenabled" versions of
1939 MS Office 2007 files too (ticket#290).
1941 Wed Jun 10 01:13:14 GMT 2009 Olly Betts <olly@survex.com>
1943 * configure.ac: Update for 1.1.1.
1945 Tue Jun 09 14:35:40 GMT 2009 Olly Betts <olly@survex.com>
1947 * NEWS: Update for 1.1.1.
1949 Mon May 25 13:38:46 GMT 2009 Olly Betts <olly@survex.com>
1951 * query.cc: If SERVER_PROTOCOL in the environment is set to INCLUDED,
1952 then our output is being included in another page (e.g. using SSI)
1953 so suppress the output of any HTTP headers.
1955 Mon May 25 13:02:22 GMT 2009 Olly Betts <olly@survex.com>
1957 * templates/query: Remove extra "}" introduced when adding spelling
1960 Mon May 25 12:57:45 GMT 2009 Olly Betts <olly@survex.com>
1962 * cgiparam.cc,commonhelp.cc: Include the corresponding header.
1964 Mon May 25 12:56:55 GMT 2009 Olly Betts <olly@survex.com>
1966 * cgiparam.h: Add explicit inclusions of <map> and <string> and qualify
1967 multimap and string with std::.
1969 Sat May 23 12:21:33 GMT 2009 Olly Betts <olly@survex.com>
1971 * configure.ac: Sync warning flags used with GCC with xapian-core
1972 apart from -Woverloaded-virtual which fires for
1973 MyHtmlParser::parse_html(). That probably should be tidied up at
1974 some point, but not right now.
1976 Wed May 20 11:24:46 GMT 2009 Olly Betts <olly@survex.com>
1978 * omindex.cc: The MD5 checksum of a text file with a BOM was being
1979 incorrectly calculated from the contents converted to UTF-8
1980 since 1.0.7. Noticed by Srijon Biswas.
1982 Tue May 05 12:13:17 GMT 2009 Olly Betts <olly@survex.com>
1984 * omindex.cc: We can now use numeric_limits<> since we no longer
1985 support GCC 2.95, so use it and fix a warning on platforms with
1988 Thu Apr 30 14:09:50 GMT 2009 Olly Betts <olly@survex.com>
1990 * Makefile.am,docs/omegascript.rst,query.cc,weight.cc,weight.h: Add
1991 $opt{weighting} to allow the weighting scheme and parameters to be
1992 specified (ticket#298).
1994 Tue Apr 28 07:38:54 GMT 2009 Olly Betts <olly@survex.com>
1996 * omindex.cc: Check the last modification time of files before
1997 reindexing (ticket#342).
1999 Tue Apr 28 05:17:04 GMT 2009 Olly Betts <olly@survex.com>
2001 * omindex.cc: Drop the copyright info from the output of --version as
2002 it's perennially out of date and we don't report it for any other
2005 Tue Apr 28 05:03:29 GMT 2009 Olly Betts <olly@survex.com>
2007 * omindex.cc: If the filter for a filetype isn't installed, don't erase
2008 the entry from the mime_map, but instead set it to the empty string
2009 and then use this to report why we subsequently skip files with the
2010 same extension, rather than slightly misleadingly reporting "Unknown
2013 Mon Apr 27 16:34:29 GMT 2009 Olly Betts <olly@survex.com>
2015 * templates/query: Offer any spelling correction QueryParser gives.
2017 Mon Apr 27 13:36:19 GMT 2009 Olly Betts <olly@survex.com>
2019 * omindex.cc: Add "--spelling" option to index spelling correction
2022 Sun Apr 26 16:28:36 GMT 2009 Olly Betts <olly@survex.com>
2024 * omindex.cc: Make -s work as a short-form for --stemmer (as
2025 documented by "omindex --help" and "man omindex").
2027 Sun Apr 26 15:33:32 GMT 2009 Olly Betts <olly@survex.com>
2029 * docs/omegascript.rst,query.cc: Add $suggestion and $opt{spelling} to
2030 provide access to spelling correction (ticket#296).
2032 Sun Apr 26 15:08:40 GMT 2009 Olly Betts <olly@survex.com>
2034 * docs/scriptindex.rst,scriptindex.cc: Add new "spell" action for
2035 scriptindex (ticket#296).
2037 Thu Apr 23 07:40:41 GMT 2009 Olly Betts <olly@survex.com>
2039 * docs/scriptindex.rst,scriptindex.cc: Add new "valuenumeric" action
2040 to index a value using Xapian::sortable_serialise() to allow numeric
2041 sorting (ticket#260).
2043 Thu Apr 23 07:09:18 GMT 2009 Olly Betts <olly@survex.com>
2045 * Makefile.am,configure.ac,docs/Makefile.am: Fix things up so that in
2046 a bootstrapped SVN tree, automatic regeneration of
2047 autotools-generated files uses the in-tree versions of the autotools.
2049 Wed Apr 22 13:52:28 GMT 2009 Olly Betts <olly@survex.com>
2051 * NEWS: Update for 1.1.0.
2053 Mon Apr 20 14:20:51 GMT 2009 Olly Betts <olly@survex.com>
2055 * NEWS: Sync changes from 1.0.12.
2057 Mon Apr 20 14:15:41 GMT 2009 Olly Betts <olly@survex.com>
2059 * NEWS: Update from ChangeLog and clean up for release.
2061 Thu Apr 16 10:02:44 GMT 2009 Olly Betts <olly@survex.com>
2063 * transform.cc: Fix off-by-one error - the return value of pcre_exec()
2064 is one more than the number of groupings.
2066 Thu Apr 16 09:23:29 GMT 2009 Olly Betts <olly@survex.com>
2068 * Makefile.am: Need to ship new file transform.h.
2070 Thu Apr 16 08:20:01 GMT 2009 Olly Betts <olly@survex.com>
2072 * Makefile.am,docs/omegascript.rst,query.cc,transform.cc,transform.h:
2073 Factor out the implementation of $transform into a separate source
2074 file and compile only that file with $(PCRE_CFLAGS) to avoid
2075 problems reported by James Aylett with Mac OS X on #xapian-devel.
2076 Fix expansion of \1 to \9 to work correctly and document these
2077 and \\. Fix handling of unescaped \ at the end of the pattern, and
2078 leave unrecognised \<x> sequences unchanged.
2080 Thu Apr 16 04:38:20 GMT 2009 Olly Betts <olly@survex.com>
2082 * configure.ac: Remove duplicate "AC_SUBST(AM_CXXFLAGS)".
2084 Thu Apr 16 04:29:28 GMT 2009 Olly Betts <olly@survex.com>
2086 * configure.ac: Avoid implicitly casting a string literal to char* in
2087 the test for iconv by adding the same explicit cast we use in the
2088 code in utf8convert.cc. Currently the implicit cast is "only" a
2089 warning under GCC, but the user could pass -Werror explicitly in
2090 CXXFLAGS, and this could be promoted to an error in future GCC
2091 versions, and may already be so for some other compilers.
2093 Thu Apr 16 03:56:16 GMT 2009 Olly Betts <olly@survex.com>
2095 * configure.ac: Back out previous fix - -Werror has nothing to do with
2096 the issue James reported.
2098 Tue Apr 14 15:34:36 GMT 2009 Richard Boulton <richard@lemurconsulting.com>
2100 * configure.ac: Test for compiler flags before checking for
2101 libraries, and use the compiler flags found when checking for
2102 things. In particular, this should fix the test for the type
2103 used by iconv() on MacOS (where it was previously returning "char
2104 *", and the test was giving a warning about converting this to
2105 "const char *", but not failing). Requires a change to the iconv
2106 test to avoid it failing on linux with GCC due to an unrelated
2107 warning in the test code.
2109 Sat Apr 04 15:15:18 GMT 2009 Olly Betts <olly@survex.com>
2111 * NEWS: Update from ChangeLog.
2113 Wed Mar 25 12:35:42 GMT 2009 Olly Betts <olly@survex.com>
2115 * Makefile.am,configure.ac: Actually use all those warning flags we
2116 carefully determine!
2118 Wed Mar 25 12:03:37 GMT 2009 Olly Betts <olly@survex.com>
2120 * Makefile.am,configure.ac: Only put XAPIAN_CXXFLAGS in CXXFLAGS for
2121 the duration of configure (we need it as it may include options to
2122 put the compiler into ISO C++ mode). Set AM_CXXFLAGS to
2123 XAPIAN_CXXFLAGS in Makefile.am. This means that the user can safely
2124 override CXXFLAGS at make-time: "make CXXFLAGS=-Os"
2126 Wed Mar 25 10:56:29 GMT 2009 Olly Betts <olly@survex.com>
2128 * query.cc: Cope with write() not writing all the data or being
2129 interrupted by a signal when writing log entries.
2131 Wed Mar 25 10:48:14 GMT 2009 Olly Betts <olly@survex.com>
2133 * configure.ac: Move AC_PROG_CXX and AC_LANG_CPLUSPLUS earlier so that
2134 CXXFLAGS is set before we add XAPIAN_CXXFLAGS to it. With libtool
2135 1.5.x this wasn't an issue, as AC_PROG_CXX was implicitly run early
2136 on. With libtool 2.2.x it is as AC_PROG_CXX doesn't touch CXXFLAGS
2137 if it is already set, so we don't get "-O2 -g" set for GCC.
2139 Wed Mar 18 06:13:16 GMT 2009 Olly Betts <olly@survex.com>
2141 * scriptindex.cc: Mark "index=nopos" error for removal in 1.3.0
2142 not 1.2.0. Tweak code that produces it to use more literal strings.
2144 Wed Mar 18 06:12:06 GMT 2009 Olly Betts <olly@survex.com>
2146 * docs/scriptindex.rst: The deprecated "index=nopos" is now removed
2147 and gives an error explaining what to use instead, so remove the
2148 documentation saying it is deprecated and what to do.
2150 Mon Mar 16 14:07:58 GMT 2009 Olly Betts <olly@survex.com>
2152 * NEWS: Sync with 1.0.11.
2154 Sat Feb 28 08:31:15 GMT 2009 Olly Betts <olly@survex.com>
2156 * omindex.cc,scriptindex.cc: Use commit() rather than flush().
2158 Sat Feb 28 08:28:26 GMT 2009 Olly Betts <olly@survex.com>
2160 * scriptindex.cc: Don't call reopen() on a WritableDatabase - it
2161 doesn't do anything!
2163 Thu Feb 26 06:38:05 GMT 2009 Olly Betts <olly@survex.com>
2165 * NEWS: Update from ChangeLog.
2167 Thu Feb 26 06:18:05 GMT 2009 Olly Betts <olly@survex.com>
2169 * omindex.cc: Mark "-l" as requiring an argument so that it actually
2170 works - previously it would always result in a segmentation fault.
2172 Thu Feb 26 00:17:56 GMT 2009 Olly Betts <olly@survex.com>
2174 * docs/cgiparams.rst: Note the technique of using a stub database file
2175 to allow a default of searching over multiple databases.
2177 Wed Feb 25 12:39:08 GMT 2009 Olly Betts <olly@survex.com>
2179 * configure.ac: Update g++ version check to match recent change to
2180 xapian-core. Also turn on _FORTIFY_SOURCE and make the rare()
2181 and usual() branch prediction hint macros available.
2183 Mon Feb 23 06:05:25 GMT 2009 Olly Betts <olly@survex.com>
2185 * Makefile.am,docs/overview.rst,omindex.cc,xpsxmlparse.cc,
2186 xpsxmlparse.h: Add support for XPS files (bug#290).
2188 Fri Feb 20 03:25:14 GMT 2009 Olly Betts <olly@survex.com>
2190 * query.cc: Wrap a long comment.
2192 Thu Feb 19 10:34:36 GMT 2009 Olly Betts <olly@survex.com>
2194 * omega.cc,query.cc: Prefer str.resize(0) to str = "".
2196 Thu Feb 19 06:23:34 GMT 2009 Olly Betts <olly@survex.com>
2198 * docs/overview.rst,omindex.cc: Add support for MS Office 2007
2201 Thu Feb 19 04:46:26 GMT 2009 Olly Betts <olly@survex.com>
2203 * metaxmlparse.cc,metaxmlparse.h,xmlparse.cc,xmlparse.h: XmlParser and
2204 MetaXmlParser were overriding opening_tag with the wrong signature so
2205 their implementations weren't ever being used.
2207 Fri Jan 09 04:19:32 GMT 2009 Olly Betts <olly@survex.com>
2209 * runfilter.cc: Fix to compile when RLIMIT_AS isn't available (as on
2210 NetBSD and OpenBSD). In this situation, instead use RLIMIT_VMEM or
2211 RLIMIT_DATA if either is available.
2213 Wed Dec 10 01:06:03 GMT 2008 Olly Betts <olly@survex.com>
2215 * query.cc: Fix poor grammar in comment.
2217 Sat Nov 01 01:49:07 GMT 2008 Olly Betts <olly@survex.com>
2219 * NEWS: Sync with 1.0.9.
2221 Fri Oct 31 18:34:49 GMT 2008 Olly Betts <olly@survex.com>
2223 * configure.ac: Sync warning flag handling changes from xapian-core.
2225 Thu Oct 23 17:08:22 GMT 2008 Olly Betts <olly@survex.com>
2227 * docs/overview.rst: Document HTML parsing a bit, including robots
2228 meta and htdig_noindex.
2230 Sat Oct 18 08:00:24 GMT 2008 Olly Betts <olly@survex.com>
2232 * omega.cc: Catch std::exception and report what its what() method
2235 Thu Oct 09 10:16:05 GMT 2008 Olly Betts <olly@survex.com>
2237 * configure.ac: Update autoconf requirement to 2.63, libtool to 2.2.6.
2239 Wed Oct 01 04:48:37 GMT 2008 Olly Betts <olly@survex.com>
2241 * scriptindex.cc: Separate Action constructor cases to avoid
2242 pointlessly calling atoi() on an empty string.
2244 Wed Oct 01 03:15:29 GMT 2008 Olly Betts <olly@survex.com>
2246 * omega.cc,omega.h: Remove undocumented and non-functional support for
2247 numeric sorting via: SORT=#<slot>
2249 Thu Sep 04 04:26:22 GMT 2008 Olly Betts <olly@survex.com>
2251 * configure.ac: Set version to 1.1.0.
2253 Thu Sep 04 04:21:12 GMT 2008 Olly Betts <olly@survex.com>
2255 * NEWS: Sync with 1.0.8 and update from ChangeLog.
2257 Wed Sep 03 12:26:58 GMT 2008 Olly Betts <olly@survex.com>
2259 * htmlparse.cc,htmlparse.h,htmlparsetest.cc,myhtmlparse.cc,
2260 myhtmlparse.h,omindex.cc,scriptindex.cc,xmlparse.h: If the character
2261 encoding is specified using <meta http-equiv=...> in an HTML
2262 document then reparse the document if it isn't the encoding we're
2263 already using so that any preceding <title> is converted correctly
2266 Convert text from meta tag parameters to UTF-8 (bug#293).
2268 Handle <meta charset="..."> (new in HTML 5).
2270 Fix bug in parameter parsing which was probably just a small
2271 performance penalty in real world cases, but could perhaps result in
2272 parsing bogus extra parameters in carefully contrived situations.
2274 Tue Aug 05 09:24:33 GMT 2008 Olly Betts <olly@survex.com>
2276 * docs/: Fix a few typos and improve wording in a few places.
2278 Tue Aug 05 09:19:56 GMT 2008 Olly Betts <olly@survex.com>
2280 * omindex.cc: Tweak to use string::assign() instead of assigning the
2281 result of string::substr().
2283 Tue Jul 29 23:48:31 GMT 2008 Olly Betts <olly@survex.com>
2285 * runfilter.cc: Add missing <signal.h>, noted on FreeBSD by Henrik
2288 Mon Jul 21 12:27:48 GMT 2008 Olly Betts <olly@survex.com>
2290 * commonhelp.cc: Use PACKAGE_BUGREPORT instead of hardcoding the bug
2291 report URL. Remove reference to "bugzilla" as we now use trac
2294 Mon Jul 21 11:58:25 GMT 2008 Olly Betts <olly@survex.com>
2296 * configure.ac: Put the bug report URL as the third parameter to
2297 AC_INIT. Add proper m4 quoting in a few places (nowhere that
2298 should actually change behaviour). Add hard autotools version
2299 requirements to match xapian-core, and remove the version
2300 justification since HACKING now covers that. Drop docdir workaround
2301 for autoconf < 2.60.
2303 Wed Jul 09 10:44:37 GMT 2008 Olly Betts <olly@survex.com>
2305 * configure.ac: The workaround to avoid probe code for F77, GCJ, and
2306 RC being added to configure is no longer required now that we're
2307 using libtool 2.2 so remove it.
2309 Wed Jul 09 10:13:18 GMT 2008 Olly Betts <olly@survex.com>
2311 * Makefile.am,configure.ac: Use AC_CONFIG_MACRO_DIR and
2312 ACLOCAL_AMFLAGS as libtoolize 2.2.4 recommends.
2314 Fri Jul 04 08:29:47 GMT 2008 Olly Betts <olly@survex.com>
2316 * NEWS: Synchronise with 1.0 branch.
2318 Fri Jul 04 08:15:03 GMT 2008 Olly Betts <olly@survex.com>
2320 * utf8convert.cc,utf8converttest.cc: UTF-16 with no BOM is meant to be
2321 assumed to be big-endian. GNU libiconv doesn't handle some examples
2322 as expected, so disable them when using iconv() for now.
2324 Fri Jul 04 06:39:20 GMT 2008 Olly Betts <olly@survex.com>
2326 * omindex.cc: Handle UCS-2 and UTF-16 text files with a byte-order
2327 mark (BOM). Ignore any UTF-8 "byte-order" mark.
2328 * utf8convert.cc: Handle UCS-2/UTF-16 and explicit BE and LE forms in
2330 * Makefile.am,utf8converttest.cc: Add unit tests of convert_to_utf8().
2332 Fri Jun 27 04:43:18 GMT 2008 Olly Betts <olly@survex.com>
2334 * query.cc: Overhaul the $highlight colour combinations since some
2335 were rather unreadable. Reported by Joey Hess in Debian bug
2338 Sun Jun 01 15:12:02 GMT 2008 Olly Betts <olly@survex.com>
2340 * configure.ac: Update version to 1.0.7 to match 1.0 branch.
2342 Sun May 25 14:56:41 GMT 2008 Olly Betts <olly@survex.com>
2344 * NEWS: Synchronise with 1.0 branch, and update from ChangeLog.
2346 Sat May 17 11:42:26 GMT 2008 Olly Betts <olly@survex.com>
2348 * docs/omegascript.rst,docs/scriptindex.rst: Tweak mark-up so
2349 generated HTML gets a non-empty title.
2351 Sat May 10 11:14:20 GMT 2008 Olly Betts <olly@survex.com>
2353 * Makefile.am: omega_CPPFLAGS overrides AM_CPPFLAGS, so we need to
2354 explicitly include AM_CPPFLAGS in omega_CPPFLAGS to get
2355 CONFIGFILE_SYSTEM defined when building omega.
2357 Fri May 09 19:27:21 GMT 2008 Olly Betts <olly@survex.com>
2359 * Makefile.am: Fix handling of any -I options needed for PCRE.
2361 Sun May 04 19:12:08 GMT 2008 Olly Betts <olly@survex.com>
2363 * omindex.cc: Fix comment error regarding catdvi options.
2365 Sat May 03 14:02:02 GMT 2008 Olly Betts <olly@survex.com>
2367 * xapian-omega.spec.in: Remove "www." from xapian.org and
2368 oligarchy.co.uk URLs.
2370 Sat May 03 13:55:35 GMT 2008 Olly Betts <olly@survex.com>
2372 * cgiparam.cc,htdig2omega,mbox2omega,omindex-config.cc: Update FSF
2375 Sat May 03 13:54:25 GMT 2008 Olly Betts <olly@survex.com>
2377 * gnu_getopt.h: Remove old copy of file which is no longer used - we
2378 now share a copy with xapian-core via common/.
2380 Sat May 03 10:42:27 GMT 2008 Olly Betts <olly@survex.com>
2382 * configure.ac: Fix header checks to pre-include <sys/types.h> which
2383 Mac OS X needs for some other headers to work.
2385 Sat May 03 10:41:18 GMT 2008 Olly Betts <olly@survex.com>
2387 * configure.ac: Improve code which prevents probing for f77, etc.
2389 Fri May 02 17:52:44 GMT 2008 Olly Betts <olly@survex.com>
2391 * configure.ac: Fix to fail if --with-iconv is specified and libiconv
2392 isn't, and we aren't using fink on Mac OS X.
2394 Fri May 02 15:55:24 GMT 2008 Richard Boulton <richard@lemurconsulting.com>
2396 * configure.ac: If iconv isn't found, set with_iconv to "no", to
2397 prevent USE_ICONV being set. Was previously only doing this if
2398 fink on OS X was found.
2400 Fri May 02 14:14:07 GMT 2008 Richard Boulton <richard@lemurconsulting.com>
2402 * query.cc: Cast size to unsigned before division to avoid a
2403 warning about signed overflow.
2405 Fri May 02 14:08:39 GMT 2008 Richard Boulton <richard@lemurconsulting.com>
2407 * configure.ac: Synchronise code for working out warning flags used
2408 for builds with that used for xapian-core. Copes with different
2409 formats of version number output by "gcc --version" which should
2410 help to improve output.
2412 Tue Apr 15 23:44:10 GMT 2008 Richard Boulton <richard@lemurconsulting.com>
2414 * query.cc: Catch only the specific error which indicates a need to
2415 repeat a get_termfreq() call on the database instead of the mset.
2417 Sun Apr 13 11:19:49 GMT 2008 Richard Boulton <richard@lemurconsulting.com>
2419 * freemem.h: Specify units of get_free_physical_memory().
2421 Sun Apr 06 09:05:58 GMT 2008 Olly Betts <olly@survex.com>
2423 * freemem.cc: Fix latent compilation error on FreeBSD, pointed out by
2426 Mon Mar 31 02:00:48 GMT 2008 Olly Betts <olly@survex.com>
2428 * configure.ac: Update version to 1.0.6 to match latest release.
2430 Wed Mar 12 07:04:56 GMT 2008 Olly Betts <olly@survex.com>
2432 * scriptindex.cc: Make deprecated "index=nopos" an error.
2434 Mon Mar 10 03:37:30 GMT 2008 Olly Betts <olly@survex.com>
2436 * Makefile.am,diritor.cc,diritor.h,omindex.cc: Check for readdir()
2439 Thu Mar 06 23:43:11 GMT 2008 Olly Betts <olly@survex.com>
2441 * common/: Update to latest revisions.
2442 * Makefile.am,diritor.h: Use safedirent.h not dirent.h and build
2443 msvc_dirent.cc as part of omindex.
2445 Wed Mar 05 23:16:23 GMT 2008 Olly Betts <olly@survex.com>
2447 * NEWS: Update to HEAD with un-backported changes kept separate.
2449 Wed Mar 05 19:05:12 GMT 2008 Olly Betts <olly@survex.com>
2451 * NEWS: Update to 1.0 branch point.
2453 Sat Feb 02 22:46:40 GMT 2008 Olly Betts <olly@survex.com>
2455 * query.cc: Add (C) notice for Thomas Viehmann.
2457 Sat Feb 02 22:46:14 GMT 2008 Olly Betts <olly@survex.com>
2459 * omindex.cc: Back out random change committed by accident.
2461 Sat Feb 02 21:23:07 GMT 2008 Olly Betts <olly@survex.com>
2463 * omindex.cc,query.cc: New OmegaScript commands $addfilter, $lower,
2465 * docs/omegascript.rst: Document. Improve formatting.
2467 Fri Feb 01 01:45:26 GMT 2008 Olly Betts <olly@survex.com>
2469 * INSTALL: PCRE required.
2470 * docs/omegascript.rst: $transform{} now enabled. Fixes bug#231.
2472 Fri Feb 01 01:35:58 GMT 2008 Olly Betts <olly@survex.com>
2474 * Makefile.am,configure.ac,query.cc: Add PCRE as a requirement and
2475 add $transform{} command (which has been in the code for ages but
2478 Sat Jan 19 02:01:02 GMT 2008 Olly Betts <olly@survex.com>
2480 * omindex.cc: Add support for DjVu files.
2481 * docs/overview.rst: Document.
2483 Sat Jan 12 03:37:28 GMT 2008 Olly Betts <olly@survex.com>
2485 * freemem.cc: Check "defined HAVE_SYSMP" rather than just "HAVE_SYSMP".
2486 This doesn't change behaviour, but fixes a compile warning on
2487 platforms other than Linux and IRIX.
2489 Fri Dec 21 02:13:49 GMT 2007 Olly Betts <olly@survex.com>
2491 * NEWS: Bump release date.
2493 Thu Dec 20 21:40:34 GMT 2007 Olly Betts <olly@survex.com>
2495 * NEWS: Another update for 1.0.5.
2497 Thu Dec 20 20:08:58 GMT 2007 Olly Betts <olly@survex.com>
2499 * Makefile.am,scriptindex.cc: Fix scriptindex to insert a ':' between
2500 prefix and term using the same criteria which the QueryParser does.
2501 * scriptindex.cc,docs/scriptindex.rst: Action BOOLEAN now ignores an
2502 empty input rather than adding the prefix as a term. Action UNIQUE
2503 now issues an warning for empty input but otherwise ignores it.
2505 Thu Dec 20 17:44:57 GMT 2007 Olly Betts <olly@survex.com>
2507 * common/: Update to r9894 to pick up stringutils.cc.
2509 Wed Dec 19 03:44:50 GMT 2007 Olly Betts <olly@survex.com>
2511 * NEWS,configure.ac: Update for 1.0.5.
2513 Tue Dec 18 00:58:07 GMT 2007 Olly Betts <olly@survex.com>
2517 Thu Dec 13 01:38:43 GMT 2007 Olly Betts <olly@survex.com>
2519 * omindex.cc: Avoid rereading uncompressed AbiWord documents in order
2520 to calculate their MD5 checksums.
2522 Thu Dec 13 01:34:53 GMT 2007 Olly Betts <olly@survex.com>
2524 * omindex.cc: Improve comment wording.
2526 Thu Dec 13 00:59:35 GMT 2007 Olly Betts <olly@survex.com>
2528 * docs/overview.rst: Document that omindex limits resources that
2529 filter programs can use. Also add a note welcoming suggestions
2530 for additional reliable filter programs.
2532 Wed Dec 12 23:49:27 GMT 2007 Olly Betts <olly@survex.com>
2534 * Makefile.am,freemem.cc,freemem.h,runfilter.cc: Limit filter programs
2535 to 7/8 of free physical memory on platforms where we know how to
2536 determine this (currently at least Linux, FreeBSD, IRIX, HP-UX;
2537 probably Solaris and a few others too). Fixes bug#111.
2539 Wed Dec 12 18:20:34 GMT 2007 Olly Betts <olly@survex.com>
2541 * docs/termprefixes.rst: Note the version where we stopped generating
2542 terms with a 'W' prefix (0.9.7).
2544 Wed Dec 12 18:17:28 GMT 2007 Olly Betts <olly@survex.com>
2546 * docs/overview.rst: omindex hasn't generated "W"-prefix terms since
2547 0.9.7, so remove the documentation saying it does!
2549 Wed Dec 12 18:16:52 GMT 2007 Olly Betts <olly@survex.com>
2551 * docs/overview.rst: Update to mention how upper case in extensions is
2554 Wed Dec 12 17:49:12 GMT 2007 Olly Betts <olly@survex.com>
2556 * omindex.cc: If an extension isn't found in the mime_map and contains
2557 uppercase ASCII characters, see if the lower cased extension is in
2560 Wed Dec 12 02:09:02 GMT 2007 Olly Betts <olly@survex.com>
2562 * NEWS: Updated from ChangeLog in preparation for 1.0.5.
2564 Mon Dec 10 23:27:40 GMT 2007 Olly Betts <olly@survex.com>
2566 * omindex.cc: '-f' is documented by --help as a short option for
2567 '--follow', but wasn't previously actually recognised.
2569 Tue Nov 20 13:08:19 GMT 2007 Olly Betts <olly@survex.com>
2571 * htmlparse.cc: Add "using namespace std;" to ensure that
2572 std::strchr(), etc are imported into the global namespace.
2574 Tue Nov 20 01:01:13 GMT 2007 Richard Boulton <richard@lemurconsulting.com>
2576 * commonhelp.cc,diritor.cc,htmlparse.cc,omega.cc,scriptindex.cc:
2577 Add #include of cstring, to fix errors from gcc-4.3 snapshot.
2578 Tidy include ordering in htmlparse.cc
2580 Tue Nov 06 12:17:10 GMT 2007 Olly Betts <olly@survex.com>
2582 * docs/Makefile.am: No need to set SUFFIXES manually for suffixes used
2585 Mon Nov 05 19:32:41 GMT 2007 Olly Betts <olly@survex.com>
2587 * configure.ac: Probe for rst2html.
2589 Mon Nov 05 07:24:31 GMT 2007 Olly Betts <olly@survex.com>
2591 * Makefile.am,README,configure.ac,docs/,query.cc: Replace .txt docs
2592 with Jenny's RST-ified versions.
2594 Tue Oct 30 04:54:58 GMT 2007 Olly Betts <olly@survex.com>
2596 * NEWS,configure.ac: Update for 1.0.4.
2598 Sat Oct 27 05:32:06 BST 2007 Olly Betts <olly@survex.com>
2602 Sat Oct 27 05:30:28 BST 2007 Olly Betts <olly@survex.com>
2604 * query.cc: On balance, it's more helpful to users to moan about a
2605 template which tries to set the same user prefix as both boolean
2606 and probabilistic, even if previous releases didn't.
2608 Thu Oct 25 20:38:15 BST 2007 Olly Betts <olly@survex.com>
2610 * common/: Update to latest version.
2611 * query.cc: Remove STRINGIZE macro definition as this is now
2612 defined by stringutils.h.
2614 Fri Oct 19 16:17:47 BST 2007 Olly Betts <olly@survex.com>
2616 * query.cc: Fix for reverted add_prefix() API.
2618 Sun Sep 30 22:12:46 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2620 * query.cc: Use the new form of add_prefix() to avoid deprecation
2621 warnings at compile time. Carefully avoid calling
2622 add_prefix(f,p,PREFIX_FILTER) for a prefix which has already been
2623 set with add_prefix(f,p,PREFIX_INLINE), because this would cause
2624 an error (and we wish to avoid changing semantics of omegascript
2625 to avoid breaking existing scripts).
2628 Fri Sep 28 15:48:50 BST 2007 Olly Betts <olly@survex.com>
2630 * NEWS: Final (?) update for 1.0.3.
2632 Fri Sep 28 15:46:11 BST 2007 Olly Betts <olly@survex.com>
2634 * mbox2omega: Expand --help output.
2635 * docs/scriptindex.txt: Refer to mbox2omega as an example of how to
2638 Fri Sep 28 03:18:25 BST 2007 Olly Betts <olly@survex.com>
2642 Fri Sep 28 03:15:11 BST 2007 Olly Betts <olly@survex.com>
2644 * configure.ac: Update for 1.0.3. Use ustar format for tarball since
2645 we have to for xapian-core anyway.
2647 Fri Sep 28 02:42:28 BST 2007 Olly Betts <olly@survex.com>
2649 * ./: Update common SVN rev in svn:externals so the files are in
2650 sync with xapian-core.
2652 Wed Sep 19 16:09:36 BST 2007 Olly Betts <olly@survex.com>
2654 * NEWS: Update from ChangeLog entries since 1.0.2.
2656 Sat Sep 08 19:24:48 BST 2007 Olly Betts <olly@survex.com>
2658 * configure.ac,runfilter.cc: Impose a 5 minute CPU time limit on
2659 filter programs to prevent problems if a filter program goes into
2660 an infinite loop on a malformed input. Partly addresses bug#111.
2662 Fri Sep 07 21:22:43 BST 2007 Olly Betts <olly@survex.com>
2664 * omindex.cc: Fix comment typos.
2666 Fri Sep 07 20:56:50 BST 2007 Olly Betts <olly@survex.com>
2668 * docs/overview.txt,omindex.cc: Add supporting for indexing TeX DVI
2671 Thu Sep 06 20:59:57 BST 2007 Olly Betts <olly@survex.com>
2673 * query.cc: Fix bug in decimal fraction in $size for files >= 1M in
2676 Thu Sep 06 20:13:44 BST 2007 Olly Betts <olly@survex.com>
2678 * templates/query: Set HTML charset to utf-8 since that's what
2679 databases now are by default. Tidy up some HTML gremlins.
2680 Restyle to use CSS to draw a "score bar" instead of using
2681 images. Rework the layout of each hit. Add popup hints on
2682 mouse-over for various items.
2684 Thu Sep 06 18:12:07 BST 2007 Olly Betts <olly@survex.com>
2686 * scriptindex.cc: Fix line number tracking in dump files.
2688 Thu Sep 06 18:06:28 BST 2007 Olly Betts <olly@survex.com>
2690 * docs/omegascript.txt,query.cc: Add $muldiv{A,B,C} which calculates
2693 Thu Sep 06 03:36:36 BST 2007 Olly Betts <olly@survex.com>
2695 * runfilter.cc: Fix file description.
2697 Thu Sep 06 00:54:58 BST 2007 Olly Betts <olly@survex.com>
2699 * Makefile.am,omindex.cc,runfilter.cc,runfilter.h: Factor out the
2700 stdout_to_string() function into its own source file.
2702 Thu Sep 06 00:45:14 BST 2007 Olly Betts <olly@survex.com>
2704 * cgiparam.h,commonhelp.h,date.h,hashterm.h,htmlparse.h,loadfile.h,
2705 md5wrap.h,metaxmlparse.h,myhtmlparse.h,namedentities.h,omega.h,
2706 sample.h,utf8convert.h,utf8truncate.h,xmlparse.h: Add missing header
2707 guards and standardise existing header guards to use the form
2708 OMEGA_INCLUDED_FOO_H.
2710 Thu Sep 06 00:24:54 BST 2007 Olly Betts <olly@survex.com>
2712 * myhtmlparse.cc: Add '#include <config.h>'.
2713 * omega.h: Don't '#include <config.h>'.
2715 Mon Sep 03 19:16:37 BST 2007 Olly Betts <olly@survex.com>
2717 * docs/overview.txt,omindex.cc: Add support for indexing AbiWord
2720 Thu Jul 05 00:37:35 BST 2007 Olly Betts <olly@survex.com>
2722 * NEWS: Final (?) update for 1.0.2.
2724 Thu Jul 05 00:33:14 BST 2007 Olly Betts <olly@survex.com>
2726 * omindex.cc: Report files we aren't indexing because their extensions
2729 Wed Jul 04 21:22:02 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2731 * NEWS: Update with release date for release 1.0.2
2733 Wed Jul 04 20:43:22 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2735 * configure.ac: Bump version to 1.0.2.
2737 Wed Jul 04 17:34:15 BST 2007 Olly Betts <olly@survex.com>
2741 Wed Jul 04 17:31:38 BST 2007 Olly Betts <olly@survex.com>
2743 * Makefile.am,omindex.cc,query.cc: Use stringutils.h from common.
2744 * ./: Update common SVN rev in svn:externals to get the latest
2746 * cgiparam.cc: Use string::resize() rather than assigning from a
2747 substring of the string.
2749 Mon Jul 02 16:42:01 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2751 * htmlparsetest.cc,md5test.cc: Add #include <stdlib.h>, to get a
2752 definition for exit(). Fixes compilation with gcc-snapshot.
2754 Thu Jun 28 18:05:18 BST 2007 Olly Betts <olly@survex.com>
2756 * omindex.cc: If --url isn't passed, default to "/", but print a
2757 warning noting that this default has been used (at least for now).
2759 Thu Jun 28 18:04:53 BST 2007 Olly Betts <olly@survex.com>
2761 * docs/scriptindex.txt: Fix typo.
2763 Wed Jun 27 15:44:30 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2765 * NEWS: Remove the items which aren't really interesting to users.
2767 Wed Jun 27 14:26:26 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2769 * common/: Update svn:externals property to use latest version.
2773 Sat Jun 23 13:11:15 BST 2007 Olly Betts <olly@survex.com>
2775 * diritor.h: Delete random extra blank line.
2777 Sat Jun 23 13:08:35 BST 2007 Olly Betts <olly@survex.com>
2779 * omega.cc,query.cc: Use Xapian::BAD_VALUENO.
2781 Sat Jun 16 11:06:08 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2783 * Makefile.am: Pass value of XAPIAN_CONFIG to distcheck, to ensure
2784 that it works with uninstalled copies of Xapian.
2786 Mon Jun 11 03:34:53 BST 2007 Olly Betts <olly@survex.com>
2788 * NEWS: Minor wording improvement.
2790 Mon Jun 11 03:33:37 BST 2007 Olly Betts <olly@survex.com>
2792 * NEWS: Probably the final update for 1.0.1.
2794 Sun Jun 10 22:00:23 BST 2007 Olly Betts <olly@survex.com>
2796 * configure.ac: Drop automake requirement to 1.8.3 to allow RPM spec
2797 file to work on SLES 9.
2799 Sun Jun 10 21:49:45 BST 2007 Olly Betts <olly@survex.com>
2801 * configure.ac: Bump version to 1.0.1.
2803 Sun Jun 10 02:16:54 BST 2007 Olly Betts <olly@survex.com>
2807 Sat Jun 09 15:20:25 BST 2007 Olly Betts <olly@survex.com>
2809 * Makefile.am,diritor.cc,diritor.h,omindex.cc: Under Linux (at least)
2810 struct dirent can tell us the type of a directory entry for some
2811 filing systems, so make use of this to avoid calling stat() (or
2812 lstat()) unnecessarily - when indexing /usr/share/doc on my Linux
2813 box, this saves about 14000 explicit calls to stat (leaving about
2816 Thu Jun 07 01:40:43 BST 2007 Olly Betts <olly@survex.com>
2820 Wed Jun 06 15:45:33 BST 2007 Olly Betts <olly@survex.com>
2822 * docs/scriptindex.txt: Document that you can delete a document by
2823 giving a new document which only contains the unique term.
2825 Mon Jun 04 16:40:18 BST 2007 Richard Boulton <richard@lemurconsulting.com>
2827 * Makefile.am: Only add manpages to dist_man_MANS if we're not in
2828 maintainer mode with documentation generation turned off.
2830 Thu May 31 20:02:16 BST 2007 Olly Betts <olly@survex.com>
2834 Thu May 31 19:16:37 BST 2007 Olly Betts <olly@survex.com>
2836 * configure.ac: Relax automake requirement to 1.9.2 to allow RPM
2839 Wed May 30 14:42:40 BST 2007 Olly Betts <olly@survex.com>
2841 * NEWS: Update for changes since 1.0.0. Removed unused subheading
2844 Wed May 30 10:24:57 BST 2007 Olly Betts <olly@survex.com>
2846 * query.cc: Fix handling of query parsing errors (broken by changes in
2849 Tue May 29 01:19:21 BST 2007 Olly Betts <olly@survex.com>
2851 * docs/overview.txt: We no longer use pstotext for PostScript, but
2852 instead use ps2pdf followed by pdftotext, so update the docs to
2855 Fri May 18 03:36:28 BST 2007 Olly Betts <olly@survex.com>
2857 * htmlparsetest.cc,myhtmlparse.cc: Fix bug in HTML parser - if the
2858 text between tags consisted entirely of whitespace it would just be
2859 ignored which could run words together. Add regression test, plus
2860 another test for other whitespace handling.
2862 Thu May 17 22:27:47 BST 2007 Olly Betts <olly@survex.com>
2864 * NEWS: Final update before release.
2866 Thu May 17 20:48:25 BST 2007 Olly Betts <olly@survex.com>
2870 Thu May 17 20:46:43 BST 2007 Olly Betts <olly@survex.com>
2872 * docs/termprefixes.txt: Update to include 'Z' prefix and mention
2873 that 'R' and 'W' aren't used by Xapian now.
2875 Thu May 17 19:11:04 BST 2007 Olly Betts <olly@survex.com>
2877 * configure.ac: Bump version to 1.0.0.
2879 Thu May 17 18:11:19 BST 2007 Olly Betts <olly@survex.com>
2881 * common/: Update to latest xapian-core revision to pull in 2 argument
2882 mkdir() wrapper for Mingw.
2884 Thu May 17 03:29:44 BST 2007 Olly Betts <olly@survex.com>
2886 * Makefile.am,configure.ac: Add support for --disable-documentation
2887 like xapian-core now has.
2888 * configure.ac: Only enable -Werror on --enable-maintainer-mode for
2889 GCC 4 or newer, in line with change in xapian-core.
2891 Thu May 17 03:22:10 BST 2007 Olly Betts <olly@survex.com>
2893 * NEWS: Update for 1.0.0.
2895 Wed May 16 03:09:44 BST 2007 Olly Betts <olly@survex.com>
2899 Tue May 15 18:50:47 BST 2007 Olly Betts <olly@survex.com>
2901 * configure.ac: Add AC_TYPE_PID_T.
2903 Tue May 15 04:22:40 BST 2007 Olly Betts <olly@survex.com>
2905 * omindex.cc: Remove FIXME comment which has already been addressed.
2907 Mon May 14 04:38:49 BST 2007 Olly Betts <olly@survex.com>
2909 * docs/omegascript.txt: Update docs for $prettyterm{TERM}.
2911 Mon May 14 04:31:01 BST 2007 Olly Betts <olly@survex.com>
2913 * omega.cc,omega.h,query.cc,query.h: Rejig how $topterms and other
2914 cases handle terms to fit with the new term generation scheme.
2915 Add 'you' and 'your' as stopwords.
2917 Thu May 10 04:48:43 BST 2007 Olly Betts <olly@survex.com>
2919 * ./: Update svn:externals to pull in r8538 of xapian-core's common
2921 * Makefile.am: Add common/safe.cc to scriptindex_SOURCES.
2923 Thu May 10 01:09:14 BST 2007 Olly Betts <olly@survex.com>
2925 * templates/,Makefile.am: The 'query' template no longer uses
2926 $topterms by default - to get them, use the new 'topterms' template.
2927 Also the template fragments which aren't intended for direct use
2928 have been move to templates/inc/.
2929 * docs/overview.txt: Document what each of the OmegaScript templates
2931 * docs/quickstart.txt: Assorted minor improvements.
2932 * xapian-omega.spec.in: Update to install templates/inc too.
2934 Wed May 09 23:43:57 BST 2007 Olly Betts <olly@survex.com>
2936 * docs/omegascript.txt,query.cc: Instead of appending a dot to
2937 indicate a stemmed term, wrap the term in double quotes.
2939 Sun May 06 21:41:21 BST 2007 Olly Betts <olly@survex.com>
2941 * omindex.cc,scriptindex.cc: Removed commented out code for generating
2942 "W" prefix terms for date searching. We've never made use of them
2943 in Omega, and we'll be moving to using DateMatchDecider by default
2946 Sun May 06 16:00:47 BST 2007 Olly Betts <olly@survex.com>
2948 * configure.ac: Set version to mythical 0.9.99.
2950 Sun May 06 15:52:08 BST 2007 Olly Betts <olly@survex.com>
2952 * Makefile.am,configure.ac,omega.spec.in,xapian-omega.spec.in:
2953 Update RPM spec file to reflect tarball name change from omega
2954 to xapian.omega (patch from Fabrice Colin). Also rename omega.spec
2955 to xapian-omega.spec (rpmbuild looks for any .spec file, but it's
2956 more consistent to keep the names in step).
2958 Fri May 04 19:52:44 BST 2007 Olly Betts <olly@survex.com>
2960 * omindex.cc,scriptindex.cc: Use new TermGenerator convenience methods
2961 which take std::string instead of Utf8Iterator.
2963 Fri May 04 13:32:11 BST 2007 Olly Betts <olly@survex.com>
2965 * Makefile.am,configure.ac,makemanpage.in: Use makemanpage to generate
2968 Fri May 04 13:30:36 BST 2007 Olly Betts <olly@survex.com>
2970 * commonhelp.cc: Add missing full stop in description of --stemmer.
2972 Fri May 04 04:10:23 BST 2007 Olly Betts <olly@survex.com>
2974 * query.cc: Explicitly include stdlib.h since we use atoi().
2976 Thu May 03 15:16:31 BST 2007 Olly Betts <olly@survex.com>
2978 * Makefile.am,indextext.cc,indextext.h,omindex.cc,scriptindex.cc:
2979 Update to use new TermGenerator class.
2981 Thu May 03 04:03:35 BST 2007 Olly Betts <olly@survex.com>
2983 * ./: Update svn:externals to pull rev8430 of xapian-core's common
2985 * scriptindex.cc: Remove sleep() wrapper.
2987 Wed May 02 03:26:38 BST 2007 Olly Betts <olly@survex.com>
2989 * docs/omegascript.txt,query.cc: Removed $freqs as it has been
2990 deprecated for ages.
2992 Wed May 02 03:19:18 BST 2007 Olly Betts <olly@survex.com>
2994 * docs/scriptindex.txt: Explicitly note that index=nopos is deprecated
2995 (scriptindex already emits a warning).
2997 Wed May 02 03:17:03 BST 2007 Olly Betts <olly@survex.com>
2999 * docs/cgiparams.txt: FMT isn't limited to just `a-z' - the
3000 actual restriction is that it may not contain `..'.
3002 Wed May 02 03:02:53 BST 2007 Olly Betts <olly@survex.com>
3004 * scriptindex.cc: Remove -q and -u options - they no longer do
3005 anything and are only accepted for compatibility with really old
3006 versions (0.6.1 and earlier and 0.7.5 and earlier respectively).
3008 Wed Apr 25 21:47:48 BST 2007 Olly Betts <olly@survex.com>
3010 * Makefile.am: omega doesn't need indextext.cc.
3012 Wed Apr 25 21:46:25 BST 2007 Olly Betts <olly@survex.com>
3014 * query.cc: Remove unused `#include "indextext.h"'.
3016 Wed Apr 25 02:37:15 BST 2007 Olly Betts <olly@survex.com>
3018 * Makefile.am,configure.ac: Add support like xapian-core has for
3019 `configure --enable-quiet', `make QUIET=' and `make QUIET=y'.
3021 Mon Apr 23 15:42:24 BST 2007 Olly Betts <olly@survex.com>
3023 * date.cc,datematchdecider.cc,utils.cc: Fix compilation with GCC 4.3
3026 Mon Apr 23 15:38:00 BST 2007 Olly Betts <olly@survex.com>
3028 * portability/mkdtemp.cc: config.h should always be included first and
3029 with angle brackets. Use safeerrno.h not errno.h. No special
3030 headers are required here for __CYGWIN__, and safesysstat.h provides
3031 a two argument wrapper for mkdir, so we don't need any
3032 __WIN32__-specific magic either.
3034 Mon Apr 23 12:14:01 BST 2007 Richard Boulton <richard@lemurconsulting.com>
3036 * portability/mkdtemp.cc: Patch from Charlie Hull to fix windows
3038 * scriptindex.cc: #include <time.h> in scriptindex.cc for
3041 Sat Apr 21 23:31:02 BST 2007 Olly Betts <olly@survex.com>
3043 * strcasecmp.h: New header containing magic to provide strcasecmp()
3045 * query.cc,utf8convert.cc: Use strcasecmp.h.
3046 * Makefile.am,cdb_init.cc,cdb_int.h,configfile.cc,getopt.cc,
3047 loadfile.cc,md5wrap.cc,omega.cc,omindex-config.cc,omindex.cc,
3048 query.cc,scriptindex.cc,utf8convert.cc: Add xapian-core's common/
3049 subdirectory as an svn:external so we can (a) share copies of
3050 gnu_getopt.h and getopt.cc and (b) make use of the "safeunistd.h"
3053 Sat Apr 21 23:06:49 BST 2007 Olly Betts <olly@survex.com>
3055 * metaxmlparse.cc,metaxmlparse.h: Fix summary comments at the top of
3058 Sat Apr 21 20:42:03 BST 2007 Olly Betts <olly@survex.com>
3060 * omindex.cc: xapian.h no longer pulls in time.h, which exposes that
3061 we weren't explicitly including it here!
3063 Sat Apr 21 20:27:43 BST 2007 Olly Betts <olly@survex.com>
3065 * configure.ac: We require automake 1.9.5 for xapian-core, so require
3066 it here too for consistency. Turn on automake -Wportability option.
3068 Sat Apr 21 20:24:17 BST 2007 Olly Betts <olly@survex.com>
3070 * configure.ac: Probe for ssize_t and mode_t and define replacements
3071 if we don't find them.
3073 Fri Apr 20 14:38:57 BST 2007 Olly Betts <olly@survex.com>
3075 * datematchdecider.h,omega.h,datematchdecider.cc: Update return
3076 types of MatchDecider and ExpandDecider subclasses.
3078 Wed Apr 18 23:44:36 BST 2007 Olly Betts <olly@survex.com>
3080 * utf8convert.cc: Fix to compile when USE_ICONV isn't defined (to_utf8
3081 is now in the Xapian::Unicode namespace).
3083 Wed Apr 18 23:15:26 BST 2007 Olly Betts <olly@survex.com>
3085 * docs/cgiparams.txt,query.cc: Remove "bias_weight" and
3086 "bias_halflife" CGI parameters since they rely on
3087 Enquire::set_bias() which has been removed.
3089 Tue Apr 17 21:45:40 BST 2007 Richard Boulton <richard@lemurconsulting.com>
3091 * Makefile.am: Link htmlparsetest with Xapian library to get access
3094 Tue Apr 17 02:22:42 BST 2007 Olly Betts <olly@survex.com>
3096 * htmlparse.cc: nonascii_to_utf8 is now in the public API.
3098 Tue Apr 17 00:55:17 BST 2007 Olly Betts <olly@survex.com>
3100 * Makefile.am,htmlparse.cc,indextext.cc,indextext.h,query.cc,sample.cc,
3101 scriptindex.cc,tclUniData.cc,tclUniData.h,utf8convert.cc,utf8itor.cc,
3102 utf8itor.h,utf8test.cc: Use the new Unicode API routines in the core
3103 Xapian library instead of local copies.
3105 Thu Apr 12 17:04:07 BST 2007 Olly Betts <olly@survex.com>
3107 * Makefile.am: omega and scriptindex both need tclUniData.cc.
3109 Sat Mar 31 19:58:29 BST 2007 Olly Betts <olly@survex.com>
3111 * query.cc: $filesize{0} is now "0 bytes", $filesize{1} is now "1
3112 byte", $filesize{SIZE} where SIZE is negative is now "". Fix
3113 "comparison of signed and unsigned" warning. Use "%c" to generate
3114 the fractional part.
3115 * docs/omegascript.txt: Document that $filesize{SIZE} is "" when SIZE
3118 Sat Mar 31 18:25:55 BST 2007 Olly Betts <olly@survex.com>
3120 * query.cc: Ensure that the result of snprintf is zero terminated
3121 since MSVC's snprintf is broken (by design it seems).
3122 * query.cc,docs/omegascript.txt: $filesize enhanced to return a
3123 decimal point for K, M, and G (e.g. "2.1K" and "4.0M" rather than
3126 Fri Mar 30 19:57:00 BST 2007 Olly Betts <olly@survex.com>
3128 * portability/mkdtemp.cc: Fixes for mingw.
3130 Fri Mar 30 02:22:59 BST 2007 Olly Betts <olly@survex.com>
3132 * Makefile.am,scriptindex.cc,utf8truncate.cc,utf8truncate.h: The
3133 "truncate" action now knows not to chop off a multibyte utf-8
3136 Fri Mar 30 02:19:05 BST 2007 Olly Betts <olly@survex.com>
3138 * Makefile.am,omindex.cc,sample.cc,sample.h: New sample generating
3139 function which normalises all runs of whitespace to a single space,
3140 and fixes invalid utf-8 in the sample. This means we can now index
3141 an iso-8859-1 text file and mostly get the same results as if it
3144 Thu Mar 29 23:12:20 BST 2007 Olly Betts <olly@survex.com>
3146 * scriptindex.cc: Fix optimisation of "load truncate=N" to actually
3149 Thu Mar 29 18:54:11 BST 2007 Olly Betts <olly@survex.com>
3151 * configure.ac: Probe for mkdtemp.
3152 * Makefile.am: Add portability/mkdtemp.cc to omindex_SOURCES if
3153 configure didn't detect it.
3154 * omindex.cc: Prototype mkdtemp if configure didn't detect it.
3156 Thu Mar 29 18:47:50 BST 2007 Olly Betts <olly@survex.com>
3158 * portability/mkdtemp.cc: Fix to compile as C++. Replace isdigit()
3159 with a simple range test to avoid locale related quirks.
3161 Thu Mar 29 18:28:25 BST 2007 Olly Betts <olly@survex.com>
3163 * portability/mkdtemp.cc: Add portable implementation of mkdtemp for
3164 use on platforms which don't supply it.
3166 Thu Mar 29 17:22:18 BST 2007 Olly Betts <olly@survex.com>
3168 * omindex.cc: Index PostScript by converting to PDF with ps2pdf and
3169 then indexing that. This allows us to index PostScript files
3170 containing Unicode characters outside of iso-8859-1, and also
3171 means we now get metadata from PostScript files.
3173 Thu Mar 29 03:14:55 BST 2007 Olly Betts <olly@survex.com>
3175 * omega.spec.in: Update to handle documentation being installed in
3176 $prefix/share/doc/xapian-omega.
3178 Tue Mar 27 21:42:19 BST 2007 Olly Betts <olly@survex.com>
3180 * configure.ac: datarootdir is new in 2.60 too, so use datadir when
3181 setting docdir for 2.59.
3183 Mon Mar 26 15:47:53 BST 2007 Olly Betts <olly@survex.com>
3185 * configure.ac: Add code to ensure that docdir is set for autoconf
3186 2.59 (starting from 2.60, it is defined as standard).
3187 * Makefile.am: Use docdir for installing docs. This means that the
3188 documentation now goes in $prefix/share/doc/xapian-omega rather
3189 than $prefix/share/doc/omega, which is better really.
3191 Sat Mar 24 17:21:32 GMT 2007 Olly Betts <olly@survex.com>
3193 * query.cc: Prefer static char[] to static char * (gives better
3196 Sat Mar 24 17:19:18 GMT 2007 Olly Betts <olly@survex.com>
3198 * omega.cc: Prefer static char[] to static char * (gives better
3201 Sat Mar 24 17:16:49 GMT 2007 Olly Betts <olly@survex.com>
3203 * configfile.cc: Prefer static char[] to static char * (gives better
3206 Thu Mar 22 01:11:52 GMT 2007 Olly Betts <olly@survex.com>
3208 * configure.ac: Eliminate libtool probe code for f77, gcj, and rc
3209 which speeds up configure and knocks 29% off its size.
3211 Tue Mar 06 01:56:00 GMT 2007 Olly Betts <olly@survex.com>
3213 * configure.ac: Bump version number to 0.9.10 so that snapshots don't
3214 look older than releases.
3216 Sun Mar 04 14:42:18 GMT 2007 Olly Betts <olly@survex.com>
3218 * TODO: Remove entries which have already been done!
3220 Sat Mar 03 02:24:42 GMT 2007 Olly Betts <olly@survex.com>
3222 * utf8test.cc: Add single utf-8 sequence decoding tests.
3224 Fri Mar 02 00:18:09 GMT 2007 Olly Betts <olly@survex.com>
3226 * configure.ac: Perform a link test for posix_fadvise to fix
3227 misdetection on HP-UX.
3229 Thu Mar 01 21:48:57 GMT 2007 Olly Betts <olly@survex.com>
3231 * utf8itor.h: Add cast to suppress warning from aCC.
3233 Thu Mar 01 21:00:56 GMT 2007 Olly Betts <olly@survex.com>
3235 * configure.ac: Check we can link with libiconv, not just compile.
3236 Some of the HP-UX hosts in the HP testdrive seem to have headers
3237 but no matching library.
3239 Thu Mar 01 18:02:37 GMT 2007 Olly Betts <olly@survex.com>
3241 * myhtmlparse.cc: Remove unused function. Move "#include <string.h>"
3244 Thu Feb 22 15:45:25 GMT 2007 Olly Betts <olly@survex.com>
3246 * configure.ac: xapian-config --cxxflags now includes -ptused for
3247 SGI's C++ compiler, so we don't need to probe for it here.
3249 Wed Feb 21 15:17:07 GMT 2007 Olly Betts <olly@survex.com>
3251 * docs/termprefixes.txt: Expand section on boolean prefixes, showing
3252 how to generate them using scriptindex, and how to allow them to be
3253 selected in an HTML form.
3255 Mon Feb 19 12:51:24 GMT 2007 Olly Betts <olly@survex.com>
3257 * configure.ac: Previous fix doesn't work. Just drop -O2 instead -
3258 users of SGI's CC can specify "./configure CXXFLAGS=-O2" is they
3261 Sun Feb 18 21:44:09 GMT 2007 Olly Betts <olly@survex.com>
3263 * configure.ac: For SGI's CC, -g overrides -g3 if it comes afterwards,
3264 so we need to modify CXXFLAGS rather than just setting AM_CXXFLAGS.
3266 Sat Feb 17 19:25:04 GMT 2007 Olly Betts <olly@survex.com>
3268 * docs/overview.txt,omindex.cc: Add support for indexing MS Works
3269 documents using wps2text (part of libwps).
3271 Sat Feb 17 19:06:03 GMT 2007 Olly Betts <olly@survex.com>
3273 * omindex.cc: Don't index empty files.
3275 Fri Feb 16 21:14:35 GMT 2007 Olly Betts <olly@survex.com>
3277 * NEWS: Add note that Omega < 0.8.0 NEWS entries are in the
3278 xapian-core NEWS file.
3280 Fri Feb 16 20:34:10 GMT 2007 Olly Betts <olly@survex.com>
3282 * indextext.cc: Now I've fixed the bug in UTF-8 decoding, the check
3283 for zero length terms is no longer required.
3285 Fri Feb 16 19:34:48 GMT 2007 Olly Betts <olly@survex.com>
3287 * tclUniData.h,utf8itor.h: The tcl unicode routines only have tables
3288 for characters in the BMP. For other characters, assume they're
3289 word characters, but can't be forced to lowercase.
3291 Fri Feb 16 19:19:11 GMT 2007 Olly Betts <olly@survex.com>
3293 * utf8itor.cc: Fix bug in decoding of 4 byte utf-8 sequences
3294 - the returned value was 0x400000 too large! Fixes bug#106.
3296 Thu Feb 15 19:42:36 GMT 2007 Olly Betts <olly@survex.com>
3298 * indextext.cc,query.cc: Keep embedded apostrophe's in terms rather
3299 than relying on generating a phrase search for them.
3301 Thu Feb 15 05:38:12 GMT 2007 Olly Betts <olly@survex.com>
3303 * Makefile.am,datematchdecider.cc,datematchdecider.h,
3304 docs/cgiparams.txt,query.cc: Add an alternative implementation
3305 of date range filtering which uses a MatchDecider. This allows
3306 everything that the existing implementation does, plus you can
3307 support sorting on a choice of dates (e.g. first published or
3308 last updated), and filtering works to a resolution of a minute
3309 rather than a day. Since omindex now adds the last modified
3310 date as value 0, this will work with omindex.
3312 Thu Feb 15 04:38:32 GMT 2007 Olly Betts <olly@survex.com>
3314 * configure.ac: SGI's CC needs -g3 instead of -g if we want to use
3317 Sat Feb 10 20:53:14 GMT 2007 Olly Betts <olly@survex.com>
3319 * md5.cc: Fix reversed preprocessor conditional so that we generate
3320 correct MD5 checksums on big endian platforms.
3322 Sat Feb 10 20:19:23 GMT 2007 Olly Betts <olly@survex.com>
3324 * md5.cc: No need to byte swap when we've just zero filled!
3326 Sat Feb 10 18:54:33 GMT 2007 Olly Betts <olly@survex.com>
3328 * indextext.cc,query.cc: Prefer Xapian::Stem::operator() to
3329 Xapian::Stem::stem_word().
3331 Fri Feb 09 05:53:29 GMT 2007 Olly Betts <olly@survex.com>
3333 * docs/omegascript.txt: Rewrite introductory paragraph. Note that
3334 whitespace is significant, and add explicit warning to $setmap.
3336 Mon Jan 1 01:56:56 GMT 2007 Richard Boulton <richard@lemurconsulting.com>
3338 * indextext.cc: Fix parsing of text containing certain unicode
3339 characters. Such text could have resulted in zero length terms
3340 being added to documents. (The minimal example I found causing
3341 this problem was a document containing only the unicode character
3342 0x28a0f, which is a CJK Unified Ideograph).
3344 Addresses bug #106, though may not be a complete fix - see the
3347 Sun Dec 31 17:22:56 GMT 2006 Richard Boulton <richard@lemurconsulting.com>
3349 * scriptindex.cc: Update short option list for scriptindex to match
3350 documented usage (-h, -V and -s were not working).
3352 Thu Dec 21 14:57:28 GMT 2006 Olly Betts <olly@survex.com>
3354 * query.cc: Remove support for xB, xDATE1, xDATE2, xDAYSMINUS,
3355 and xDEFAULTOP which were deprecated in favour of xFILTER in
3356 0.7.5 (over 3 years ago).
3358 Thu Dec 21 14:52:38 GMT 2006 Olly Betts <olly@survex.com>
3360 * docs/cgiparams.txt: Remove documentation of the removed deprecated
3363 Thu Dec 21 14:39:04 GMT 2006 Olly Betts <olly@survex.com>
3365 * omega.cc,query.cc: Remove deprecated aliases for CGI parameters
3366 (deprecated in 0.6.3 or 0.6.5, more than 3.5 years ago):
3367 RAW_SEARCH (now RAWSEARCH), DATE1 (now START), DATE2 (now END),
3368 DAYSMINUS (now SPAN but with slightly different semantics),
3369 and MIN_HITS (now MINHITS).
3371 Thu Dec 21 01:04:00 GMT 2006 Olly Betts <olly@survex.com>
3373 * utf8convert.cc: Fix headers included for iconv and not-iconv.
3375 Wed Dec 20 23:53:41 GMT 2006 Olly Betts <olly@survex.com>
3377 * configure.ac,utf8convert.cc: If iconv isn't found by configure, fall
3378 back on simple conversion routines which handle iso-8859-1.
3379 Configuring --without-iconv forces these routines to be used.
3380 Configuring --with-iconv forces configure to fail if it can't find
3383 Tue Dec 19 20:35:04 GMT 2006 Olly Betts <olly@survex.com>
3385 * utf8itor.h: Need <string.h> for strlen.
3387 Tue Dec 19 19:53:52 GMT 2006 Olly Betts <olly@survex.com>
3389 * Makefile.am,configure.ac: Add "-liconv" if it's needed. If we're on
3390 OS X, also check for libiconv installed with fink.
3392 Fri Dec 15 05:43:40 GMT 2006 Olly Betts <olly@survex.com>
3394 * values.h: Add include guard.
3396 Sun Dec 10 04:33:26 GMT 2006 Olly Betts <olly@survex.com>
3398 * query.cc: Fix $substr{} with negative start to actually work. Fix
3399 $substr{} to never cause a C++ exception.
3400 * docs/omegascript.txt,query.cc: Enhance $substr{} to accept a
3401 negative length (meaning to count back from the end of the string).
3403 Sun Dec 10 03:05:09 GMT 2006 Olly Betts <olly@survex.com>
3405 * commonhelp.cc: "--help" now says that the default stemming language
3408 Thu Nov 16 23:06:25 GMT 2006 Olly Betts <olly@survex.com>
3410 * docs/omegascript.txt,query.cc,utils.cc,utils.h: Add $weight command
3411 to OmegaScript which returns the raw document weight - mostly useful
3412 for debugging purposes.
3414 Thu Nov 16 04:02:10 GMT 2006 Olly Betts <olly@survex.com>
3416 * omega.spec.in: Remove "." from the end of the Summary.
3418 Thu Nov 16 03:03:25 GMT 2006 Olly Betts <olly@survex.com>
3420 * configure.ac: As of xapian-core 0.8.0, XO_LIB_XAPIAN doesn't need to
3421 be called with arguments if you want a hard requirement on xapian,
3422 so remove the arguments.
3424 Thu Nov 16 02:07:31 GMT 2006 Olly Betts <olly@survex.com>
3426 * configure.ac: Change the project name to "xapian-omega" since that's
3427 what the RPMs and Debian packages call it (there's a Rogue-like game
3430 Thu Nov 16 02:01:55 GMT 2006 Olly Betts <olly@survex.com>
3432 * omega.cc: Fix backwards setting of sort_after. Fix generation of
3433 sort setup flags for filters.
3435 Thu Nov 16 01:21:32 GMT 2006 Olly Betts <olly@survex.com>
3437 * docs/cgiparams.txt,omega.cc,omega.h,query.cc: Implement new CGI
3438 parameters for finer control of sorting and ranking - SORTAFTER
3440 * omega.cc: Set up the filters variable so we know to revert to
3441 page 1 if the sorting options are changed.
3443 Tue Nov 14 15:27:09 GMT 2006 Olly Betts <olly@survex.com>
3445 * md5test.cc: Need <stdio.h> for sprintf.
3447 Tue Nov 14 03:19:13 GMT 2006 Olly Betts <olly@survex.com>
3449 * configure.ac: Note a couple of platforms which take the different
3452 Tue Nov 14 03:16:37 GMT 2006 Olly Betts <olly@survex.com>
3454 * configure.ac,utf8convert.cc: The input pointer to iconv can be
3455 either "char **" or "const char **" so probe at configure time.
3457 Mon Nov 13 20:22:50 GMT 2006 Olly Betts <olly@survex.com>
3459 * utf8convert.cc: Need <algorithm> for swap().
3461 Mon Nov 13 02:27:51 GMT 2006 Olly Betts <olly@survex.com>
3463 * Makefile.am,md5test.cc: Add tests for md5 code.
3465 Mon Nov 13 02:06:51 GMT 2006 Olly Betts <olly@survex.com>
3467 * Merge in utf8 branch:
3469 Fri Sep 15 06:03:50 BST 2006 Olly Betts <olly@survex.com>
3471 * utf8convert.cc: Compilation fix for Sun C++.
3473 Thu Sep 14 23:55:20 BST 2006 Olly Betts <olly@survex.com>
3475 * Makefile.am,htmlparse.cc,htmlparse.h,indextext.cc,
3476 indextext.h,makesymboltabh.pl,myhtmlparse.cc,myhtmlparse.h,
3477 namedentities.h,omindex.cc,query.cc,scriptindex.cc,
3478 symboltab.h,tclUniData.cc,tclUniData.h,utf8convert.cc,
3479 utf8convert.h,utf8itor.cc,utf8itor.h, utf8test.cc: Convert
3482 Thu Nov 09 00:20:19 GMT 2006 Olly Betts <olly@survex.com>
3484 * NEWS,configure.ac: Update for 0.9.9.
3486 Wed Nov 08 22:45:10 GMT 2006 Olly Betts <olly@survex.com>
3488 * omega.spec.in: Run "autoreconf --force" to avoid rpath on x86_64
3491 Sun Nov 05 17:08:48 GMT 2006 Olly Betts <olly@survex.com>
3493 * scriptindex.cc: The "date" action was modifying the value it
3494 operated on, which it isn't meant to do - fixed.
3496 Sun Nov 05 02:25:48 GMT 2006 Olly Betts <olly@survex.com>
3498 * query.cc: Report an error if $setmap is called with an even number
3501 Thu Nov 02 16:08:27 GMT 2006 Olly Betts <olly@survex.com>
3503 * NEWS,configure.ac: Update for 0.9.8.
3505 Thu Nov 02 15:43:31 GMT 2006 Olly Betts <olly@survex.com>
3507 * configure.ac: Update comment about "-ptused".
3509 Wed Nov 01 16:23:13 GMT 2006 Olly Betts <olly@survex.com>
3511 * cdb_init.cc: Fix warning in mingw build.
3513 Wed Nov 01 13:43:54 GMT 2006 Olly Betts <olly@survex.com>
3515 * cdb_init.cc,query.cc: Fix warnings.
3517 Wed Nov 01 04:00:20 GMT 2006 Olly Betts <olly@survex.com>
3519 * md5.cc,md5.h: Fix warnings about changing alignment requirements
3520 when casting pointers.
3522 Tue Oct 31 02:47:23 GMT 2006 Olly Betts <olly@survex.com>
3524 * cdb_init.cc,configure.ac,getopt.cc,omega.cc,query.cc,scriptindex.cc:
3525 Enable more warnings for GCC (and fix them in the code). Enable
3526 appropriate warnings for Intel's C++ compiler.
3528 Tue Oct 31 00:02:19 GMT 2006 Olly Betts <olly@survex.com>
3530 * htmlparsetest.cc,omindex.cc: Fix GCC warnings.
3532 Mon Oct 30 23:57:09 GMT 2006 Olly Betts <olly@survex.com>
3534 * query.cc: $substr where the start is negative and longer than the
3535 string (e.g. $substr{abcd,-5,1}) should now work as intended.
3537 Mon Oct 30 21:02:18 GMT 2006 Olly Betts <olly@survex.com>
3539 * scriptindex.cc: Fix GCC warnings uncovered by actually substituting
3542 Mon Oct 30 21:01:26 GMT 2006 Olly Betts <olly@survex.com>
3544 * configure.ac: Actually substitute AM_CXXFLAGS in the Makefile.
3545 * configure.ac: Fix AM_CXXFLAGS for IRIX.
3547 Sat Oct 28 12:31:31 BST 2006 Olly Betts <olly@survex.com>
3549 * myhtmlparse.cc: Add missing "#include <ctype.h>".
3551 Sat Oct 28 02:23:09 BST 2006 Olly Betts <olly@survex.com>
3553 * htmlparse.cc,indextext.cc,indextext.h,myhtmlparse.cc,omega.cc,
3554 omega.h,omindex.cc,query.cc,scriptindex.cc: Ensure that we always
3555 pass an unsigned char value to isupper(), toupper(), etc as they
3556 are undefined on other values (glibc makes them work for signed
3557 char values too, but this is an extension).
3559 Fri Oct 27 00:36:34 BST 2006 Olly Betts <olly@survex.com>
3561 * configure.ac,md5.h,values.h: HAVE_STDINT_H is already defined
3562 by autoconf based on trying the C compiler with AC_CHECK_HEADERS
3563 so define HAVE_WORKING_STDINT_H instead.
3565 Wed Oct 25 01:36:43 BST 2006 Olly Betts <olly@survex.com>
3567 * configure.ac: Need a more sophisticated test for the stdint.h
3570 Tue Oct 24 02:12:13 BST 2006 Olly Betts <olly@survex.com>
3572 * metaxmlparse.cc,omega.h: Fix warnings from SGI's C++ compiler.
3574 Tue Oct 24 02:11:11 BST 2006 Olly Betts <olly@survex.com>
3576 * htmlparse.cc,query.cc,scriptindex.cc: Remove unused static
3579 Tue Oct 24 01:51:05 BST 2006 Olly Betts <olly@survex.com>
3581 * configure.ac: Pass magic options to SGI's C++ compiler to allow
3582 linking of templates to work.
3584 Tue Oct 24 00:46:06 BST 2006 Olly Betts <olly@survex.com>
3586 * configure.ac: IRIX doesn't allow stdint.h to be included from C++
3587 code, so we need a smarter configure test than AC_CHECK_HEADERS.
3589 Sun Oct 22 03:30:11 BST 2006 Olly Betts <olly@survex.com>
3591 * configure.ac: Tell AC_CHECK_HEADERS to suppress its backward
3592 compatibility mode, so it only checks headers with the compiler.
3593 This speeds up configure a little, and is what we do elsewhere.
3595 Tue Oct 10 17:21:13 BST 2006 Olly Betts <olly@survex.com>
3597 * NEWS: Update for actual 0.9.7 release.
3599 Mon Oct 09 18:26:14 BST 2006 Olly Betts <olly@survex.com>
3601 * docs/termprefixes.txt: "$setmap{title,S}" should be
3602 "$setmap{prefix,title,S}".
3604 Sun Oct 08 21:43:16 BST 2006 Olly Betts <olly@survex.com>
3606 * NEWS,configure.ac: Update for 0.9.7.
3608 Fri Sep 15 16:56:49 BST 2006 Olly Betts <olly@survex.com>
3610 * cgiparam.cc: Compilation fix for Sun C++.
3612 Fri Sep 15 06:00:50 BST 2006 Olly Betts <olly@survex.com>
3614 * configure.ac,query.cc: Compilation fix for Sun C++.
3616 Thu Sep 14 15:41:33 BST 2006 Olly Betts <olly@survex.com>
3618 * htmlparse.cc: Include <stdlib.h> so atoi() is prototyped.
3620 Wed Sep 13 16:37:32 BST 2006 Olly Betts <olly@survex.com>
3622 * configure.ac,md5.h,values.h: Use stdint.h if we have it.
3624 Tue Sep 12 11:57:16 BST 2006 Olly Betts <olly@survex.com>
3626 * myhtmlparse.cc: Need "#include <string.h>" for strchr.
3628 Mon Sep 11 20:24:27 BST 2006 Olly Betts <olly@survex.com>
3630 * values.h: Only want our own ntohl for MS Windows.
3632 Mon Sep 11 16:36:54 BST 2006 Olly Betts <olly@survex.com>
3634 * omega.cc,query.cc: Now xapian-config will switch Sun's C++ compiler
3635 into ANSI C++ compliant mode, so clean out all our special cased
3638 Mon Sep 11 14:23:44 BST 2006 Olly Betts <olly@survex.com>
3640 * md5.h,values.h: Apply previous fix for DJGPP too.
3642 Sun Sep 10 19:04:17 BST 2006 Olly Betts <olly@survex.com>
3644 * md5.h,values.h: Using htonl from winsock.h requires use to link
3645 with the winsock DLL, which is overkill so just add a simple
3646 implementation for htonl - we know MS Windows is little-endian.
3648 Sat Sep 09 21:48:22 BST 2006 Olly Betts <olly@survex.com>
3650 * md5.h,values.h: Sigh, winsock.h uses u_long instead of uint32_t
3651 in the htonl prototype.
3653 Sat Sep 09 19:19:15 BST 2006 Olly Betts <olly@survex.com>
3655 * omindex.cc: Fix typo in previous commit.
3657 Sat Sep 09 17:11:40 BST 2006 Olly Betts <olly@survex.com>
3659 * configure.ac,omindex.cc: Mingw doesn't have sys/wait.h or
3662 Sat Sep 09 16:44:29 BST 2006 Olly Betts <olly@survex.com>
3664 * md5.h,values.h: On MS Windows, we need to #include <winsock.h>.
3666 Fri Sep 08 08:01:15 BST 2006 Olly Betts <olly@survex.com>
3668 * query.cc: Sun C++'s std::count() isn't very "std" -- it has the
3671 Fri Sep 08 03:39:14 BST 2006 Olly Betts <olly@survex.com>
3673 * md5.h,values.h: openbsd needs arpa/inet.h to be included before
3676 Wed Sep 06 21:31:33 BST 2006 Olly Betts <olly@survex.com>
3678 * md5wrap.cc: #include <unistd.h>
3680 Wed Sep 06 18:03:23 BST 2006 Olly Betts <olly@survex.com>
3682 * Makefile.am: Ship values.h.
3684 Wed Sep 06 03:52:27 BST 2006 Olly Betts <olly@survex.com>
3686 * configfile.cc: Changed my mind - don't allow comments on the end of
3688 * docs/overview.txt: Document that omega.conf can have comments and
3691 Wed Sep 06 03:46:16 BST 2006 Olly Betts <olly@survex.com>
3693 * configfile.cc,omega.conf: Fix code which reads omega.conf to be line
3694 based as documented rather than the wacky whitespace based scheme
3695 that was actually implemented. Allow "#" comments and blank lines
3698 Wed Sep 06 01:26:17 BST 2006 Olly Betts <olly@survex.com>
3700 * omindex.cc: If popen() fails, treat it as a read error.
3702 Wed Sep 06 00:49:47 BST 2006 Olly Betts <olly@survex.com>
3704 * omindex.cc: Fix escaping of filenames to cast characters to
3705 "unsigned char" so that isalnum() works correctly everywhere.
3706 Not a security hole as dangerous characters were still being
3709 Tue Sep 05 06:49:30 BST 2006 Olly Betts <olly@survex.com>
3711 * Makefile.am: Run htmlparsetest on "make check".
3713 Tue Sep 05 06:46:18 BST 2006 Olly Betts <olly@survex.com>
3715 * Makefile.am,htmlparse.cc,htmlparse.h,metaxmlparse.cc,metaxmlparse.h,
3716 myhtmlparse.h,omindex.cc,xmlparse.cc,xmlparse.h: Parse the XML from
3717 OpenDocument and OpenOffice using new subclasses of HtmlParser.
3718 Only extract meta.xml once.
3720 Tue Sep 05 06:45:02 BST 2006 Olly Betts <olly@survex.com>
3722 * Makefile.am,htmlparsetest.cc: Add htmlparsetest which tests the
3725 Tue Sep 05 04:36:46 BST 2006 Olly Betts <olly@survex.com>
3727 * omindex.cc: Note UTF-8 runes for pdfinfo and pdftotext.
3729 Tue Sep 05 04:29:21 BST 2006 Olly Betts <olly@survex.com>
3731 * omindex.cc: Only run pdfinfo once and pull out the
3732 fields we want using string operations, instead of
3733 running it twice filtered through sed.
3735 Tue Sep 05 03:53:00 BST 2006 Olly Betts <olly@survex.com>
3737 * htmlparse.cc,htmlparse.h: Don't get confused by "a<b" in
3738 Javascript in a <script> tag. Fixes bug#91.
3740 Sat Sep 02 04:29:12 BST 2006 Olly Betts <olly@survex.com>
3742 * omindex.cc: Call pclose() not fclose() on a FILE* obtained from
3743 popen(). If a filter program isn't installed, then don't try it
3744 again for the same extension (not perfect but an improvement -
3745 previously we indexed an empty document!)
3747 Sat Sep 02 02:07:30 BST 2006 Olly Betts <olly@survex.com>
3749 * Makefile.am,configure.ac,docs/omegascript.txt,md5.cc,md5.h,
3750 md5wrap.cc,md5wrap.h,omindex.cc,query.cc,values.h: Generate
3751 an MD5 checksum of each file indexed and store it in value #1
3752 to allow duplicates to be collapsed. Add $pack and $unpack
3753 OmegaScript commands to allow big endian binary values to
3754 be encoded and decoded. Add the file last modified time
3757 Fri Sep 01 04:37:09 BST 2006 Olly Betts <olly@survex.com>
3759 * omindex.cc: Tweak comment and whitespace.
3761 Fri Sep 01 04:19:39 BST 2006 Olly Betts <olly@survex.com>
3763 * README: Update reference to "CVS" to say "SVN".
3765 Thu Aug 31 20:22:33 BST 2006 Olly Betts <olly@survex.com>
3767 * loadfile.cc: #include <algorithm> for std::min().
3769 Thu Aug 31 02:35:36 BST 2006 Olly Betts <olly@survex.com>
3771 * loadfile.cc: More missing #include-s.
3773 Thu Aug 31 01:53:31 BST 2006 Olly Betts <olly@survex.com>
3775 * loadfile.cc: Add #include <unistd.h>.
3777 Wed Aug 30 23:21:49 BST 2006 Olly Betts <olly@survex.com>
3779 * Makefile.am: Include loadfile.h in the tarball.
3781 Mon Aug 28 18:09:28 BST 2006 Olly Betts <olly@survex.com>
3783 * omindex.cc: Don't generate 'W' terms since omega doesn't use them.
3785 Mon Aug 28 03:06:46 BST 2006 Olly Betts <olly@survex.com>
3787 * query.cc,templates/query: Use '\t' to separate terms in xP since
3788 filter terms might contain '.'. Fixes bug#87.
3790 Sun Aug 27 01:36:40 BST 2006 Olly Betts <olly@survex.com>
3792 * indextext.cc: Don't generate terms with more than 3 trailing
3793 symbols ('-', '+', or '#').
3795 Sun Aug 27 01:11:45 BST 2006 Olly Betts <olly@survex.com>
3797 * omindex.cc: Added "size" field to document data; don't add "modtime"
3798 field if the timestamp is (time_t)-1.
3800 Sun Aug 27 00:36:12 BST 2006 Olly Betts <olly@survex.com>
3802 * omindex.cc,templates/query,utils.cc,utils.h: Store the file's last
3803 modified time in the document data as "modtime" so it shows up in
3804 search results (and tweak the query template so the display of this
3805 information looks nicer).
3807 Fri Aug 25 22:55:23 BST 2006 Olly Betts <olly@survex.com>
3809 * docs/overview.txt,omindex.cc: Run xls2csv on MS Excel files; run
3810 catppt on MS Powerpoint files; also index MS Word templates (.dot).
3812 Thu Aug 24 21:40:10 BST 2006 Olly Betts <olly@survex.com>
3814 * htmlparse.cc: Support htdig's "ignore this bit" comments.
3816 Thu Aug 24 12:55:26 BST 2006 Olly Betts <olly@survex.com>
3818 * query.cc: Fix $highlight{} to work with capitalised words (it used
3819 to work but regressed in 0.8.2).
3821 Thu Aug 24 12:38:50 BST 2006 Olly Betts <olly@survex.com>
3823 * Makefile.am,omindex.cc,query.cc: Use the new routines in loadfile.cc
3824 to replace code to do the same thing in omindex and omega.
3826 Thu Aug 24 12:37:16 BST 2006 Olly Betts <olly@survex.com>
3828 * scriptindex.cc: Fix handling of check whether a record has content
3829 in the case where the same field is processed more than once.
3831 Thu Aug 24 12:35:32 BST 2006 Olly Betts <olly@survex.com>
3833 * Makefile.am,docs/scriptindex.txt,loadfile.cc,loadfile.h,
3834 scriptindex.cc: Add new "load" action to allow the contents of an
3835 external file to be loaded.
3837 Thu Aug 24 12:05:23 BST 2006 Olly Betts <olly@survex.com>
3839 * configure.ac: Check for strftime.
3841 Sun Jul 09 01:40:09 BST 2006 Olly Betts <olly@survex.com>
3843 * docs/omegascript.txt: Note that (by design) an omegascript template
3844 can't contain an infinite loop.
3846 Sun May 21 11:42:54 BST 2006 Olly Betts <olly@survex.com>
3848 * Makefile.am: Make use of the dist_ prefix to avoid having to list
3849 files in EXTRA_DIST as well as in *_SCRIPTS, *_DATA, and man_MANS.
3850 * Makefile.am: Prefer $(sysconfdir) to @sysconfdir@ since the former
3851 can be overridden on the "make" command line.
3853 Sat May 20 06:16:27 BST 2006 Olly Betts <olly@survex.com>
3855 * Makefile.am,configure.ac: Specify required automake version in
3856 the call to AM_INIT_AUTOMAKE in configure.ac.
3858 Thu May 18 14:12:13 BST 2006 Olly Betts <olly@survex.com>
3860 * docs/overview.txt,docs/quickstart.txt: Use the default path to the
3861 database directories in examples. Tweak the formatting in a few
3862 places. Give a path to the omega CGI binary in the example showing
3863 how to run it from the command line.
3865 Wed May 17 15:28:01 BST 2006 Olly Betts <olly@survex.com>
3867 * omega.spec.in: Fix so that the documentation gets packaged.
3869 Tue May 16 06:56:26 BST 2006 Olly Betts <olly@survex.com>
3871 * configure.ac: Remove unused variable from snprintf testing code.
3873 Mon May 15 02:18:01 BST 2006 Olly Betts <olly@survex.com>
3875 * NEWS,configure.ac: Updated for 0.9.6.
3877 Sat May 13 20:43:08 BST 2006 Olly Betts <olly@survex.com>
3879 * configure.ac: Update snprintf detection to match xapian-core.
3881 Fri May 12 20:12:40 BST 2006 Olly Betts <olly@survex.com>
3883 * docs/omegascript.txt: Clarified description of $now.
3885 Thu Apr 27 23:45:26 BST 2006 Olly Betts <olly@survex.com>
3887 * docs/omegascript.txt,query.cc: Added new OmegaScript commands
3888 $filterterms and $substr.
3890 Thu Apr 27 18:37:50 BST 2006 Olly Betts <olly@survex.com>
3892 * scriptindex.cc: Use const reference instead of just a reference.
3894 Sun Apr 23 18:32:20 BST 2006 Olly Betts <olly@survex.com>
3896 * scriptindex.cc: Fix "index" and "indexnopos" without a prefix to
3897 set the weight correctly (bug introduced in 0.9.5).
3899 Wed Apr 19 13:37:15 BST 2006 Fabrice Colin
3901 * omega.spec.in: Create and package /var/lib/omega/cdb and
3904 Tue Apr 11 19:29:34 BST 2006 Olly Betts <olly@survex.com>
3906 * configure.ac,htmlparse.cc,query.cc,scriptindex.cc: Disable MSVC
3907 warning 4800 (on int to bool conversions) in config.h and then we
3908 can remove the "fixes" elsewhere.
3910 Mon Apr 10 16:26:08 BST 2006 Olly Betts <olly@survex.com>
3912 * date.cc,hashterm.cc,htmlparse.cc,omega.cc,omindex.cc,query.cc,
3913 scriptindex.cc: Fix MSVC7 warnings.
3915 Sat Apr 08 20:04:33 BST 2006 Olly Betts <olly@survex.com>
3917 * NEWS,configure.ac: Updated for 0.9.5.
3919 Fri Apr 07 16:45:36 BST 2006 Olly Betts <olly@survex.com>
3921 * omindex.cc,query.cc: Tweak for MSVC compilation.
3923 Fri Apr 07 03:23:22 BST 2006 Olly Betts <olly@survex.com>
3925 * omega.spec.in: Man pages may be gzipped.
3927 Thu Apr 06 14:28:08 BST 2006 Olly Betts <olly@survex.com>
3929 * README: Add pointer to documentation.
3931 Thu Apr 06 03:32:21 BST 2006 Olly Betts <olly@survex.com>
3933 * omega.spec.in: Include man pages in RPM.
3935 Thu Apr 06 03:06:56 BST 2006 Olly Betts <olly@survex.com>
3937 * Makefile.am,commonhelp.cc,commonhelp.h,configure.ac,omindex.cc,
3938 scriptindex.cc: Add man pages for omindex and scriptindex.
3940 Thu Apr 06 02:56:09 BST 2006 Olly Betts <olly@survex.com>
3942 * mbox2omega.script: Use new "hash" command.
3944 Wed Apr 05 19:29:14 BST 2006 Olly Betts <olly@survex.com>
3946 * Makefile.am,docs/scriptindex.txt,hashterm.cc,hashterm.h,
3947 omindex.cc,scriptindex.cc: Add new "hash" command to allow hashed
3948 terms to be generated from long URLs like omindex does.
3949 * htdig2omega.script: Use new "hash" command.
3950 * scriptindex.cc: Fix "useless weight" warning to not incorrectly
3951 fire when "index" or "indexnopos" has no parameter.
3953 Wed Apr 05 15:03:28 BST 2006 Olly Betts <olly@survex.com>
3955 * scriptindex.cc: Check if we successfully opened the index script
3956 and give an error if not.
3958 Fri Mar 10 05:21:13 GMT 2006 Olly Betts <olly@survex.com>
3960 * dbi2omega: Check DBIDRIVER environmental variable to allow a driver
3961 other than mysql to be specified without modifying the script.
3963 Wed Mar 01 02:28:57 GMT 2006 Olly Betts <olly@survex.com>
3965 * scriptindex.cc: Don't repeat the "note" part of warnings; Warn if
3966 "unique=<prefix>" is used without a corresponding "boolean=<prefix>";
3967 Warn that "index=nopos" is deprecated and should be replaced by
3970 Tue Feb 28 23:46:57 GMT 2006 Olly Betts <olly@survex.com>
3972 * scriptindex.cc: Report a useless weight action, even if it's
3973 followed by another non-useless action (e.g. field); convert weight
3974 actions into a numeric parameter on index and indexnopos Action
3975 objects; add explanatory text "(note that actions are executed from
3976 left to right)" when reporting useless actions.
3978 Sun Feb 26 00:25:10 GMT 2006 Olly Betts <olly@survex.com>
3980 * query.cc: Fix $opt[fieldnames] handling. Previously it would try
3981 to kick in if you didn't set fieldnames but set any alphabetically
3984 Tue Feb 21 00:18:25 GMT 2006 Olly Betts <olly@survex.com>
3986 * configure.ac,NEWS: Updated for 0.9.4.
3988 Sun Feb 19 23:20:49 GMT 2006 Olly Betts <olly@survex.com>
3990 * COPYING: Updated FSF address.
3992 Thu Feb 16 00:10:22 GMT 2006 Olly Betts <olly@survex.com>
3994 * NEWS,configure.ac: Updated for 0.9.3.
3996 Wed Feb 08 13:01:15 GMT 2006 Olly Betts <olly@survex.com>
3998 * templates/query: Make the page title shorter so there's more chance
3999 it will fit on icon bars, etc.
4001 Wed Feb 08 10:08:24 GMT 2006 Olly Betts <olly@survex.com>
4003 * docs/overview.txt: Add pointer to documentation of the supported
4006 Mon Feb 06 15:19:17 GMT 2006 Olly Betts <olly@survex.com>
4008 * docs/termprefixes.txt: Fix typo.
4010 Sat Jan 14 22:40:43 GMT 2006 Olly Betts <olly@survex.com>
4012 * configure.ac: Copy over fixed snprintf checks from xapian-core.
4014 Fri Jan 13 03:21:15 GMT 2006 Olly Betts <olly@survex.com>
4016 * configure.ac: The configure test for snprintf uses memcmp, so
4017 we need to "#include <string.h>" for it to work reliably.
4019 Mon Jan 09 04:23:54 GMT 2006 Olly Betts <olly@survex.com>
4021 * date.cc,query.cc: Add "#include <stdarg.h>" where we use
4024 Mon Jan 09 04:17:54 GMT 2006 Olly Betts <olly@survex.com>
4026 * cdb_init.cc: Fix more compilation issues with cdb no-mmap code.
4028 Mon Jan 09 03:42:18 GMT 2006 Olly Betts <olly@survex.com>
4030 * omega.cc,utils.cc,utils.h: Replace remaining use of split with
4031 a direct walk of the string.
4033 Mon Jan 09 03:19:49 GMT 2006 Olly Betts <olly@survex.com>
4035 * query.cc: Don't split strings of docids in R parameters into a
4036 vector<string> - just walk the string directly. The code is
4037 as simple, and much more efficient if a lot of documents are
4040 Mon Jan 09 02:46:34 GMT 2006 Olly Betts <olly@survex.com>
4042 * Makefile.am,date.cc,omindex.cc,query.cc,scriptindex.cc,utils.cc,
4043 utils.h: Use snprintf where available.
4045 Sun Jan 08 22:41:47 GMT 2006 Olly Betts <olly@survex.com>
4047 * cdb_init.cc: Fixed malloc-based version to compile.
4049 Sun Jan 08 21:05:46 GMT 2006 Olly Betts <olly@survex.com>
4051 * cdb_find.cc,cdb_hash.cc,cdb_unpack.cc: #include <config.h>.
4052 * configure.ac: Test for mmap.
4053 * cdb_init.cc: If mmap isn't found, and this isn't WIN32 fall back on
4054 the very crude approach of loading the whole file into a malloc-ed
4055 block. For a small cdb file, that'll give acceptable performance
4058 Fri Jan 06 21:29:37 GMT 2006 Olly Betts <olly@survex.com>
4060 * symboltab.h: Fix A after \xbf being interpereted as an overlong
4063 Fri Jan 06 21:26:57 GMT 2006 Olly Betts <olly@survex.com>
4065 * query.cc: Fix printf type mismatch on 64 bit platforms.
4067 Fri Jan 06 21:00:34 GMT 2006 Olly Betts <olly@survex.com>
4069 * docs/omegascript.txt,query.cc: Added $find{LIST,STRING}.
4071 Fri Jan 06 20:52:31 GMT 2006 Olly Betts <olly@survex.com>
4073 * symboltab.h: Write top-bit set characters using \xXX notation to
4074 avoid warnings from Intel's C++ compiler.
4076 Fri Jan 06 18:15:42 GMT 2006 Olly Betts <olly@survex.com>
4078 * query.cc: Removed unused variable.
4080 Fri Jan 06 18:14:33 GMT 2006 Olly Betts <olly@survex.com>
4082 * query.cc: Cast time_t to unsigned long to avoid problems on 64bit
4085 Fri Jan 06 18:12:38 GMT 2006 Olly Betts <olly@survex.com>
4087 * docs/omegascript.txt: Note in the $cgi description that it returns
4088 an arbitrary value if there's more than one, and pointing to
4091 Thu Jan 05 05:54:58 GMT 2006 Olly Betts <olly@survex.com>
4093 * cdb_init.cc: Fix mingw compilation.
4095 Thu Jan 05 03:24:07 GMT 2006 Olly Betts <olly@survex.com>
4097 * cdb_init.cc: Fix to hopefully compile on Solaris which has a broken
4098 sys/mman.h when used from C++.
4100 Wed Jan 04 20:44:44 GMT 2006 Olly Betts <olly@survex.com>
4102 * query.cc: Fixed to compile with GCC 3.0.
4104 Wed Jan 04 04:33:15 GMT 2006 Olly Betts <olly@survex.com>
4106 * Makefile.am,cdb.h,cdb_find.cc,cdb_hash.cc,cdb_init.cc,cdb_int.h,
4107 cdb_unpack.cc,configfile.cc,configfile.h,docs/omegascript.txt,
4108 omega.conf,query.cc: Add $lookup{CDBFILE,KEY} command to perform
4109 a lookup in a CDB file.
4111 Wed Jan 04 03:06:31 GMT 2006 Olly Betts <olly@survex.com>
4113 * docs/omegascript.txt,docs/overview.txt,query.cc: Added new feature
4114 which allows you to avoid storing fieldnames in every document
4115 (which can save a lot of disk space for a large database). Instead
4116 you just store the field values, one per line, and add something
4117 like "$set{fieldnames,$split{caption sample url}}" to the
4118 OmegaScript template to specify the fieldnames to use.
4119 * docs/omegascript.txt,query.cc: Add new "$split{}" command which
4120 splits a string to give an OmegaScript list.
4121 * query.cc: Fix $url{} to escape "+" to "%2b".
4122 * query.cc: Speed up $highlight{} - only compare terms which are the
4125 Tue Jan 03 22:38:01 GMT 2006 Olly Betts <olly@survex.com>
4127 * configfile.cc: Rename file_readable() to file_exists() to better
4128 reflect what the function actually does!
4130 Tue Jan 03 17:43:40 GMT 2006 Olly Betts <olly@survex.com>
4132 * templates/opensearch: Add missing escaping.
4134 Mon Dec 19 10:27:30 GMT 2005 Olly Betts <olly@survex.com>
4136 * Makefile.am,commonhelp.cc,commonhelp.h,docs/overview.txt,omindex.cc,
4137 scriptindex.cc: Add "--stemmer" option to omindex and scriptindex
4138 to allow the stemming language to be set.
4139 * omindex.cc,scriptindex.cc: More consistent --help and --version
4140 output. Update FSF address.
4142 Mon Dec 19 06:03:31 GMT 2005 Olly Betts <olly@survex.com>
4144 * query.cc: Explicitly use "unsigned char" when %-encoding in $url
4145 so that top bit set characters are correctly handled on platforms
4146 where char is signed by default.
4148 Sun Dec 11 09:30:44 GMT 2005 Olly Betts <olly@survex.com>
4150 * templates/godmode: If a non-existent docid is specified, report the
4151 error and prompt the user to enter another docid. Fixes bug#60.
4153 Sun Dec 11 09:27:18 GMT 2005 Olly Betts <olly@survex.com>
4155 * docs/cgiparams.txt,omega.cc,omega.h,query.cc: Add "SORTREVERSE"
4156 CGI parameter which allows the sort order to be reversed when
4157 sorting on a value. Remove "SORTBANDS" CGI parameter since it
4158 no longer does anything.
4160 Sun Dec 11 09:26:14 GMT 2005 Olly Betts <olly@survex.com>
4162 * omindex.cc: Improve wording of comment.
4164 Sun Dec 11 09:22:58 GMT 2005 Olly Betts <olly@survex.com>
4166 * docs/overview.txt,omindex.cc: Add support for OpenDocument format
4167 mimetypes and extensions out of the box.
4169 Sun Dec 11 09:16:57 GMT 2005 Olly Betts <olly@survex.com>
4171 * docs/omegascript.txt,query.cc: If executing an OmegaScript command
4172 causes a Xapian exception to be thrown, catch it and copy the error
4173 message into error_msg (which is read by the $error command).
4175 Sun Dec 11 09:12:12 GMT 2005 Olly Betts <olly@survex.com>
4177 * htmlparse.cc: Tweak a few comments; "while (1)" -> "while (true)".
4179 Sun Dec 11 09:09:40 GMT 2005 Olly Betts <olly@survex.com>
4181 * docs/overview.txt: The U prefix (URL term) was grouped with the date
4182 searching prefixes, but it makes more sense to group it with the
4183 prefixes relating to parts of the URL (H for hostname, P for path,
4186 Sun Oct 02 16:28:59 BST 2005 Olly Betts <olly@survex.com>
4188 * scriptindex.cc: Use "int database_mode" (set to the value to pass to
4189 WritableDatabase's ctor) instead of "bool overwrite" to implement
4191 * scriptindex.cc: Remove code to handle "-q" as it no longer actually
4192 controls anything. Just ignore it for backwards compatibility.
4193 * scriptindex.cc: Tweak --help output to not wrap on a default
4196 Sat Sep 10 14:57:19 BST 2005 Olly Betts <olly@survex.com>
4198 * docs/omegascript.txt: Improve descriptions of $collapsed, $value,
4201 Fri Jul 29 10:05:21 BST 2005 James Aylett <james@tartarus.org>
4203 * omindex.cc: add --preserve-nonduplicates / -p option to not
4204 delete any documents that aren't updated, in replace duplicates
4205 mode (so that multiple runs of omindex on different subsites
4206 don't stomp on each other).
4208 * docs/overview.txt: update to match the above.
4210 Fri Jul 15 11:12:28 BST 2005 Olly Betts <olly@survex.com>
4212 * configure.ac: Updated for 0.9.2.
4214 Fri Jul 15 02:18:40 BST 2005 Olly Betts <olly@survex.com>
4216 * NEWS: Updated for 0.9.2.
4218 Sat Jul 02 14:56:35 BST 2005 Olly Betts <olly@survex.com>
4220 * query.cc: Workaround further Sun C++ crapness.
4222 Wed Jun 29 03:19:22 BST 2005 Olly Betts <olly@survex.com>
4224 * docs/omegascript.txt,query.cc: Changed $highlight so
4225 if OPEN and CLOSE aren't specified, they default to
4226 highlighting each word from the query with a different
4227 background colour like gmane does (previous default was to use
4228 '<strong>' and '</strong>').
4229 * query.cc: Removed surplus whitespace.
4231 Fri Jun 24 02:51:38 BST 2005 Olly Betts <olly@survex.com>
4233 * query.cc: Call QueryParser::set_database() as this is now used to
4234 decide what to do for terms like "C#".
4235 * docs/omegascript.txt,docs/termprefixes.txt,query.cc: Add the
4236 ability to set boolean prefixes for the QueryParser by setting
4237 a "boolprefix" map in the omegascript template.
4239 Fri Jun 24 02:40:10 BST 2005 Olly Betts <olly@survex.com>
4241 * scriptindex.cc: Fix infinite loop if there's no newline at the end
4244 Thu Jun 23 16:42:41 BST 2005 Olly Betts <olly@survex.com>
4246 * docs/termprefixes.txt: Explain who to use termprefixes with
4247 scriptindex and omega, since that's what most people will want to
4250 Thu Jun 23 16:41:15 BST 2005 Olly Betts <olly@survex.com>
4252 * query.cc,docs/omegascript.txt: Added $length{} and $stoplist{}
4253 commands to OmegaScript.
4254 * docs/omegascript.txt: Use standard "S" prefix for title in example
4255 for $setmap, rather than "XT".
4257 Mon Jun 06 17:59:10 BST 2005 Olly Betts <olly@survex.com>
4259 * NEWS: Another 0.9.1 update.
4261 Mon Jun 06 17:52:44 BST 2005 Olly Betts <olly@survex.com>
4263 * NEWS: Updated for 0.9.1.
4265 Mon Jun 06 17:51:58 BST 2005 Olly Betts <olly@survex.com>
4267 * configure.ac: Updated for 0.9.1.
4269 Mon May 23 23:36:48 BST 2005 Fabrice Colin <fabrice.colin@gmail.com>
4271 * omega.spec.in: Updated for 0.9.0.
4273 Fri May 13 23:21:02 BST 2005 Olly Betts <olly@survex.com>
4275 * NEWS: Updated for 0.9.0.
4277 Fri May 13 00:39:44 BST 2005 Olly Betts <olly@survex.com>
4279 * configure.ac: Updated for 0.9.0.
4281 Fri May 13 00:35:21 BST 2005 Olly Betts <olly@survex.com>
4283 * scriptindex.cc: Improved handling of extra blank lines in dump file;
4284 Strip multiple \r characters from end of line; Complain if a dump
4285 file doesn't appear to have been = escaped correctly; Flush
4286 database after each input file to ensure all changes from a file
4288 * docs/omegascript.txt: Whitespace tweak.
4290 Wed May 11 02:28:41 BST 2005 Olly Betts <olly@survex.com>
4292 * NEWS: Started to update for 0.9.0.
4294 Sun May 08 02:16:07 BST 2005 Olly Betts <olly@survex.com>
4296 * query.cc: Use Query::get_terms_begin() not
4297 QueryParser::termlist_begin().
4299 Sun May 08 02:11:49 BST 2005 Olly Betts <olly@survex.com>
4301 * Makefile.am: Use AM_CPPFLAGS not CPPFLAGS (CPPFLAGS is for the
4304 Wed May 4 11:32:18 BST 2005 Richard Boulton <richard@tartarus.org>
4306 * configfile.cc: Configuration file is now looked for in various
4307 locations: the first location in which a file is found is used.
4308 Firstly, if the OMEGA_CONFIG_FILE environment variable is set,
4309 the location given in it is checked. Secondly, the file
4310 "omega.conf" in the same directory as the executable is checked.
4311 Finally, the file "${sysconfdir}/omega.conf" (eg, /etc/omega.conf
4312 on Linux) is checked. If none of these locations contain a file,
4313 default values are used.
4314 * docs/overview.txt: Update to describe new configuration file
4316 * Makefile.am: Install omega.conf to ${sysconfdir} by default.
4317 Define CONFIGFILE_SYSTEM with an appropriate value to find the
4318 system configuration file.
4320 Wed May 4 11:20:26 BST 2005 Richard Boulton <richard@tartarus.org>
4322 * query.cc: Use new set_stemming_strategy() API method, rather than
4323 old set_stemming_options() method. The old method didn't compile
4324 because it's being passed a stemming_strategy value, which there
4325 isn't a prototype for.
4327 Fri Apr 29 10:27:05 BST 2005 Olly Betts <olly@survex.com>
4329 * scriptindex.cc: Improved comments.
4331 Fri Apr 15 03:12:02 BST 2005 Olly Betts <olly@survex.com>
4333 * docs/termprefixes.txt: Updated QueryParser prefix documentation to
4334 remove references to CVS HEAD.
4335 * docs/termprefixes.txt: Capitalise "Month" to indicate why it has
4336 prefix "M" (in line with all the other entries in the list).
4338 Fri Apr 15 02:55:06 BST 2005 Olly Betts <olly@survex.com>
4340 * indextext.cc: Generate terms like "c#".
4341 * query.cc: Highlight words like "C#".
4343 Fri Apr 15 02:53:22 BST 2005 Olly Betts <olly@survex.com>
4345 * query.cc: Clearer code for adding boolean filters are added to the
4348 Wed Apr 06 02:47:14 BST 2005 Olly Betts <olly@survex.com>
4350 * omindex.cc: Tweak the hashing of URLs so that it works the same
4351 way on all platforms (previously it would depend on sizeof(long)).
4352 This means an incompatibility with any existing database built on
4353 a platform where sizeof(long) > 4 where URLs were hashed (i.e.
4354 URLs were > 228 bytes if sizeof(long) == 8), but we really want
4355 databases to be portable between platforms.
4357 Wed Apr 06 02:44:58 BST 2005 Olly Betts <olly@survex.com>
4359 * omindex.cc,docs/overview.txt: Removed useless "DUPE_duplicate"
4362 Wed Apr 06 00:48:08 BST 2005 Olly Betts <olly@survex.com>
4364 * omindex.cc,docs/overview.txt: Added support for using pod2text for
4365 indexing Perl documentation.
4367 Wed Apr 06 00:25:47 BST 2005 Olly Betts <olly@survex.com>
4369 * omindex.cc,docs/overview.txt: Replace -l/--no-recurse with
4370 -l/--depth-limit which takes an argument allowing recursion
4371 to be restriction to any depth, not just 0 or infinite!
4373 Tue Apr 05 23:45:39 BST 2005 Olly Betts <olly@survex.com>
4375 * mbox2omega,mbox2omega.script,Makefile.am: Added mbox2omega which
4376 allows a mail folder to be indexed. Mostly it's an example as
4377 there's no mechanism included to show the full original message.
4379 Tue Apr 05 23:41:44 BST 2005 Olly Betts <olly@survex.com>
4381 * scriptindex.cc: Tidy up STL header includes.
4383 Tue Apr 05 23:34:36 BST 2005 Olly Betts <olly@survex.com>
4385 * docs/omegascript.txt: Clarify $field description slightly.
4387 Tue Apr 05 23:33:33 BST 2005 Olly Betts <olly@survex.com>
4389 * indextext.h: Add typedefs to allow AccentNormalisingItor to be used
4392 Tue Apr 05 00:47:52 BST 2005 Olly Betts <olly@survex.com>
4394 * docs/cgiparams.txt,docs/omegascript.txt: Fixed 3 references to
4397 Tue Apr 05 00:41:45 BST 2005 Olly Betts <olly@survex.com>
4399 * debian/.cvsignore,.cvsignore: Remove .cvsignore files, as they're
4402 Mon Mar 21 16:43:07 GMT 2005 Richard Boulton <richard@tartarus.org>
4404 * templates/opensearch: Add new template to implement basic
4405 opensearch feeds of search results.
4406 * Makefile.am: Include opensearch template in distribution.
4408 Thu Mar 03 02:20:26 GMT 2005 Olly Betts <olly@survex.com>
4410 * templates/query2: Remove Sam's unfinished rewrite of the query
4411 template. It's not been worked on for nearly two years, and we
4414 Wed Mar 02 03:09:52 GMT 2005 Olly Betts <olly@survex.com>
4416 * COPYING: Put in CVS.
4418 Tue Mar 01 02:09:35 GMT 2005 Olly Betts <olly@survex.com>
4420 * omindex.cc,docs/overview.txt: Extend -M/--mime-type to allow an
4421 existing mapping to be removed by omitting the type.
4423 Thu Feb 24 17:42:35 GMT 2005 Olly Betts <olly@survex.com>
4425 * Makefile.am: Actually ship docs/termprefixes.txt (and make it harder
4426 to fail to ship new docs in future).
4428 Thu Feb 24 02:10:09 GMT 2005 Olly Betts <olly@survex.com>
4430 * Makefile.am,docs/termprefixes.txt: Added a single document covering
4431 all aspects of term prefixes.
4433 Wed Feb 23 14:59:46 GMT 2005 Olly Betts <olly@survex.com>
4435 * docs/omegascript.txt: Moved $collapsed into correct place
4438 Wed Feb 16 03:46:51 GMT 2005 Olly Betts <olly@survex.com>
4440 * docs/cgiparams.txt,docs/overview.txt: Improved description of how
4441 B filters are handled when building the query.
4443 Wed Feb 16 03:44:24 GMT 2005 Olly Betts <olly@survex.com>
4445 * omindex.cc: Fixed so that we get lstat() prototype on Linux systems
4446 where we have posix_fadvise().
4448 Mon Jan 17 03:35:35 GMT 2005 Olly Betts <olly@survex.com>
4450 * query.cc: Corrected a comment.
4452 Mon Jan 17 03:32:25 GMT 2005 Olly Betts <olly@survex.com>
4454 * query.cc: Updated to use the new QueryParser API.
4456 Wed Jan 05 03:15:43 GMT 2005 Olly Betts <olly@survex.com>
4458 * docs/scriptindex.txt: Note that actions are applied in the specified
4461 Thu Dec 23 19:12:57 GMT 2004 Olly Betts <olly@survex.com>
4463 * INSTALL: "xapian-examples" -> "omega".
4465 Thu Dec 23 19:10:04 GMT 2004 Olly Betts <olly@survex.com>
4467 * configure.ac,NEWS: Version 0.8.5.
4469 Thu Dec 23 19:09:01 GMT 2004 Olly Betts <olly@survex.com>
4471 * INSTALL,README: Added better installation instructions.
4473 Mon Dec 20 17:26:26 GMT 2004 Olly Betts <olly@survex.com>
4475 * configure.ac,omindex.cc: Fixed "ignore symlinks" code to compile on
4476 systems without lstat (e.g. mingw).
4478 Mon Dec 20 12:18:18 GMT 2004 Olly Betts <olly@survex.com>
4480 * omindex.cc: Fix the "ignore symlinks" code to actually compile on
4481 certain Linux boxes.
4483 Mon Dec 20 11:33:59 GMT 2004 Olly Betts <olly@survex.com>
4485 * query.cc: If an exception is thrown, make sure that the HTTP headers
4486 get written so that we don't cause "500 Internal Server Error".
4487 This problem was introduced by the change to allow a user specified
4488 Content-Type in 0.8.0. Partly addresses bug#60.
4490 Fri Dec 17 22:50:01 GMT 2004 Olly Betts <olly@survex.com>
4492 * omindex.cc: Only try to delete removed documents in DUPE_replace
4495 Thu Dec 16 11:43:28 GMT 2004 Olly Betts <olly@survex.com>
4497 * scriptindex.cc: Fixed "Unknown Exception" when trying to "unhtml"
4498 text which contains "</body>" (bug#61). This bug was introduced in
4501 Thu Dec 16 11:28:25 GMT 2004 Olly Betts <olly@survex.com>
4503 * myhtmlparse.cc: <h1> - <h6> and </h1> - </h6> should leave a
4504 space into the dumped HTML.
4506 Wed Dec 15 15:53:55 GMT 2004 Richard Boulton <richard@tartarus.org>
4508 * dbi2omega: Add a comment to the start of the file detailing what
4511 Wed Dec 15 15:08:41 GMT 2004 Richard Boulton <richard@tartarus.org>
4513 * omindex.cc: Change behaviour of crawler such that it doesn't
4514 follow symbolic links any more. Add "--follow" command
4515 line option to turn following of symlinks back on.
4517 Wed Dec 08 16:31:46 GMT 2004 Olly Betts <olly@survex.com>
4519 * NEWS: Final update for 0.8.4.
4521 Tue Dec 07 18:16:32 GMT 2004 Olly Betts <olly@survex.com>
4523 * indextext.h: Fixed to compile with GCC 3.x.
4525 Tue Dec 07 18:15:39 GMT 2004 Olly Betts <olly@survex.com>
4527 * omega.cc,omindex.cc,scriptindex.cc: Use the new
4528 Database/WritableDatabase constructors.
4530 Tue Nov 30 22:02:33 GMT 2004 Olly Betts <olly@survex.com>
4532 * NEWS,configure.ac: Updated for 0.8.4 release.
4534 Wed Nov 24 04:50:52 GMT 2004 Olly Betts <olly@survex.com>
4536 * templates/godmode: Finished off godmode template.
4538 Wed Nov 24 04:12:09 GMT 2004 Olly Betts <olly@survex.com>
4540 * query.cc: If there's only a boolean query so we promote it to be
4541 the query, switch to boolean weights.
4543 Wed Nov 24 03:29:36 GMT 2004 Olly Betts <olly@survex.com>
4545 * Makefile.am,myhtmlparse.cc,myhtmlparse.h,omindex.cc,scriptindex.cc:
4546 Factored out MyHtmlParser into a separate file so it can be used
4547 in scriptindex too to give scriptindex the same improved HTML
4548 parsing which omindex just got.
4550 Wed Nov 24 02:22:49 GMT 2004 Olly Betts <olly@survex.com>
4552 * omindex.cc: Removed bogus extra line from code which was meant to
4553 truncate at a word boundary, but has never actually worked!
4555 Wed Nov 24 02:20:36 GMT 2004 Olly Betts <olly@survex.com>
4557 * omindex.cc: Improved HTML to text conversion - the parser now knows
4558 that some tags should be regarded as word breaks and some shouldn't
4559 (previously all tags were treated as word breaks).
4561 Wed Nov 24 00:22:39 GMT 2004 Olly Betts <olly@survex.com>
4563 * omindex.cc: Removed debug output; don't include \xa0 in the list of
4564 whitespace characters for now, as that's a bit character set
4567 Wed Nov 24 00:04:42 GMT 2004 Olly Betts <olly@survex.com>
4569 * omindex.cc: HTML extraction now turns strips leading and trailing
4570 whitespace and converts all other consecutive groups of whitespace
4573 Tue Nov 23 20:29:14 GMT 2004 Olly Betts <olly@survex.com>
4575 * Makefile.am: XAPIAN_FLAGS already links with xapianqueryparser
4576 so remove -lxapianqueryparser from omega_LDADD as it was causing
4579 Wed Nov 17 18:51:28 GMT 2004 Olly Betts <olly@survex.com>
4581 * omindex.cc: Index RTF documents with unrtf, if available.
4582 * docs/overview.txt: Document this.
4584 Wed Nov 17 16:31:01 GMT 2004 Olly Betts <olly@survex.com>
4586 * omindex.cc: If a filename to be passed to a filter program has a
4587 leading "-", protect it from possible interpretation as an option
4590 Wed Nov 17 16:29:55 GMT 2004 Olly Betts <olly@survex.com>
4592 * omindex.cc: Index Wordperfect documents with wpd2text, if available.
4593 * docs/overview.txt: Document this.
4595 Wed Nov 17 15:12:08 GMT 2004 Olly Betts <olly@survex.com>
4597 * omindex.cc: Index MS Word documents with antiword, if available.
4598 * docs/overview.txt: Document this.
4600 Wed Nov 17 04:29:15 GMT 2004 Olly Betts <olly@survex.com>
4602 * omindex.cc: Add simple code to index OpenOffice documents.
4603 * docs/overview.txt: Update documentation to mention this.
4605 Tue Nov 09 03:04:44 GMT 2004 Olly Betts <olly@survex.com>
4607 * configure.ac,Makefile.am: We now get -AA or -std strict_ansi from
4608 xapian-config, so we don't need to probe for them ourselves.
4610 Sun Nov 07 16:36:42 GMT 2004 Olly Betts <olly@survex.com>
4612 * utils.cc: Fixed to work with updated snprintf configure test,
4614 Sun Nov 07 04:55:26 GMT 2004 Olly Betts <olly@survex.com>
4616 * configure.ac: rearrange so that libtool is active when we test if
4617 the c++ compiler can link a program so it can pull in libstdc++
4618 through a .la file; updated snprintf test to the new one from
4621 Fri Nov 05 17:20:13 GMT 2004 Olly Betts <olly@survex.com>
4623 * configure.ac: AM_CONFIG_HEADER -> AC_CONFIG_HEADERS; Run tests using
4624 the C++ compiler; select ANSI mode for aCC and cxx; Check GXX not
4625 GCC when choosing warning flags.
4627 Wed Nov 03 20:15:34 GMT 2004 Olly Betts <olly@survex.com>
4629 * query.cc: Updated to use Query::empty() instead of
4632 Wed Nov 03 20:12:37 GMT 2004 Olly Betts <olly@survex.com>
4634 * Makefile.am,getopt.cc,getopt.h,getopt1.cc,gnu_getopt.h,omindex.cc,
4635 scriptindex.cc: Updated to reworked getopt from xapian-core.
4637 Wed Nov 03 04:11:03 GMT 2004 Olly Betts <olly@survex.com>
4639 * getopt.cc: Defining _NO_PROTO is a really bad idea for C++ code!
4641 Tue Nov 02 18:54:12 GMT 2004 Olly Betts <olly@survex.com>
4643 * getopt.cc: Protect getopt definition for possible getopt macro
4644 declared in getopt.h.
4646 Tue Nov 02 17:56:08 GMT 2004 Olly Betts <olly@survex.com>
4648 * indextext.h: Fixed 2 warnings.
4650 Tue Nov 02 06:54:17 GMT 2004 Olly Betts <olly@survex.com>
4652 * getopt.cc,getopt1.cc: Fixed function declarations to not use K&R C
4655 Tue Nov 02 05:40:06 GMT 2004 Olly Betts <olly@survex.com>
4657 * Makefile.am,configure.ac,getopt.c,getopt1.c,getopt.cc,getopt1.cc:
4658 Compile everything as C++.
4660 Mon Sep 20 14:52:24 BST 2004 Olly Betts <olly@survex.com>
4662 * NEWS,configure.ac: Version 0.8.3.
4664 Mon Sep 20 14:49:26 BST 2004 Olly Betts <olly@survex.com>
4666 * Makefile.am,configure.ac: Require same versions of autoconf and
4667 automake that xapian-core does.
4669 Mon Sep 20 14:45:53 BST 2004 Olly Betts <olly@survex.com>
4671 * omega.spec.in: Update from Fabrice Colin. The most notable change
4672 is that the RPM is now called xapian-omega because there's already
4673 an omega RPM (in Fedora Core at least) which is some game.
4675 Thu Sep 16 00:57:13 BST 2004 Olly Betts <olly@survex.com>
4677 * cgiparam.cc,configfile.cc,configfile.h,htmlparse.cc,indextext.cc,
4678 omega.cc,omindex-config.cc: All C++ sources should #include
4679 <config.h> as the first header; no header files should #include
4682 Thu Sep 16 00:54:31 BST 2004 Olly Betts <olly@survex.com>
4684 * scriptindex.cc: --version now actually reports the version. --help
4685 now exits with status 0 rather than status 1.
4687 Tue Sep 14 03:00:32 BST 2004 Olly Betts <olly@survex.com>
4689 * omega.spec.in: Updated URL for sources; include htdig2omega and
4690 htdig2omega.script in the RPM.
4692 Tue Sep 14 02:56:52 BST 2004 Olly Betts <olly@survex.com>
4694 * Makefile.am: Install htdig2omega.script in ${prefix}/share/omega/
4695 rather than ${prefix}/share/.
4697 Mon Sep 13 03:22:55 BST 2004 Olly Betts <olly@survex.com>
4699 * NEWS,configure.ac: Version 0.8.2.
4701 Thu Sep 09 15:11:45 BST 2004 Olly Betts <olly@survex.com>
4705 Thu Sep 09 14:41:41 BST 2004 Olly Betts <olly@survex.com>
4707 * query.cc: Use new checkatleast parameter to Enquire::get_mset to
4710 Thu Sep 02 01:45:46 BST 2004 Olly Betts <olly@survex.com>
4712 * templates/query: Always report database not found - previously we
4713 only did so if there was a query. Also fixed missing </center>
4714 tag which happened in certain cases.
4716 Wed Aug 25 23:19:47 BST 2004 Olly Betts <olly@survex.com>
4718 * omindex.cc: When running with "replace duplicates" mode (the
4719 default), detect documents removed since the last indexing
4720 run and delete them from the database (bug #34).
4722 Tue Aug 24 19:23:55 BST 2004 Olly Betts <olly@survex.com>
4724 * omega.cc: Added FIXME comment noting that SORT and SORTBANDS should
4725 be tracked and the results reset to the first page if they change.
4727 Tue Aug 24 19:23:07 BST 2004 Olly Betts <olly@survex.com>
4729 * Makefile.am: Install htdig2omega and htdig2omega.script.
4731 Mon Aug 23 22:29:53 BST 2004 Olly Betts <olly@survex.com>
4733 * scriptindex.cc: Report index file name and line number when
4734 reporting errors in it. Added warning for redundant actions,
4735 such as "truncate" as the last action in a rule.
4737 Mon Aug 23 22:03:25 BST 2004 Olly Betts <olly@survex.com>
4739 * omindex.cc: Use the new replace_document(term, doc) method.
4741 Sun Aug 22 13:11:23 BST 2004 Olly Betts <olly@survex.com>
4743 * configure.in,configure.ac: Renamed configure.in to configure.ac.
4745 Sat Aug 21 12:41:43 BST 2004 Olly Betts <olly@survex.com>
4747 * docs/omegascript.txt: Added note about that $add{$hit,1} gives
4750 Fri Aug 20 20:28:16 BST 2004 Olly Betts <olly@survex.com>
4752 * Makefile.am: Link with -lxapianqueryparser, not -lomqueryparser.
4754 Thu Aug 19 19:13:34 BST 2004 Olly Betts <olly@survex.com>
4756 * Makefile.am: And actually ship htdig2omega and htdig2omega.script!
4758 Thu Aug 19 19:02:40 BST 2004 Olly Betts <olly@survex.com>
4760 * htdig2omega,htdig2omega.script: Added perl script and corresponding
4761 scriptindex index script which allow an ht://dig database to be
4762 imported into Xapian. This provides an easy way to provide a search
4763 of remote websites using omega (by spidering them with ht://dig).
4765 Sun Aug 15 01:48:58 BST 2004 Olly Betts <olly@survex.com>
4767 * indextext.cc,indextext.h,omindex.cc,query.cc,scriptindex.cc,
4768 symboltab.h: Fixed $highlight to understand accented characters
4771 Wed Jun 30 14:58:12 BST 2004 Olly Betts <olly@survex.com>
4773 * NEWS,configure.in: Version 0.8.1.
4775 Tue Jun 29 17:26:41 BST 2004 Richard Boulton <richard@tartarus.org>
4777 * Makefile.am: Remove Debian files from distribution tarballs,
4778 since there will often be multiple patch releases for each
4779 release. Debian files will be available from an apt repository
4782 Tue Jun 29 01:45:06 BST 2004 Olly Betts <olly@survex.com>
4784 * omindex.cc: Renamed hash() to hash_string() to avoid colliding
4785 with something on IRIX; Removed explicit initialisation of
4786 mime_types - perhaps that's spooking the SGI CC prelinker.
4788 Sun Jun 27 23:47:35 BST 2004 Olly Betts <olly@survex.com>
4790 * omega.cc: Change MORELIKE to pick up to 40 terms, rather than up to
4791 6 (feedback on the mailing list suggests this gives much better
4794 Fri Jun 11 02:22:38 BST 2004 Olly Betts <olly@survex.com>
4796 * scriptindex.cc: Added catch for std::bad_alloc.
4798 Mon Apr 19 14:43:17 BST 2004 Olly Betts <olly@survex.com>
4800 * NEWS: Final update for 0.8.0.
4802 Sun Apr 18 22:31:24 BST 2004 Olly Betts <olly@survex.com>
4804 * omindex.cc: Only need _POSIX_C_SOURCE on Linux, and it seems to
4805 cause problems with Sun's C++ compiler.
4807 Sun Apr 18 17:50:35 BST 2004 Olly Betts <olly@survex.com>
4809 * omindex.cc: _POSIX_C_SOURCE works better than _POSIX_SOURCE for
4810 making posix_fadvise prototype visible on Linux.
4812 Thu Apr 15 02:05:49 BST 2004 Olly Betts <olly@survex.com>
4814 * omindex.cc: And another _POSIX_SOURCE attempt!
4816 Thu Apr 15 01:43:51 BST 2004 Olly Betts <olly@survex.com>
4818 * omindex.cc: Another stab at _POSIX_SOURCE...
4820 Thu Apr 15 01:25:29 BST 2004 Olly Betts <olly@survex.com>
4822 * omindex.cc: Added a missing underscore (_POSIX_SOURCE not
4825 Thu Apr 15 00:48:12 BST 2004 Olly Betts <olly@survex.com>
4827 * omindex.cc: Defined POSIX_SOURCE to a suitable value to get
4828 posix_fadvise on some versions of redhat.
4830 Mon Apr 12 01:06:58 BST 2004 Olly Betts <olly@survex.com>
4832 * NEWS,configure.in: Version 0.8.0.
4834 Mon Apr 12 00:03:57 BST 2004 Olly Betts <olly@survex.com>
4836 * indextext.cc,query.cc: Don't create R terms for terms which start
4839 Sun Apr 11 23:47:33 BST 2004 Olly Betts <olly@survex.com>
4841 * omindex.cc: Fixed inconsistent indenting.
4843 Sun Apr 11 23:11:51 BST 2004 Olly Betts <olly@survex.com>
4845 * omindex.cc: Call posix_fadvise with POSIX_FADV_DONTNEED just before
4846 closing an input file. Again should help improve indexing
4849 Fri Apr 02 16:09:03 BST 2004 Olly Betts <olly@survex.com>
4851 * configure.in,omindex.cc: Use O_STREAMING and/or posix_fadvise()
4852 when reading files to be indexed (if available). This helps to
4853 keep the Xapian database in cache, and greatly improve indexing
4856 Tue Mar 30 00:06:15 BST 2004 Olly Betts <olly@survex.com>
4858 * NEWS: We're now putting omega news here rather than in xapian-core
4859 so composed draft version for the forthcoming 0.8.0 release.
4861 Tue Mar 29 23:56:27 BST 2004 Olly Betts <olly@survex.com>
4863 * templates/xml: Remove unused OmegaScript code:
4864 `$set{topterms,$or{$ne{$msize,0},$query}}'.
4866 Tue Mar 29 23:55:40 BST 2004 Olly Betts <olly@survex.com>
4868 * Makefile.am: scriptindex needs to link to getopt.c and getopt1.c.
4870 Tue Mar 23 19:20:19 GMT 2004 Olly Betts <olly@survex.com>
4872 * templates/xml: Correct spelling of `relavence' to `relevance'.
4873 NB: if you're parsing the XML output, you'll need to fix this
4874 spelling in your parser!
4876 Sun Mar 21 14:23:23 GMT 2004 Olly Betts <olly@survex.com>
4878 * scriptindex.cc: Use getopt for option parsing. Change default to
4879 *not* overwriting the database (use --overwrite if you really want
4880 to do this); -u is now accepted but ignored.
4882 Fri Mar 12 02:11:28 GMT 2004 Olly Betts <olly@survex.com>
4884 * templates/xml: "Content-Type: application/html" is more appropriate
4887 Fri Mar 12 02:09:33 GMT 2004 Olly Betts <olly@survex.com>
4889 * omindex.cc: Added --overwrite option which forces an existing
4890 database to be deleted before indexing begins.
4892 Wed Mar 10 14:39:13 GMT 2004 Olly Betts <olly@survex.com>
4894 * templates/xml: "Content-Type: text/xml".
4896 Wed Mar 10 00:08:40 GMT 2004 Olly Betts <olly@survex.com>
4898 * docs/scriptindex.txt: Make more explicit that boolean produces a
4899 *single* boolean term.
4901 Tue Mar 09 19:08:19 GMT 2004 Olly Betts <olly@survex.com>
4903 * indextext.cc,omindex.cc,scriptindex.cc: Updated to use add_term()
4904 instead of add_term_nopos().
4906 Wed Mar 03 14:55:50 GMT 2004 Olly Betts <olly@survex.com>
4908 * scriptindex.cc: Use true/false for assigning to booleans, not 1/0.
4910 Sat Feb 21 18:33:15 GMT 2004 Olly Betts <olly@survex.com>
4912 * omega.cc,query.cc,docs/omegascript.txt: Added $httpheader
4913 Omegascript to allow arbitrary HTTP headers and alternative
4914 Content-Type headers to be specified.
4916 Sat Feb 14 00:32:06 GMT 2004 Olly Betts <olly@survex.com>
4918 * query.cc: If the probabilistic query was bad, don't try to run the
4921 Sat Feb 14 00:11:52 GMT 2004 Olly Betts <olly@survex.com>
4923 * docs/cgiparams.txt: Note that START and END should be in the format
4926 Sat Feb 14 00:07:41 GMT 2004 Olly Betts <olly@survex.com>
4928 * query.cc: Don't crash if there's a date filter but no probabilistic
4931 Wed Nov 26 22:44:49 GMT 2003 Olly Betts <olly@survex.com>
4933 * indextext.cc: Raw terms with a multicharacter prefix are now indexed
4934 with a : inserted (e.g. as XFOO:Rterm). This matches what the query
4937 Wed Nov 26 16:25:16 GMT 2003 Olly Betts <olly@survex.com>
4939 * configure.in: Version 0.7.5.
4941 Sun Nov 23 03:28:21 GMT 2003 Olly Betts <olly@survex.com>
4943 * query.cc,docs/omegascript.txt: Added note that $setmap{prefix,...}
4944 needs be used before any commands which require the query to be
4947 Thu Nov 20 02:44:55 GMT 2003 Olly Betts <olly@survex.com>
4949 * docs/omegascript.txt: Expanded documentation of $set and $setmap to
4950 include values which Omega itself makes use of.
4952 Thu Nov 20 02:43:03 GMT 2003 Olly Betts <olly@survex.com>
4954 * omega.cc,query.cc: Set default value for $opt{stemmer} to "english"
4955 rather than taking "" to mean English.
4957 Tue Oct 21 21:29:18 BST 2003 Olly Betts <olly@survex.com>
4959 * query.cc: Fixed $setmap{} to not add bogus entries.
4961 Tue Oct 21 21:20:31 BST 2003 Olly Betts <olly@survex.com>
4963 * query.cc: Allow the QueryParser prefix map to be set up using
4964 $setmap{prefix,...} (e.g. $setmap{prefix,subject,XT,abstract,XA}).
4966 Tue Oct 21 21:13:59 BST 2003 Olly Betts <olly@survex.com>
4968 * query.cc: Only parse probabilistic query once!
4970 Tue Oct 21 20:03:27 BST 2003 Olly Betts <olly@survex.com>
4972 * omega.cc,omega.h,query.cc,query.h: Reworked so that the
4973 probabilistic query isn't parsed until we need some
4974 information from it. This means that we can now use options
4975 set by the omegascript template to control the behaviour of the
4978 Thu Oct 16 21:17:01 BST 2003 Olly Betts <olly@survex.com>
4980 * omega.cc: Renamed `big_buf' to `query_string' and eliminated `more'
4981 flag and use of goto; tidied up order of reading CGI variables; use
4982 const refs to value strings in cgi_params map rather than copying
4985 Sat Oct 11 20:43:04 BST 2003 Olly Betts <olly@survex.com>
4987 * omega.cc,omega.h,query.cc: Make rset an object rather than a pointer
4990 Fri Oct 10 18:06:10 BST 2003 Olly Betts <olly@survex.com>
4992 * query.cc: Removed the unfinished code for caching omegascript
4993 command expansions. Added code to cache $dbsize. The only other
4994 value correctly marked for caching is already being cached!
4996 Thu Oct 02 15:18:19 BST 2003 Olly Betts <olly@survex.com>
4998 * configure.in: Version 0.7.4.
5000 Thu Oct 02 15:16:41 BST 2003 Olly Betts <olly@survex.com>
5002 * query.cc: $date doesn't require the match to be run to work, but
5005 Tue Sep 30 18:32:25 BST 2003 Olly Betts <olly@survex.com>
5007 * query.cc: Cleaner version of T macro.
5009 Tue Sep 30 18:09:30 BST 2003 Olly Betts <olly@survex.com>
5011 * query.cc: Hopefully the final piece in the Sun C++ puzzle.
5013 Tue Sep 30 00:59:50 BST 2003 Olly Betts <olly@survex.com>
5015 * query.cc: Cleaned up a recent fix by using clean generic code which
5016 works on Sun's C++ too.
5018 Mon Sep 29 17:12:10 BST 2003 Olly Betts <olly@survex.com>
5020 * cgiparam.cc: Portability fixes for Sun's C++ compiler.
5022 Mon Sep 29 13:26:22 BST 2003 Olly Betts <olly@survex.com>
5024 * query.cc: Another Sun C++ fix.
5026 Mon Sep 29 11:49:30 BST 2003 Olly Betts <olly@survex.com>
5028 * query.cc,omega.cc: More fixes for Sun's really rather rubbish
5031 Mon Sep 29 01:39:56 BST 2003 Olly Betts <olly@survex.com>
5033 * query.cc: Fixes for compiling with Sun's C++ compiler.
5035 Mon Sep 29 01:17:39 BST 2003 Olly Betts <olly@survex.com>
5037 * omega.cc: Added workaround for compilation problem with Sun's C++.
5039 Fri Aug 08 01:39:51 BST 2003 Olly Betts <olly@survex.com>
5041 * configure.in: Version 0.7.3.
5043 Sat Aug 02 01:52:38 BST 2003 Olly Betts <olly@survex.com>
5045 * configure.in,omindex.cc,query.cc: Fixed to compile on mingw
5046 where ftime() returns void.
5048 Fri Aug 01 20:59:57 BST 2003 Olly Betts <olly@survex.com>
5050 * scriptindex.cc: Added #define for sleep() on __WIN32__.
5052 Wed Jul 30 19:05:17 BST 2003 Olly Betts <olly@survex.com>
5054 * getopt.h: Copied over latest getopt.h from xapian-core.
5056 Sun Jul 27 16:34:19 BST 2003 Olly Betts <olly@survex.com>
5058 * Makefile.am,getopt.c,getopt.h,getopt1.c: Copied our version of GNU
5059 getopt here from xapian-core so we can build omindex on non-glibc
5060 platforms (modifications are for better C++ compatibility).
5062 Mon Jul 21 01:16:59 BST 2003 Olly Betts <olly@survex.com>
5064 * configure.in: Use libtool; OM_PATH_XAPIAN -> XO_LIB_XAPIAN.
5066 Sat Jul 19 19:26:03 BST 2003 Olly Betts <olly@survex.com>
5068 * omindex.cc: Added missing `#include <errno.h>'.
5070 Sat Jul 19 19:24:50 BST 2003 Olly Betts <olly@survex.com>
5072 * indextext.cc: Fixed signed character issue.
5074 Thu Jul 17 00:51:42 BST 2003 Olly Betts <olly@survex.com>
5076 * bootstrap: Removed bootstrap in favour of top-level bootstrap.
5078 Tue Jul 15 16:27:52 BST 2003 Olly Betts <olly@survex.com>
5080 * omindex.cc: file_to_string() and stdout_to_string() now throw an
5081 exception on a read error, avoiding the " "-for-empty-file bodge.
5083 Tue Jul 15 15:18:32 BST 2003 James Aylett <james@tartarus.org>
5085 * omindex.cc: fix file_to_string() to return the file on
5086 success, and not leak memory on empty files. Fix callers
5087 to give up on unreadable files, not vice versa. Fix
5088 logging messages to distinguish re-indexed/added.
5090 Fri Jul 11 15:09:55 BST 2003 Olly Betts <olly@survex.com>
5092 * configure.in: Version 0.7.2.
5094 Fri Jul 11 12:08:57 BST 2003 Olly Betts <olly@survex.com>
5096 * omega.cc: If the same database is listed more than once, only search
5097 the first occurrence.
5099 Fri Jul 11 11:57:24 BST 2003 Olly Betts <olly@survex.com>
5101 * configure.in,utils.cc: Use snprintf.
5103 Tue Jul 08 17:56:39 BST 2003 Olly Betts <olly@survex.com>
5105 * configure.in: Version 0.7.1.
5107 Tue Jul 08 17:34:01 BST 2003 Olly Betts <olly@survex.com>
5109 * omindex.cc: Fixed compilation problem.
5111 Fri Jul 04 22:12:32 BST 2003 Olly Betts <olly@survex.com>
5113 * bootstrap: add missing ';;' as case pattern delimiter
5115 Thu Jul 03 23:34:50 BST 2003 Olly Betts <olly@survex.com>
5117 * configure.in: Version 0.7.0.
5119 Thu Jul 03 23:33:05 BST 2003 Olly Betts <olly@survex.com>
5121 * omindex.cc: Abort parsing of document if it's excluded from
5122 indexing; ignore anything outside of the first <body>...</body>,
5125 Tue Jun 24 00:45:28 BST 2003 Olly Betts <olly@survex.com>
5127 * docs/overview.txt: Added note about hashing of long URL terms and
5128 reworked structure a little.
5130 Mon Jun 23 21:11:41 BST 2003 Olly Betts <olly@survex.com>
5132 * bootstrap: Check for Bison 1.875 which doesn't work with Xapian.
5134 Mon Jun 23 16:52:47 BST 2003 Olly Betts <olly@survex.com>
5136 * omega.cc,omindex.cc,scriptindex.cc: Xapian::PostListIterator ->
5137 Xapian::PostingIterator.
5139 Thu Jun 19 20:02:00 BST 2003 Olly Betts <olly@survex.com>
5141 * symboltab.h: Convert hardspace to space.
5143 Wed Jun 18 16:32:34 BST 2003 Olly Betts <olly@survex.com>
5145 * scriptindex.cc: Removed already disabled unique id hashing to docid
5146 code. Xapian doesn't support setting arbitrary docids - if it ever
5147 does we can retrieve this code from CVS.
5149 Wed Jun 18 16:28:33 BST 2003 Olly Betts <olly@survex.com>
5151 * Makefile.am,indextext.cc,indextext.h,omindex.cc,scriptindex.cc:
5152 Normalise accents in probabilistic terms.
5154 Tue Jun 17 17:54:32 BST 2003 Olly Betts <olly@survex.com>
5156 * omindex.cc: Read output from pstotext and pdftotext via pipes rather
5157 than temporary files to side-step the whole problem of secure
5158 temporary file creation; Use pdfinfo to get the title and keywords
5159 from when indexing a PDF; Safe filename escaping tweaked to not
5160 escape common safe punctuation.
5162 Tue Jun 17 17:50:00 BST 2003 Olly Betts <olly@survex.com>
5164 * htmlparse.cc,htmlparse.h: Moved initialisation of named_ents out of
5165 header - it's not a sensible candidate for inlining.
5167 Wed Jun 11 02:32:25 BST 2003 Olly Betts <olly@survex.com>
5169 * date.cc,date.h,omega.cc,omega.h,omindex.cc,query.cc,query.h,
5170 scriptindex.cc: Om -> Xapian::, etc.
5172 Fri Jun 6 01:04:12 BST 2003 Richard Boulton <richard@tartarus.org>
5174 * omindex.cc: Implement an upper limit on the length of URL
5175 terms. Currently, this is set at 240 characters - it can
5176 probably be increased slightly, but I'm not sure exactly
5177 how long a term can safely be. If the URL term would be
5178 longer than this, its last few bytes are replaced by a
5179 hash of the tail of the URL. This means that (apart from
5180 hopefully very rare collisions) urlterms should still be
5181 unique ids for documents.
5183 Fri Jun 06 00:14:13 BST 2003 Richard Boulton <richard@tartarus.org>
5185 * omindex.cc: Clean up processing of HTML documents:
5186 - Ignore the contents of <script> and <style> tags in HTML.
5187 - Strip initial whitespace in each tag in an HTML document.
5188 - Try not to split words in half when truncating title and
5191 Tue Jun 03 11:15:28 BST 2003 Olly Betts <olly@survex.com>
5193 * templates/query: Create log entry in query.log.
5195 Thu May 29 18:03:54 BST 2003 Olly Betts <olly@survex.com>
5197 * query.cc: Fixed bug in DEFAULT_LOG_ENTRY's Omegascript.
5199 Thu May 29 00:22:28 BST 2003 Olly Betts <olly@survex.com>
5201 * query.cc: Set STEM_LANGUAGE near the start of the file so it's easy
5202 for users to change until we get better configurability.
5204 Thu May 29 00:00:28 BST 2003 Olly Betts <olly@survex.com>
5206 * Makefile.am,date.cc,date.h,query.cc: Split code to build a
5207 date range filter into a separate file.
5209 Wed May 28 23:38:02 BST 2003 Olly Betts <olly@survex.com>
5211 * configfile.cc,configfile.h,omega.cc,omega.conf,query.cc,query.h,
5212 docs/omegascript.txt,docs/overview.txt,docs/quickstart.txt:
5213 Replaced half-hearted logging support with flexible
5214 OmegaScript-based approach with new $log command. Also added
5215 $now to allow the current date/time to be logged.
5217 Tue May 27 17:55:24 BST 2003 Olly Betts <olly@survex.com>
5219 * query.cc: Added missing "#include <assert.h>".
5221 Mon May 26 22:41:26 BST 2003 Olly Betts <olly@survex.com>
5223 * configure.in: Don't use libtool; Use AC_CONFIG_FILES - it's the new
5226 Mon May 26 12:12:22 BST 2003 Olly Betts <olly@survex.com>
5228 * omega.spec.in: Removed %changelog - it hasn't been reliably updated
5229 and only really makes sense when the packaging is done by a third
5232 Mon May 26 12:01:55 BST 2003 Olly Betts <olly@survex.com>
5234 * query.cc: If the query is empty, don't bother running it through
5237 Wed Apr 30 01:18:47 BST 2003 Olly Betts <olly@survex.com>
5239 * docs/cgiparams.txt,docs/omegascript.txt: Minor improvements.
5241 Wed Apr 30 01:14:46 BST 2003 Olly Betts <olly@survex.com>
5243 * query.cc: Use correct types for docid and value_no in $value.
5245 Wed Apr 23 16:15:07 BST 2003 Sam Liddicott <sam@liddicott.com>
5247 * templates/xml: add collapse info to xml template.
5249 Wed Apr 23 14:00:37 BST 2003 Olly Betts <olly@survex.com>
5251 * omega.spec.in: Merged changes from Fabrice Colin.
5253 Thu Apr 10 03:14:51 BST 2003 Olly Betts <olly@survex.com>
5255 * configure.in: Updated for 0.6.5 release.
5257 Wed Apr 09 13:56:14 BST 2003 Olly Betts <olly@survex.com>
5259 * omega.cc,query.cc,omega.h,docs/cgiparams.txt: Renamed DATE1, DATE2,
5260 and DAYSMINUS to the more meaningful START, END, and SPAN (NB SPAN
5261 is days before END, or after START, or before today - whereas
5262 SPAN was before *DATE1* or before today). The old parameters names
5263 are supported (with the original semantics) for now.
5265 Wed Apr 09 13:44:28 BST 2003 Olly Betts <olly@survex.com>
5267 * Makefile.am: Install docs in /usr/share/doc/omega to be FHS
5269 * omega.spec.in: Consistently use %{contentdir} instead of /var/lib;
5270 removed redundant second setting of %docdir.
5272 Wed Apr 09 01:21:57 BST 2003 Olly Betts <olly@survex.com>
5274 * Makefile.am: Removed bogus extra "\".
5276 Mon Mar 31 19:42:24 BST 2003 Olly Betts <olly@survex.com>
5278 * Makefile.am: Install documentation!
5279 * omega.spec.in: Merged in changes to RPM packaging from Fabrice Colin
5280 and reworked further.
5282 Fri Mar 28 17:47:45 GMT 2003 Olly Betts <olly@survex.com>
5284 * templates/query,templates/query2: Removed bogus setting of defunct
5285 xB parameter; correctly propagate multiple B parameters.
5287 Fri Mar 28 17:45:41 GMT 2003 Olly Betts <olly@survex.com>
5289 * omindex.cc: Report correct version number (was hard-wired to 1.0!)
5291 Tue Mar 25 14:46:10 GMT 2003 Olly Betts <olly@survex.com>
5293 * query.cc: If xP and P are both empty, classify as SAME_QUERY not
5294 NEW_QUERY as there may be a boolean query too.
5295 * query.cc: Fixed off-by-one error in rounding down topdoc - it was
5296 possible to get to an empty page of hits if there were exactly a
5297 multiple of HITSPERPAGE matches and the matcher over-estimated the
5298 number of matches and Omega displayed page links.
5300 Mon Mar 24 09:40:04 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk>
5302 * templates/query: Added propagation of B boolean filter
5303 * templates/query2: factored about a bit more, query2 is
5304 a more modular version of query which will ultimately
5305 lend itself to customisation a bit more to the uninitiated.
5307 Tue Mar 04 01:02:12 GMT 2003 Olly Betts <olly@survex.com>
5309 * omega.cc: Fixed handling of multiple DB parameters to be as
5312 Fri Feb 28 09:52:03 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk>
5314 * Added $collapsed to omegascript to give the number of hits
5315 collapsed into the current hit, eg:
5317 $if{$ne{$collapsed,0},$collapsed hidden results
5318 ($value{$cgi{COLLAPSE}})}
5320 * templates/godmode: removed euro ferret icon reference
5321 * templates/godmode: added value dumping, for values from 0-255
5323 Thu Feb 27 11:58:13 GMT 2003 Olly Betts <olly@survex.com>
5325 * Makefile.am,query.cc,docs/omegascript.txt,templates/query:
5326 Added $transform{} which does regexp manipulation (currently
5327 disabled); Added $uniq{} to eliminate duplicates from a sorted
5328 list; Fixed a query with repeated terms to be identified as
5329 SAME_QUERY not EXTENDED_QUERY; remove duplicates from terms
5330 listed in term frequencies.
5332 Wed Feb 26 17:50:26 GMT 2003 Olly Betts <olly@survex.com>
5334 * scriptindex.cc: Allow '_' in fieldnames. Diagnose bad characters
5335 in fieldnames better.
5337 Wed Feb 26 15:13:02 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk>
5339 * dbi2omega: Add DBUSER and DBPASSWD env var support so that password
5340 protected DB's can easily be used
5341 * add cgi parameter COLLAPSE to collapse on key values
5342 * Add $value{key[,docid]} support to omegascript
5344 Wed Feb 26 09:58:01 GMT 2003 Sam Liddicott <sam.liddicott@orange.co.uk>
5346 * bootstrap: Fix success message when building in non-src dir
5347 as configure is written to the src dir.
5349 Mon Jan 6 12:47:55 GMT 2003 James Aylett <james@tartarus.org>
5351 * scriptindex.cc: build fix
5353 Tue Dec 24 20:12:23 GMT 2002 Olly Betts <olly@survex.com>
5355 * configure.in: Version 0.6.4.
5357 Tue Dec 24 20:06:47 GMT 2002 Olly Betts <olly@survex.com>
5359 * scriptindex.cc: Minor tweak.
5361 Tue Dec 24 19:58:57 GMT 2002 Olly Betts <olly@survex.com>
5363 * omega.cc,docs/cgiparams.txt: Prefer MINHITS to MIN_HITS and
5364 RAWSEARCH to RAW_SEARCH since none of the other CGI parameter
5365 names have _ separating words. Also support old names for now.
5367 Mon Dec 23 03:23:33 GMT 2002 Olly Betts <olly@survex.com>
5369 * query.cc,docs/omegascript.txt,templates/query: Added $unstem to map
5370 a stemmed term to the form(s) used in the query; $queryterms now
5371 only includes the first occurrence of each stemmed form; $prettyterm
5372 uses the unstem map.
5374 Sat Dec 21 17:47:33 GMT 2002 Olly Betts <olly@survex.com>
5376 * scriptindex.cc,docs/scriptindex.txt: Replaced index=nopos with
5377 indexnopos action; index and indexnopos now take an optional
5378 prefix argument; index=nopos is handled specially for backwards
5381 Sat Dec 21 17:18:02 GMT 2002 Olly Betts <olly@survex.com>
5383 * scriptindex.cc,docs/scriptindex.txt: Added new scriptindex action
5384 date=FORMAT to generate terms for date range searching.
5386 Sat Dec 21 01:51:32 GMT 2002 Olly Betts <olly@survex.com>
5388 * templates/query: Stop topterms sticking out of green box with
5389 gecko based browsers.
5391 Sat Dec 21 01:44:53 GMT 2002 Olly Betts <olly@survex.com>
5393 * Makefile.am: Distribute docs/scriptindex.txt.
5394 * docs/omegascript.txt: It's $setrelevant not $set_relevant.
5396 Sat Dec 14 13:54:10 GMT 2002 Olly Betts <olly@survex.com>
5398 * configure.in: Version 0.6.3; removed -Wno-long-long as we don't use
5400 * query.cc: Compilation fixes.
5401 * templates/query: Don't call $topterms twice!
5403 Sat Dec 14 01:10:48 GMT 2002 Olly Betts <olly@survex.com>
5405 * query.cc: Updated in line with removal of OmSettings.
5407 Wed Dec 11 00:58:49 GMT 2002 Olly Betts <olly@survex.com>
5409 * configure.in,query.cc,docs/omegascript.txt,templates/query:
5410 Added $time which reports how long the match took - when searching
5411 on a remote website, it's hard to gauge how much time is taken by
5412 the search, and how much by the web server and browser; renamed
5413 and_vec to or_vec which better describes its purpose.
5415 Mon Dec 09 17:11:26 GMT 2002 Olly Betts <olly@survex.com>
5417 * query.cc,docs/omegascript.txt,templates/query: Added $dbsize
5418 to return the number of documents in the database being searched.
5419 Use this in the default query template on the "front page" shown
5420 when there's no search.
5422 Mon Dec 09 02:55:46 GMT 2002 Olly Betts <olly@survex.com>
5424 * query.cc,docs/omegascript.txt,templates/query: Added $msizeexact
5425 which returns "true" if $msize if exact (or "" if it is estimated).
5426 This means that you'll see "... of about N matches" less often -
5427 notably it's gone when searching for a single term, which is a
5430 Sun Dec 08 08:42:47 GMT 2002 Olly Betts <olly@survex.com>
5432 * scriptindex.cc: Replaced icky unportable code which set the filename
5433 to "/dev/fd/0" in order to read from stdin.
5435 Sun Dec 08 06:39:30 GMT 2002 Olly Betts <olly@survex.com>
5437 * query.cc,docs/omegascript.txt: Fixed $hitlist to complain if more
5438 than one parameter is passed; $topterms now defaults to 16 terms
5439 rather than 20; $topterms now weeds out terms which stem to the
5440 same as those in the query, or those already in $topterms.
5442 Sun Dec 08 06:36:04 GMT 2002 Olly Betts <olly@survex.com>
5444 * templates/query: Make background white - the very light grey just
5445 looks dirty; fixed exclusion of TopTerms Javascript when there
5446 are not TopTerms; sample now <small>; language and size now
5447 appear when the corresponding fields are present; fixed
5448 unmatched </small>; fixed missing list of terms matching
5451 Sat Dec 07 21:20:31 GMT 2002 Olly Betts <olly@survex.com>
5453 * configure.in: Version 0.6.2.
5455 Sat Dec 07 21:04:31 GMT 2002 Olly Betts <olly@survex.com>
5457 * query.cc: Prefer "while (true)" to "while (1)".
5459 Fri Dec 06 04:41:05 GMT 2002 Olly Betts <olly@survex.com>
5461 * omindex.cc: Index .php files by default; non-zero return code if
5462 an exception is caught.
5464 Fri Dec 06 04:30:17 GMT 2002 Olly Betts <olly@survex.com>
5466 * htmlparse.cc: Ignore PHP tags and their contents; fixed tag
5467 scanning code to never read one character past the end of
5470 Wed Dec 04 18:42:51 GMT 2002 Olly Betts <olly@survex.com>
5472 * omega.cc,omega.h,omindex.cc,query.cc,scriptindex.cc:
5473 Updated in line with OmSettings related changes to the API.
5475 Wed Dec 04 17:13:43 GMT 2002 Olly Betts <olly@survex.com>
5477 * query.cc: Fixed $dbname to return "default" for the default
5478 database, rather than "" - this fixes paging in searches of the
5480 * templates/query: Removed xDEFAULTOP hidden field which is no longer
5483 Wed Dec 04 11:57:13 GMT 2002 Olly Betts <olly@survex.com>
5485 * templates/query: Removed bogus unmatched '}'.
5487 Thu Nov 28 20:24:08 GMT 2002 Olly Betts <olly@survex.com>
5489 * omega.cc,query.cc: Updated in line with OmEnquire::get_eset() no
5490 longer taking an OmSettings object.
5492 Wed Nov 27 19:02:12 GMT 2002 Olly Betts <olly@survex.com>
5494 * dbi2omega: Return fields in table order; more efficient;
5495 report any error reading a row; if we get a NULL field,
5496 don't output it, and suppress perl warning about use of
5497 an undefined program.
5499 Wed Nov 27 05:22:04 GMT 2002 Olly Betts <olly@survex.com>
5501 * configure.in: Set version to 0.6.0.
5503 Wed Nov 27 05:21:00 GMT 2002 Olly Betts <olly@survex.com>
5505 * configure.in,htmlparse.h,omindex.cc,scriptindex.cc:
5506 Use "-Wall -W" rather than "-Wall -Wunused", and fixed the
5507 warnings this reveals.
5509 Wed Nov 27 04:20:13 GMT 2002 Olly Betts <olly@survex.com>
5511 * Makefile.am,dbi2omega: Added perl script to dump any database
5512 which perl DBI can access into the dump format expected by
5515 Wed Oct 30 02:02:32 GMT 2002 Olly Betts <olly@survex.com>
5517 * omega.spec.in: Use bootstrap instead of buildall; don't use "-j4"
5518 with make - most people don't all have quad processor boxes!
5520 Wed Oct 30 01:56:31 GMT 2002 Olly Betts <olly@survex.com>
5522 * buildall: Removed in favour of bootstrap script.
5524 Tue Oct 29 02:01:58 GMT 2002 Olly Betts <olly@survex.com>
5526 * omindex.cc,scriptindex.cc: Added MAX_PROB_TERM_LENGTH (set to
5527 64) to limit size of probabilistic terms.
5529 Sat Oct 12 17:09:55 BST 2002 Olly Betts <olly@survex.com>
5531 * bootstrap: Copied bootstrap script from xapian-core.
5533 Sat Oct 12 17:05:37 BST 2002 Olly Betts <olly@survex.com>
5535 * configure.in: Version 0.5.3.
5537 Wed Oct 09 16:55:56 BST 2002 Olly Betts <olly@survex.com>
5539 * omega.cc,omega.h,query.cc,docs/{cgiparams.txt,omegascript.txt},
5540 templates/query: revamped the "reset first page when filter changes"
5541 scheme - all filtery things are now serialised and put into the
5542 xFILTER CGI parameter, which copes with multiple B values. Support
5543 for the old way (xB, xDATE1, xDATE2, xDAYSMINUS, xDEFAULTOP) is
5544 included for now (but only copes with a single B value). Added (and
5545 documented) $filters Omegascript command to implement this.
5546 * query.cc: fixed handling of case when topdoc is non-zero, but
5547 no matches were found. This was causing topdoc to be set to -6!
5548 * query.cc: fixed handling of prefixes starting with an X.
5550 Wed Oct 09 15:35:54 BST 2002 Olly Betts <olly@survex.com>
5552 * .cvsignore: Added scriptindex and omega-*.tar.gz; removed libtool.
5554 Sun Oct 06 18:56:40 BST 2002 Olly Betts <olly@survex.com>
5556 * configure.in: Version 0.5.2.
5558 Thu Oct 03 16:42:06 BST 2002 Olly Betts <olly@survex.com>
5560 * query.cc: Added CMD_hit to enumeration.
5562 Wed Oct 02 17:02:25 BST 2002 Olly Betts <olly@survex.com>
5564 * configure.in: Version 0.5.1.
5565 * Makefile.am,configure.in: require automake 1.6.3 and autoconf 2.54
5566 since xapian-core does anyway, and it neatens configure.in slightly.
5568 Wed Oct 02 16:58:39 BST 2002 Olly Betts <olly@survex.com>
5570 * query.cc,docs/omegascript.txt: Added $hit which gives the m-set
5571 number of the current hit.
5573 Sun Sep 22 15:47:33 BST 2002 Olly Betts <olly@survex.com>
5575 * configfile.cc: Corrected use of string.data() to string.c_str().
5577 Sun Sep 22 03:53:35 BST 2002 Olly Betts <olly@survex.com>
5579 * templates/query: Updated xapian url to http://www.xapian.org/
5581 Fri Sep 20 15:36:35 BST 2002 Olly Betts <olly@survex.com>
5583 * configure.in: Version 0.5.0.
5585 Sun Sep 15 03:07:31 BST 2002 Richard Boulton <richard.boulton@omsee.com>
5587 * buildall: Update to latest version, to fix bug with VPATH version
5588 checking for autoconf.
5590 Thu Sep 12 15:11:16 BST 2002 Olly Betts <olly@survex.com>
5592 * htmlparse.cc: Add comment about string::replace() invalidating
5595 Thu Sep 12 13:38:05 BST 2002 Olly Betts <olly@survex.com>
5597 * omegascript.vim,omegascript.txt,query.cc: cosmetic tweaks.
5599 Thu Sep 5 14:47:54 BST 2002 Richard Boulton <richard@tartarus.org>
5601 * configure.in: Don't use libtool. I don't know why I ever thought
5604 Thu Sep 5 14:11:51 BST 2002 Richard Boulton <richard@tartarus.org>
5606 * query.cc: Change $and to return true iff all its arguments are
5607 not false, rather than if one or more of the arguments is false.
5608 * docs/omegascript.txt: Update documentation of $and{}
5610 Fri Aug 23 13:27:02 BST 2002 James Aylett <tartarus@users.sourceforge.net>
5612 * docs/quickstart.txt: encourage people to call their first
5613 database 'default' since this will work straight off.
5615 Wed Aug 21 17:52:36 BST 2002 Richard Boulton <richard@tartarus.org>
5617 * query.cc: Add $slice{} command, to slice a list at a set of
5618 positions (given by a second list).
5619 Also, bugfix: require $hitlist{} to take at least one parameter:
5620 it currently segfaults if given none.
5621 * docs/omegascript.txt: Document $slice{}.
5622 * extra/omegascript.vim: Update syntax highlighting.
5624 Wed Aug 21 18:03:43 BST 2002 James Aylett <tartarus@users.sourceforge.net>
5626 * omindex.cc: tidy up output so it doesn't wrap so much
5628 Wed Aug 21 18:01:38 BST 2002 James Aylett <tartarus@users.sourceforge.net>
5630 * htmlparse.cc: fixed bug in entity reference handling
5632 Wed Aug 21 13:21:12 BST 2002 James Aylett <tartarus@users.sourceforge.net>
5634 * omindex.cc: Bugfix to metaterm generation when operating on an
5635 absolute URL that is also at the root of its web server.
5637 Wed Aug 21 10:48:06 BST 2002 Richard Boulton <richard@tartarus.org>
5639 * scriptindex.cc: If a field has multiple instances, keep all of
5640 them (previously only kept the final occurrence).
5641 * docs/scriptindex.txt: Mention that multiple instances of fields
5644 Tue Aug 20 18:02:45 BST 2002 James Aylett <tartarus@users.sourceforge.net>
5646 * docs/quickstart.txt: correct for new(ish) omindex behaviour
5648 Sat Aug 17 13:38:57 BST 2002 Richard Boulton <richard@tartarus.org>
5650 * extra/omegascript.vim: Quick attempt at a vim syntax highlighting
5651 file for omegascript. Recognises files only if they're in a
5652 directory called "templates": perhaps we should adopt a suffix to
5653 make recognition easier.
5654 Read the file for installation instructions.
5656 Thu Aug 15 11:21:20 BST 2002 Richard Boulton <richard@tartarus.org>
5658 * scriptindex.cc: Allow updating of databases by a command line
5659 switch, and also turn off verbose output (can be turned back
5661 * docs/scriptindex.txt: Document the "unique" tag.
5663 Thu Aug 15 11:18:21 BST 2002 Richard Boulton <richard@tartarus.org>
5665 * buildall: Copy buildall from xapian-core - the old one breaks
5666 for me (due to odd aclocal paths) but the new one is fine.
5667 We should make a common module to hold build stuff to be shared
5668 between modules, though.
5670 Mon Aug 12 01:34:42 BST 2002 Richard Boulton <richard@tartarus.org>
5672 * scriptindex.cc: Bug fix - index without positional information
5673 if "nopos" is specified, rather than the other way around.
5674 Bug fix - don't completely eradicate newlines in multiline values,
5675 until they have a chance to be converted to spaces.
5676 Delete documents if no fields other than unique fields are
5678 Add some simple debugging, and write messages to a log file in
5679 the database directory.
5681 * configure.in: Use libtool.
5683 Fri Aug 9 13:57:32 BST 2002 Richard Boulton <richard@tartarus.org>
5685 * scriptindex.cc: Fix compile errors, by changing string
5686 constructors to take begin and end iterators, instead of a begin
5689 Fri Jul 05 19:33:55 BST 2002 Olly Betts <olly@survex.com>
5691 * omega.spec.in: Fixed wrt /usr/lib/omega/bin/omega.
5693 Fri Jul 05 19:20:05 BST 2002 Olly Betts <olly@survex.com>
5695 * Makefile.am, docs/quickstart.txt: Install omega as
5696 ${prefix}/lib/omega/bin/omega.
5698 Thu Jul 04 02:11:46 BST 2002 Olly Betts <olly@survex.com>
5700 * scriptindex.cc, docs/scriptindex.txt: new indexer - indexing
5701 behaviour is controlled by a simple but powerful script.
5703 * Makefile.am: tidied up.
5705 * configfile.cc, docs/quickstart.txt: database and templates default to
5706 being in /var/lib/omega rather than /home/omega.
5708 * docs/quickstart.txt: describe the new test mode (command line) rather
5709 than the old one (stdin).
5711 * omega.cc, docs/cgiparams.txt: If xP isn't set, honour paging and
5712 R-set. So RAW_SEARCH now only disables snapping TOPDOC to a multiple
5715 * query.cc: "using namespace std;"
5717 Fri Jun 14 00:07:20 BST 2002 Olly Betts <olly@survex.com>
5719 * $prettyterm{} no longer adds a trailing '.' if the term also exists
5720 with an R prefix and stems to itself.
5722 Fri Jun 14 00:02:16 BST 2002 Olly Betts <olly@survex.com>
5724 * MORELIKE can now take a termname - this allows MORELIKE to be used
5725 with a unique id from an external database if it has been indexed
5728 Thu Jun 13 00:01:11 BST 2002 Olly Betts <olly@survex.com>
5730 * omega.conf: removed trailing slashes from directory names.
5732 * query.cc: removed extra slash added to template_dir; improved
5733 reporting of errors opening template file.
5735 Wed Jun 12 23:51:11 BST 2002 Olly Betts <olly@survex.com>
5737 * Added an alternative test mode - you can now pass parameters as
5738 command line arguments, which is more convenient for repeating
5739 the same test query, and for automated testing, e.g.:
5741 omega 'P=information retrieval' DB=papers
5743 If the first parameter starts with a "-" and doesn't contain an
5744 "=", omega now outputs the version string and stops (to gracefully
5745 handle "omega --version" and "omega --help".
5747 Wed Jun 12 23:39:20 BST 2002 Olly Betts <olly@survex.com>
5749 * omindex.cc: removed OLD_PREFIXES code - shout if you were using it.
5751 Fri May 17 14:09:25 BST 2002 Olly Betts <olly@survex.com>
5753 * Pass the database to the query parser (not used there at present,
5754 but will allow wildcarded searches, etc to be implemented).
5756 Thu May 16 17:57:34 BST 2002 Olly Betts <olly@survex.com>
5758 * <algo.h> -> <algorithm>.
5760 Thu May 16 15:41:14 BST 2002 Sam Liddicott <sam@ananova.com>
5762 * Removed extra package again!
5764 * Moved images to /var/www/icons/omega till we think of something
5765 better. Should be the most harmless solution that still works
5766 without requireing too much brains on the part of the installer
5768 Thu May 16 14:53:54 BST 2002 Sam Liddicott <sam@ananova.com>
5770 * Moved images to a separate optional package to stop touching
5771 user's web tree until we work out what to do. sysadmin can
5772 still install images if he wants and on a redhat box they will
5773 end up in the right place. This will no doubt get revisted later,
5776 Thu May 16 13:31:27 BST 2002 Sam Liddicott <sam@ananova.com>
5778 * Added loads more missing files like images and templates to the
5781 * Also fixed the templates to use the new images dir (if they used
5782 images, which they actually don't)
5784 Thu May 16 12:56:55 BST 2002 Sam Liddicott <sam@ananova.com>
5786 * Fixes to spec file to add various missing files
5788 Wed May 15 12:59:37 BST 2002 Olly Betts <olly@survex.com>
5790 * omindex now understand acronyms (N.A.T.O. E.T ...).
5792 * $highlight{} now understands "&" (AT&T M&S ...) and acronyms.
5794 Tue May 14 13:08:41 BST 2002 Olly Betts <olly@survex.com>
5796 * Index <word>&<word> as a single term (e.g. AT&T, M&S, A&P).
5798 Tue May 14 12:37:49 BST 2002 Olly Betts <olly@survex.com>
5800 * omindex.cc: cleaned up a little.
5802 Tue May 14 11:24:42 BST 2002 Olly Betts <olly@survex.com>
5804 * Fixed config.h inclusion; using std::*.
5806 Tue May 14 11:18:37 BST 2002 Olly Betts <olly@survex.com>
5810 Tue May 14 11:16:03 BST 2002 Olly Betts <olly@survex.com>
5812 * Added SORT and SORTBANDS.
5814 Mon May 13 12:52:29 BST 2002 Olly Betts <olly@survex.com>
5818 * Commented out omindex-config (since it's unfinished) and XML support
5819 (since only omindex-config uses it).
5821 Thu May 02 16:06:02 BST 2002 Olly Betts <olly@survex.com>
5823 * Updated to reflect removal of OmData.
5825 Wed May 01 11:26:59 BST 2002 Olly Betts <olly@survex.com>
5827 * Changed to use queryparser in libomqueryparser.
5829 Tue Apr 23 15:10:42 BST 2002 Olly Betts <olly@survex.com>
5831 * Make buildall smart enough to generate aclocal.m4 properly and
5832 remove acinclude.m4. It now also extracts the package name from
5833 configure.in so we can use the same buildall everywhere; fixed
5834 problem with double use of AM_CXXFLAGS in Makefile.am.
5836 Tue Apr 23 14:27:29 BST 2002 Olly Betts <olly@survex.com>
5838 * Updated for xapian-config and xapian.m4 changes.
5840 Thu Apr 18 14:37:05 BST 2002 Olly Betts <olly@survex.com>
5842 * Updated buildall; minor tweaks to configure.in.
5844 Wed Apr 17 12:31:18 BST 2002 Olly Betts <olly@survex.com>
5846 * Removed references to xapian-config uninst options.
5848 Fri Apr 12 15:48:33 BST 2002 Olly Betts <olly@survex.com>
5850 * Remove parsequery.cc on "make maintainer-clean".
5852 Fri Apr 12 16:19:19 BST 2002 Olly Betts <olly@survex.com>
5854 * Require automake 1.5.
5856 Fri Apr 12 12:47:04 BST 2002 Olly Betts <olly@survex.com>
5858 * Tweaked what gets interpreted as a phrase.
5860 Fri Apr 12 12:44:00 BST 2002 Olly Betts <olly@survex.com>
5862 * Fixed to use AM_CFLAGS and AM_CXXFLAGS.
5864 Mon Apr 01 23:34:09 BST 2002 Olly Betts <olly@survex.com>
5866 * Fixed support for decimal numeric entities (e.g. "ö")
5868 * Added support for all iso-8859-1 named entities (e.g. "ö")
5870 Mon Apr 01 15:07:31 BST 2002 Olly Betts <olly@survex.com>
5872 * Applied patch from "orion orion" to fix problem in HTML parsing.
5874 Mon Mar 25 13:11:14 GMT 2002 Olly Betts <olly@survex.com>
5876 * More tolerant treatment of random punctuation in query.
5878 Mon Feb 4 14:57:36 GMT 2002 Sam Liddicott <sam@ananova.com>
5880 * Added support for repeated fields in document data.
5881 $field{fieldname} may now return multiple tab separated values if
5882 more than one instance of a field exists in the document data
5884 Tue Jan 15 16:29:39 GMT 2002 Sam Liddicott <sam@ananova.com>
5886 * Fixed date_range_filter for the case where DATE1 and DATE2 don't
5887 share the same MONTH and YEAR and M## terms for intermediate months
5888 need calculating between the years.
5890 Thu Jan 10 15:39:43 GMT 2002 Sam Liddicott <sam@ananova.com>
5892 * Added $htmlstrip{} to strip out html tags
5894 Thu Jan 10 14:34:35 GMT 2002 James Aylett <tartarus@users.sourceforge.net>
5896 * toptermsjs snippet now included inside the HEAD, so it's
5897 actually legal HTML. Snippet now sets the required 'type'
5898 attribute as well. (It keeps the technically illegal
5899 'language' attribute because I have a sneaking suspicion it
5900 won't work otherwise.)
5902 Thu Jan 10 14:30:19 GMT 2002 James Aylett <tartarus@users.sourceforge.net>
5904 * $opt with two arguments now acts as a lookup for a $setmap
5905 map. This was previously documented in a misleading fashion.
5906 The new system is backwards compatible with the old.
5908 Wed Jan 9 Sam Liddicott <sam@ananova.com>
5910 * Added RAW_SEARCH as cgi param which when set stops change-search
5911 detection being performed and processes rset, topdoc and page-change
5912 parameters ( [ ] < > 1 2 etc etc ) anyway
5914 * Added MIN_HITS cgi param to request many more hits than can
5915 fit on the page so we can be confident that the next few
5916 consecutive pages will really be needed
5918 * Added xml template which when combined with RAW_SEARCH=1
5919 can be very useful when searching is done from another
5922 Fri Dec 21 17:56:02 GMT 2001 Olly Betts <olly@survex.com>
5924 * Namespace fixes to allow use of find and find_if on Redhat's
5927 Fri Dec 21 17:53:59 GMT 2001 Olly Betts <olly@survex.com>
5929 * Added quick'n'dirty interface to allow experimentation with
5932 Thu Dec 20 14:46:33 GMT 2001 Olly Betts <olly@survex.com>
5934 * Document xDB, xDAYSMINUS, xDATE1, xDATE2, xB.
5936 Thu Dec 20 12:55:29 GMT 2001 Olly Betts <olly@survex.com>
5938 * Use double quotes on parameters to <BODY>.
5940 Mon Dec 17 15:01:43 GMT 2001 Olly Betts <olly@survex.com>
5942 * Get rid of whitespace between hundreds and tens image in page
5945 Fri Dec 14 17:26:48 GMT 2001 Olly Betts <olly@survex.com>
5947 * Force first page of hits if DB, DEFAULTOP, B, DAYSMINUS, DATE1,
5948 or DATE2 changes; also clear relevance judgements if DB changes.
5950 Fri Dec 14 16:21:07 GMT 2001 Olly Betts <olly@survex.com>
5952 * Removed restriction on minimum page size (was 10) - for a shopping
5953 type application with images next to each hit, 5 or fewer per page
5954 might be reasonable; even one result per page makes sense for some
5957 Fri Dec 14 15:37:20 GMT 2001 Olly Betts <olly@survex.com>
5959 * Added $error to make nicer error reporting possible.
5961 Fri Dec 14 14:49:18 GMT 2001 Olly Betts <olly@survex.com>
5963 * Give more helpful messages for query syntax errors in cases where
5964 we can without elaborate YACC hackery.
5966 Thu Dec 13 15:10:24 GMT 2001 Olly Betts <olly@survex.com>
5968 * For image page buttons, display pages 10-999 by using 2 or 3 images.
5970 Thu Dec 13 15:02:16 GMT 2001 Olly Betts <olly@survex.com>
5972 * New operators: $div{}, $mod{}, $mul{}, $sub{}, $ge{}, $gt{}, $le{},
5975 Wed Dec 12 16:37:47 GMT 2001 Olly Betts <olly@survex.com>
5977 * Updated omegascript documentation.
5979 Wed Dec 12 15:43:19 GMT 2001 Olly Betts <olly@survex.com>
5981 * Fixed TOPDOC clipping.
5983 Wed Dec 12 15:36:20 GMT 2001 Olly Betts <olly@survex.com>
5985 * templates/query: Fixed typo which caused "..." to appear after
5986 page buttons when it wasn't appropriate.
5988 Wed Dec 12 15:11:23 GMT 2001 Olly Betts <olly@survex.com>
5990 * omega: Added stopword list (still hardcoded at present though).
5992 Wed Dec 12 12:46:57 GMT 2001 Olly Betts <olly@survex.com>
5994 * omindex: index unstemmed terms with prefix 'R' (mnemonic: Raw).
5996 * omega: $topterms will now return terms with prefix 'R'.
5998 * parsequery.yy: fixed handling of DEFAULT_OP; "+first second" and
5999 "-first second" now work; stopwording queries working (currently
6000 stopword list is hardwired to just "the") - stopwords are ignored
6001 when used as normal terms, but not in phrases, or with + and -.
6003 * templates/query: make use of $prettyterm{}.
6005 Wed Dec 12 11:11:30 GMT 2001 Olly Betts <olly@survex.com>
6007 * $highlight{} now uses find_if not find_first_of (faster).
6009 * Fixed detection of new/old/extended query when a term occurs
6010 in the query more than once.
6012 * Added $prettyterm{TERM} to convert a probabilistic term for
6013 display to the user.
6015 * $map would allow more than two arguments, but ignore them. Fixed
6016 to take exactly two.
6018 Fri Dec 07 15:59:21 GMT 2001 Olly Betts <olly@survex.com>
6020 * Added macros to OmegaScript.
6022 * template/query: updated to use macros.
6024 * Removed specialcase to allow no-argument commands to accept an empty
6025 argument list (e.g. "$thispage{}" rather than "$thispage"). The only
6026 reason this was useful was to allow "$thispage{}s" which can just as
6027 well be written using a comment to force the parser do what you want,
6028 e.g. "$thispage${}s".
6030 Thu Dec 06 18:59:34 GMT 2001 Olly Betts <olly@survex.com>
6032 * If a stemmer is set, and all_stem isn't, only stemmer terms starting
6033 with a lowercase letter.
6035 Thu Dec 06 18:49:40 GMT 2001 Olly Betts <olly@survex.com>
6037 * parsequery.yy: changed to use find_if() (faster than find_first_of()).
6039 Thu Dec 06 17:46:37 GMT 2001 Olly Betts <olly@survex.com>
6041 * Base page links on estimated number of matches, not minimum.
6043 Wed Dec 05 17:07:33 GMT 2001 Olly Betts <olly@survex.com>
6045 * omindex: minor speed tweaks.
6047 Wed Dec 05 16:52:21 GMT 2001 Olly Betts <olly@survex.com>
6049 * omindex: further HTML parser speed-ups.
6051 Wed Dec 05 16:31:33 GMT 2001 Olly Betts <olly@survex.com>
6053 * omindex: sped up HTML parsing.
6055 Wed Dec 05 14:52:53 GMT 2001 Olly Betts <olly@survex.com>
6057 * omindex: parsing terms from text is now twice as fast.
6059 Thu Nov 29 16:53:45 GMT 2001 Olly Betts <olly@survex.com>
6061 * NEAR phrases (e.g. "a NEAR b NEAR c") now work; removed "{a b c}"
6062 syntax for NEAR phrases.
6064 Thu Nov 29 15:25:54 GMT 2001 Olly Betts <olly@survex.com>
6066 * $highlight{} now allows you to specify the tags to use for the
6069 Thu Nov 29 15:24:53 GMT 2001 Olly Betts <olly@survex.com>
6071 * topdoc is unsigned so subtracting and then checking if it's < 0
6074 Wed Nov 28 15:45:39 GMT 2001 Olly Betts <olly@survex.com>
6076 * Fixed clipping of hit page in case when there are a multiple of
6077 HITSPERPAGE matches.
6079 Wed Nov 28 14:03:48 GMT 2001 Olly Betts <olly@survex.com>
6081 * Added $hostname{URL}; $version output now says "Xapian - omega
6084 Wed Nov 28 13:04:46 GMT 2001 Olly Betts <olly@survex.com>
6086 * docs/cgiparams.txt: Minor corrections and updates.
6088 Wed Nov 28 13:03:40 GMT 2001 Olly Betts <olly@survex.com>
6090 * If we're asked for a page of hits beyond the end of the matches, clip
6091 to the last page of matches rather than the first.
6093 Wed Nov 28 13:02:31 GMT 2001 Olly Betts <olly@survex.com>
6095 * For an EXTENDED_QUERY, force the first page of hits.
6097 Wed Nov 28 12:56:56 2001 James Aylett <tartarus@users.sourceforge.net>
6099 * Lower case terms when constructing the query (otherwise why
6100 do we store them in the database that way? :-)
6102 Wed Nov 28 12:36:49 GMT 2001 Olly Betts <olly@survex.com>
6104 * Fettled default query template.
6106 Wed Nov 28 12:33:52 GMT 2001 Olly Betts <olly@survex.com>
6108 * Request one more match than the last we want to display so we can
6109 tell if the next page of hits is empty or not - otherwise we risk
6110 offering a "next page" link when there are no more hits.
6112 Mon Nov 26 16:28:00 2001 James Aylett <tartarus@users.sourceforge.net>
6114 * --no-recurse / -l option added; useful if your sites are
6115 nested in their disc storage (particularly things like
6116 http://example.com/ being a distinct site, with
6117 http://example.com/product being within it)
6119 * --mime-type now really works (it was --mime-map in the code)
6121 * documentation updated further
6123 Mon Nov 26 14:39:00 2001 James Aylett <tartarus@users.sourceforge.net>
6125 * options parsing fixed so minimised/unrecognised long options
6128 Mon Nov 26 14:00:13 2001 James Aylett <tartarus@users.sourceforge.net>
6130 * omindex can now index part of a site (previously 'subsite')
6131 by having an index base within the site's disc storage
6133 Mon Nov 26 13:57:10 2001 James Aylett <tartarus@users.sourceforge.net>
6135 * Documentation updated for recent changes
6137 Thu Nov 22 13:24:45 GMT 2001 Olly Betts <olly@survex.com>
6139 * Use $nice{} in query template, but don't use $freqs. Use numbers as
6140 page image button tooltips on Netscape 4.
6142 Thu Nov 22 13:02:17 GMT 2001 Olly Betts <olly@survex.com>
6144 * Herded escaped CGI parameter mangling code back into cgiparam.cc;
6145 added special handling for numeric image button names.
6147 Thu Nov 22 12:55:00 GMT 2001 Olly Betts <olly@survex.com>
6149 * Fixed $nice to put the comma (or dot) in the right place.
6151 Tue Nov 20 17:30:19 GMT 2001 Olly Betts <olly@survex.com>
6153 * $lastpage now returns 0 when there are no matches (previously
6154 gave a very large answer).
6156 Tue Nov 20 12:30:47 GMT 2001 Olly Betts <olly@survex.com>
6158 * $terms now only returns terms which were in the parsed query
6159 (boolean filter terms are excluded).
6161 Tue Nov 20 12:07:54 GMT 2001 Olly Betts <olly@survex.com>
6163 * Fixed bug in date range filtering (got it wrong when start and end
6164 date were in the same month).
6166 * DAYSMINUS now counts back from DATE1 (if specified) rather than
6167 always counting back from the present.
6169 Mon Nov 19 17:13:24 GMT 2001 Olly Betts <olly@survex.com>
6171 * Added date-range filtering (not fully tested yet).
6173 Mon Nov 19 15:21:31 GMT 2001 Olly Betts <olly@survex.com>
6175 * Fixed (c) message displayed by -v (BrightStation "PLC" not "Inc.",
6178 Fri Nov 16 11:49:20 GMT 2001 Olly Betts <olly@survex.com>
6180 * New OmegaScript commands: $allterms{<docid>}, $freq{<term>},
6181 $nice{<number>}, $set_relevant{<docid>}.
6183 * $map{} now returns a list (shouldn't affect most users - if
6184 the extra tabs are a problem, change `$map{...}' to
6185 `$list{$map{...},}' ).
6187 * Template `query' now preserves value of THRESHOLD.
6189 * Template `godmode' fixed to actually work.
6191 Wed Nov 14 15:04:13 GMT 2001 Olly Betts <olly@survex.com>
6193 * Fixed to compile with GCC3.0
6195 Wed Nov 14 14:54:53 GMT 2001 Olly Betts <olly@survex.com>
6197 * Updated for changes to OmQuery
6199 Tue Nov 06 13:10:15 GMT 2001 Olly Betts <olly@survex.com>
6201 * Updated .cvsignore.
6203 Tue Nov 06 13:02:04 GMT 2001 Olly Betts <olly@survex.com>
6205 * Fixed lookup of CGI parameter THRESHOLD.
6207 Tue Nov 6 12:38:37 GMT 2001 Richard Boulton <richard@tartarus.org>
6209 * Moved configure.ac to configure.in: depending on autoconf 2.13 is
6212 Tue Nov 06 12:23:55 GMT 2001 Olly Betts <olly@survex.com>
6214 * Added support for percentage threshold cutoff (CGI var THRESHOLD);
6215 Code for calculating better percentages has been pushed into Xapian
6216 so removed it from here.
6218 Mon Nov 5 12:42:26 GMT 2001 Richard Boulton <richard@tartarus.org>
6220 * Omega moved to new home, from om-examples/omega.
6221 Standalone build system added.