9575a5f4fe
You've been warned :-). This patch contains a more complete mapping of UTF-8 to ASCII. The mappings are based on those compiled by Richard Mahoney on the Zebra list: http://lists.indexdata.dk/pipermail/zebralist/2007-August/001707.html Note to documentation team: we need an area in the documentation that discusses how Koha handles searches and indexing for words that contain diacritics, such as E-ACUTE (vs E without an acute). If you can paste this list of mappings from this patch directly into the docs and it preserves the encoding that would be great. NOTE: I don't think this patch addresses issues of combining vs non-combining forms, and may require a refactor to address that. Josh |
||
---|---|---|
.. | ||
default.idx | ||
explain.abs | ||
explain.att | ||
explain.tag | ||
gils.att | ||
numeric.chr | ||
passwd | ||
tagsetm.tag | ||
urx.chr | ||
usmarc.mar | ||
word-phrase-utf.chr |