From b11eb03a4c9674f4f4dedadaa8790257e30fb1d0 Mon Sep 17 00:00:00 2001 From: Fridolin Somers Date: Thu, 16 Jul 2015 17:48:14 +0200 Subject: [PATCH] Bug 14542: Transliterate rule for all single quote forms Single quotes in common language (not in programming) are usually ', but there is also the form known as ’ in HTML. See https://fr.wikipedia.org/wiki/Apostrophe_%28typographie%29 This bug proposes to transliterate all forms into a space. Test plan : (I'll use the code ’ instead of the unicode character) - Without the patch - Create a record with title : L’avion d’argile - Index this record - Search for "L’avion d’argile" => You find the record - Search for "L'avion d'argile" => You do not find the record - Apply patch - Search for "L’avion d’argile" => You find the record - Search for "L'avion d'argile" => You find the record - Search for "L avion d argile" => You find the record Signed-off-by: Frederic Demians Signed-off-by: Jonathan Druart Signed-off-by: Tomas Cohen Arazi --- etc/zebradb/etc/words-icu.xml | 2 ++ 1 file changed, 2 insertions(+) diff --git a/etc/zebradb/etc/words-icu.xml b/etc/zebradb/etc/words-icu.xml index 101568845e..2eb3e7be5f 100644 --- a/etc/zebradb/etc/words-icu.xml +++ b/etc/zebradb/etc/words-icu.xml @@ -4,6 +4,8 @@ + + -- 2.39.5