Bug 17661: (follow-up) Update regex to support Unicode characters
author Nick Clemens <nick@bywatersolutions.com>
Thu, 27 Aug 2020 16:27:09 +0000 (16:27 +0000)
committer Aleisha Amohia <aleishaamohia@hotmail.com>
Wed, 9 Sep 2020 10:04:23 +0000 (22:04 +1200)
commit fdf5fd0fc20c847ae059580c5fcb9cf3cee61056
tree 190294e77d448a2a1909b724eaf7825a83ca2059
parent 4506ca63e39a682af6c57c9c844d36ec7dcef0f4

Rather than limiting initials to [A-Z] we should test for a broad
range of uppercase letters.

The ES/Zebra changes are slightly different because of Perl vs Java regex
conventions. Perl may support either, but I found 'Uppercase' to be a bit more explicit.

More info here:
https://perldoc.perl.org/perlunicode.html

To test:
Same plan as before but use Ж. as the ending initial
Confirm the period is preserved and other punctuation removed
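
The behaviour the test plan checks can be sketched in Java, the regex
flavour used on the ES side. This is only an illustration under stated
assumptions, not the code from this patch: the class name, the method
name, and both patterns are hypothetical, and the real expressions in
C4/Search.pm and index_config.yaml may differ. It uses the Unicode
category \p{Lu} (uppercase letter) in place of [A-Z] so that a period
after a non-Latin initial such as Ж survives.

```java
// Hypothetical sketch: strip trailing punctuation but keep the period
// that follows an uppercase initial in any script.
class TrailingPunctuation {

    static String strip(String heading) {
        // Step 1: drop trailing whitespace, commas, semicolons, colons, slashes.
        String trimmed = heading.replaceAll("[\\s,;:/]+$", "");
        // Step 2: drop a final period unless an uppercase letter precedes it.
        // \p{Lu} covers uppercase letters beyond A-Z, for example Cyrillic.
        return trimmed.replaceAll("(?<!\\p{Lu})\\.$", "");
    }

    public static void main(String[] args) {
        // \u0416 is the Cyrillic capital letter Ж from the test plan.
        System.out.println(strip("Ivanov, \u0416. /"));  // period kept
        System.out.println(strip("Smith, J.,"));         // period kept
        System.out.println(strip("Title of book. /"));   // period removed
    }
}
```

With [A-Z] in the lookbehind instead of \p{Lu}, the first call would lose
its final period, which is the case this follow-up addresses.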

Signed-off-by: Katrin Fischer <katrin.fischer.83@web.de>
Signed-off-by: Jonathan Druart <jonathan.druart@bugs.koha-community.org>
(cherry picked from commit 224ac84aeca3b8dba87366925d83b01e0f5c1110)

Signed-off-by: Lucas Gass <lucas@bywatersolutions.com>
(cherry picked from commit f297fb819c38553899f2cde10b108ca8fa712dce)

Signed-off-by: Aleisha Amohia <aleishaamohia@hotmail.com>
C4/Search.pm
admin/searchengine/elasticsearch/index_config.yaml