Bug 17661: (follow-up) Update regex to support Unicode characters
authorNick Clemens <nick@bywatersolutions.com>
Thu, 27 Aug 2020 16:27:09 +0000 (16:27 +0000)
committerJonathan Druart <jonathan.druart@bugs.koha-community.org>
Mon, 31 Aug 2020 14:10:25 +0000 (16:10 +0200)
commit224ac84aeca3b8dba87366925d83b01e0f5c1110
tree5ef9be7c9c463912037dd273b5dbc752409c5272
parente34f95a1f5eb0fce238ab442553bec5233263a9c
Bug 17661: (follow-up) Update regex to support Unicode characters

Rather than limiting initials to [A-Z] we should test for a broad
range of uppercase letters.

The ES/Zebra changes are slightly different because of Perl vs Java regex
conventions. POerl may support either, but I found 'Uppercase' to be a bit more explicit

More info here:
https://perldoc.perl.org/perlunicode.html

TO test:
Same plan as before but use Ж. as the ending initial
Confirm the period is preserved and other punctuation removed

Signed-off-by: Katrin Fischer <katrin.fischer.83@web.de>
Signed-off-by: Jonathan Druart <jonathan.druart@bugs.koha-community.org>
C4/Search.pm
admin/searchengine/elasticsearch/index_config.yaml