Koha/etc/zebradb
Mathieu Saby b6118db2f5 Bug 11202: Improve UNIMARC biblio indexing
This patch makes the following changes to UNIMARC biblio indexing :
A. Changes to UNIMARC conf files
1. add comments to biblio-koha-indexdefs.xml
2. make biblio-koha-indexdefs.xml more compact by grouping some
   declarations
   Ex : 200$f and 200$g => one declaration for 200$fg
3. suppress unneeded declarations (indexing of some 4XX fields and 6XX
   fields not in unimarc format)
4. unindex some (sub)fields unneeded by most users (318, 207,230,210a,
   215, 4XXd)
5. change the way 308 field is indexed (no visible changes)
6. replace Title-host with Host-item -- see bug 11119
7. index 208 in Material-Type -- see bug 11119
8. index 100 pos 8-9 and 9-12 in pubdate:y and pubdate:n
9. index 100 pos 8-9 in pubdate:s instead of 210$d
10. Index all subfields of note 334 and 327 in note index
11. Index 304 and 327 in title index as well as note index
    327 can contain a list of titles included in a work
    304 can contain the title of the original work in case of a
    translation
12. Index 314 in author index as well as note index
    314 can contain authors not mentionned in 200$f/g (the 4th, 5th etc.
    author)
13. Index 328 note in Dissertation-information as well as note
14. Index 328$t in Title

B. Changes to ccl.properties :
1. add a new index Dissertation-information (1056)
2. fix EAN, pubdate and acqdate (they were not linked with bib1 attributes)

C. Changes to Search.pm
1. add Dissertation-information and suppress Title-host and UPC

D. Changes to QP config file queryparser.yaml
1. add Dissertation-information
2 fix EAN, pubdate and acqdate

Test plan :
If you cannot test in GRS1, test only in DOM, as GRS will be deprecated.

1. Apply the patch in a UNIMARC Koha running with DOM and ICU
2. copy src/etc/searchengine/queryparser.yaml into the main config
   directory of QP
3. copy src/etc/zebradb/ccl.properties into the main config directory
   of Zebra
4. copy src/etc/zebradb/marc_defs/unimarc/biblio/* into the main config
   directory of Zebra
5. reindex biblios (rebuild_zebra.pl -r -b -x -v)
6. test note index : make some searches on 334$b or 327$b
7. test author index : make some searches on 314 field
8. test title index : make some searches on 304 and 327 field, make a
   search on 328$t subfield
9. test dissertation-information index : make some searches on 328 field
10. In a record, put in the dates of 100 fields the values "1000" (1st
    date) and "1001" (2d date) ; try to search a book written in year
    1000, you should find the record ; idem for year 1001
11. make some searches and sort by date. It should work better as before,
    especially if you have values like "c2009" or "impr. 2010" in 210
    field
12. Regression test : make some searches on several indexes, like EAN,
    etc. It should work as before

Test 10-12 with and without Queryparser activated.
Be careful: with Queryparser activated, the index names (title,
dissertation-information...) must be entered in lowercase only.
Of course, to test search and sort by dates, you need to have full
records, with dates in 100 field as well as 210 field.

Signed-off-by: Paola Rossi <paola.rossi@cineca.it>
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
2014-02-19 21:01:15 +00:00
..
authorities/etc Bug 7475: Update configuration 2012-09-19 17:15:25 +02:00
biblios/etc Bug 9940: Add a new index for the original language of a document 2013-12-25 15:37:14 +00:00
etc Bug 9256 - Fix search for the packages 2013-03-19 19:34:12 -04:00
lang_defs Bug 10431 - Redundant mappings removed 2013-07-05 06:56:44 -07:00
marc_defs Bug 11202: Improve UNIMARC biblio indexing 2014-02-19 21:01:15 +00:00
xsl Bug 8852: DOM XSL now handles subfield substring extraction 2013-10-10 15:19:20 +00:00
ccl.properties Bug 11202: Improve UNIMARC biblio indexing 2014-02-19 21:01:15 +00:00
cql.properties
explain-authorities.xml
explain-biblios.xml
pqf.properties
retrieval-info-auth-dom.xml Bug 3087 Fix Z39.50 server to return the correct record syntax 2012-10-22 14:12:22 +02:00
retrieval-info-auth-grs1.xml Bug 3087 Fix Z39.50 server to return the correct record syntax 2012-10-22 14:12:22 +02:00
retrieval-info-bib-dom.xml Bug 3087 Fix Z39.50 server to return the correct record syntax 2012-10-22 14:12:22 +02:00
retrieval-info-bib-grs1.xml Bug 3087 Fix Z39.50 server to return the correct record syntax 2012-10-22 14:12:22 +02:00
zebra-authorities-dom.cfg
zebra-authorities.cfg
zebra-biblios-dom.cfg Bug 7818: update installer for biblio DOM indexing 2012-06-09 11:43:56 +02:00
zebra-biblios.cfg Bug 7041: Sort >1000 search results with sortmax parameter in zebra config file 2011-12-03 08:15:34 +01:00