96 commits

Author SHA1 Message Date
Galen Charlton
715aee6b95 bug 3223: add 500 and 590 to MARC21 bib Note index (Zebra)
Note: to completely apply this change, ensure that the working
copy of record.abs is updated and rebuild the bib indexes
using rebuild_zebra.pl -b -x -r

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-05-14 06:59:57 -05:00
Nahuel ANGELINETTI
07ec17d02f add windows quote to ignored chars
This patch adds the CP1252 "quote" (’) to the ignored chars in the Zebra configuration.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-05-11 10:49:58 -05:00
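As a rough illustration of commit 07ec17d02f: the CP1252 "quote" is byte 0x92, which decodes to U+2019. The sketch below assumes that "ignored" means the character is dropped before matching; `IGNORED_CHARS` and `strip_ignored` are hypothetical names, and the real setting lives in the Zebra configuration rather than in Python.

```python
# Byte 0x92 in Windows-1252 is the typographic apostrophe U+2019.
cp1252_quote = b"\x92".decode("cp1252")

# Hypothetical set of characters to drop before matching (assumption:
# "ignored" means removed). The real list is in the Zebra configuration.
IGNORED_CHARS = {"'", cp1252_quote}


def strip_ignored(text: str) -> str:
    """Return text with every ignored character removed."""
    return "".join(ch for ch in text if ch not in IGNORED_CHARS)
```

With both quote forms ignored, a term typed with a plain apostrophe and one pasted from a Windows document reduce to the same string.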
Galen Charlton
9afcab3aef change DOS newlines to Unix newlines in word-phrase-utf.chr
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-05-11 10:49:48 -05:00
Henri-Damien LAURENT
bb8e1bddb1 Adding some null words and mappings for Œ œ
(cherry picked from commit e6e27bc80a)

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-05-11 10:38:54 -05:00
Nahuel ANGELINETTI
5bfee71513 (bug #3190) allow right truncation in lot of records
This patch just adds an option to zebra-biblios.cfg that allows right-truncated requests against a large number of records.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-05-08 11:09:14 -05:00
Michael Hafen
a4f1283093 Add items.itemnotes to the Notes/Comments search in Marc21
This uncomments the 952$z and assigns it to the Notes index in
marc21

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-04-24 22:16:14 -05:00
Henri-Damien LAURENT
9f1b24c964 Improving UNIMARC(A) compliance
Adding some fields to index.
Adding also some indexes in order to be able to query specific fields.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-04-24 21:59:13 -05:00
Henri-Damien LAURENT
40ab51d8f7 Bug Fixing : searching on mainmainentry was disabled
Adding Heading-Main as a new index code in order to search only on Heading-main when $a is selected.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-04-24 21:59:04 -05:00
Henri-Damien LAURENT
e7282020f7 Restoring startsby search on authorities.
[RM note: fixes bug 2792]

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-04-24 20:56:58 -05:00
J. David Bavousett
d605af2d8b Changes to installed indexing.
These changes tidy up ISBN and ISSN indexing, per Michele Maenpaa.  It's being
set up manually on many new installations, and probably ought to become part of the default
Koha installation.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-04-03 19:18:39 -05:00
Joe Atzberger
13665732e8 Bug 2958 - notforloan (8008) is a numeric field.
Update index to allow searching for "On Order" items, -1 value.
Example:
http://atz.dev.kohalibrary.com/cgi-bin/koha/opac-search.pl?q=ccl=notforloan:-1

Without this patch, search for "-1" is normalized to "1".

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-03-27 18:33:35 -05:00
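The effect described in commit 13665732e8 can be sketched as follows. This is only an illustration under stated assumptions: `normalize_as_word` and `normalize_as_number` are hypothetical helpers, not Zebra functions, and the word-style rule (drop everything except letters and digits) is an assumed stand-in for whatever normalization the index applied before the patch.

```python
import re


def normalize_as_word(term: str) -> str:
    """Assumed word-style normalization: keep only ASCII letters and digits."""
    return re.sub(r"[^0-9a-z]", "", term.lower())


def normalize_as_number(term: str) -> str:
    """Numeric handling: parse as an integer so the sign survives."""
    return str(int(term))
```

Under the word-style rule the minus sign is treated as punctuation and lost, which matches the symptom in the commit message; treating the field as numeric keeps "-1" distinct from "1".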
Henri-Damien LAURENT
66c3b26579 Bug Fix : weight field was broken in UNIMARC
Title-cover was not defined in record.abs
So the relevance ranking was broken.
This patch corrects that
For UNIMARC people, please reindex

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-03-27 18:33:31 -05:00
Ryan Higgins
aaa6a263b8 Add mc-loc definition for ccl.properties.
Advanced search limit by shelving location doesn't work due to
missing ccl definition in default installation.  Once updated,
the zebradb will need to be reindexed.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-03-06 12:53:21 -06:00
Galen Charlton
be82512f01 bug 2986: tweak authtype index for MARC21 (GRS-1)
Following a similar patch for UNIMARC, tweak the
authtype index for MARC21 authorities if the GRS-1
Zebra filter is in use.

Note that it is recommended that *DOM* mode indexing
be used for MARC21 authorities; if you're using DOM mode,
it is not necessary to rebuild the index.  However, if
you're using the GRS-1 definitions (record.abs), it will
be necessary to reindex the authority records using

misc/migration_tools/rebuild_zebra.pl -a -r

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-03-04 08:19:34 -06:00
259c77ed9b Fix bug #2986 - Authority search
On the authorities-home.pl page, when you do a search, you don't get any
results. Looking in the log file, you see a Zebra error:

  Unsupported Use attribute (114) authtype Bib-1

This patch modifies the UNIMARC record.abs definition.
The same may have to be done for MARC21 record.abs.

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2009-03-04 08:07:32 -06:00
Henri-Damien LAURENT
9e63271c8a Bug 2790 no UNIMARC search on avail items [1/2]
Changing record.abs file to add the management of acquisition date,
modification date and lost

Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
2008-11-19 15:45:42 -06:00
Paul POULAIN
d8f115213d dealing with L' and D' in french
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-08-04 08:12:53 -05:00
Joshua Ferraro
9575a5f4fe UTF-8 to ASCII MAPPINGS -- WARNING: REINDEX REQUIRED
You've been warned :-). This patch contains a more
complete mapping of UTF-8 to ASCII. The mappings are
based on those compiled by Richard Mahoney on the
Zebra list: http://lists.indexdata.dk/pipermail/zebralist/2007-August/001707.html

Note to documentation team: we need an area in the
documentation that discusses how Koha handles searches
and indexing for words that contain diacritics, such
as E-ACUTE (vs E without an acute). If you can paste
this list of mappings from this patch directly into
the docs and it preserves the encoding that would be
great.

NOTE: I don't think this patch addresses issues of
combining vs non-combining forms, and may require
a refactor to address that.

Josh
2008-07-23 09:49:57 -05:00
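The combining versus non-combining concern raised in commit 9575a5f4fe can be illustrated with Python's standard `unicodedata` module. This is a sketch of the general idea, not the mapping table from the patch: `strip_diacritics` is a hypothetical helper that decomposes each character and discards combining marks.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Decompose characters (NFKD), drop combining marks, and lowercase."""
    decomposed = unicodedata.normalize("NFKD", text)
    kept = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return kept.lower()


precomposed = "\u00e9"   # E-ACUTE as a single code point
combining = "e\u0301"    # plain e followed by COMBINING ACUTE ACCENT
```

The two spellings of E-ACUTE are different strings, so a table that lists only the precomposed form would miss the combining one; normalizing first makes both reduce to a plain "e". Ligatures such as Œ have no Unicode decomposition, so an explicit table-driven mapping is still needed for them.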
Joshua Ferraro
085f329d7a Fix for bug 2342: Searching on more than 1 itemtype breaks
The problem was that the 'mc-' was removed from the checkboxes a while back and
that's what triggers the automatic application of OR boolean searching. I've
added it back to the templates and modified the ccl.properties file to include
mapping for itype,itemtype and ccode
2008-07-12 09:09:14 -05:00
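The role of the 'mc-' prefix in commit 085f329d7a can be sketched like this. The function below is hypothetical and is not Koha's search code; it only assumes that limits whose index name starts with 'mc-' are grouped with OR while other limits are combined with AND. The index names and values in the usage note are illustrative.

```python
def build_limit_clause(limits):
    """Combine (index, value) limits into one query clause.

    Assumption: indexes prefixed with 'mc-' are ORed together as a group;
    every other limit is ANDed.
    """
    or_terms = [f"{idx}:{val}" for idx, val in limits if idx.startswith("mc-")]
    and_terms = [f"{idx}:{val}" for idx, val in limits if not idx.startswith("mc-")]
    clauses = []
    if or_terms:
        clauses.append("(" + " or ".join(or_terms) + ")")
    clauses.extend(and_terms)
    return " and ".join(clauses)
```

For example, two checked item types passed as `("mc-itype", "BK")` and `("mc-itype", "DVD")` come out as one OR group, whereas the same limits without the prefix would be ANDed and match nothing, since an item has only one item type.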
Paul POULAIN
90af25237a fix to deal with 0088 & 0089 chars during indexing
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-07-08 14:52:50 -05:00
Joshua Ferraro
89e12081c1 Fix for bug 2260: UNIMARC misconfiguration on itemtype
WARNING: THIS PATCH REQUIRES A REINDEX FOR UNIMARC RECORDS
2008-06-19 08:19:37 -05:00
Paul POULAIN
f01eb0cd7a (UNIMARC) index only 101$a as language
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-06-12 12:09:21 -05:00
Paul POULAIN
6555a9e9f3 few changes in UNIMARC record.abs
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-06-05 22:46:12 -05:00
Henri-Damien LAURENT
7e2e894bb3 removing autotruncate for authors.
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-05-30 13:52:13 -05:00
Paul POULAIN
94a3a247f1 small fix for UNIMARC 600$a that is also subject
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-05-27 07:37:19 -05:00
Joshua Ferraro
0808f56f84 fixing anonymous permissions access to zebra via Z39.50
and SRU
2008-04-24 13:38:21 -05:00
Henri-Damien LAURENT
6b47d3425d Minor indexing bug : Title series indexed as Title
410$t is now indexed as Title-series UNIMARC

Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-04-24 11:51:49 -05:00
Galen Charlton
49a33fc4bc zebra conf: add reference to password file
Note: currently only zebraqueue_daemon.pl is known
to use the extended services that require the
Zebra r/w password.

Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-04-21 18:17:30 -05:00
Paul POULAIN
d7637da3af removing some spaces in record.abs, that make the file invalid for Zebra
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-04-14 23:46:58 -05:00
Paul POULAIN
9056f9c84f adding Ç as equivalent to c for frenchies
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-04-07 15:56:28 -05:00
paul
18d07f6d2b some fixes to unimarc authorities ordering
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-04-07 15:56:27 -05:00
Joshua Ferraro
f85bb6d49a adding title indexes for 780
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-03-20 08:41:08 -05:00
Joshua Ferraro
dc4a9b09eb fixing where the 505 subfields are indexed (before the field) in record.abs
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-03-18 10:39:58 -05:00
Joshua Ferraro
1b558dfe64 fix for bug 1925: 505 Field Indexing
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-03-17 19:27:16 -05:00
Galen Charlton
c767ec12b3 Zebra: added missing urx.chr
The following error messages in the Zebra
log should no longer appear:

06:10:25-04/03 zebrasrv(1) [warn] Failed to read character table urx.chr
06:10:25-04/03 zebrasrv(1) [warn] urx.chr [No such file or directory]

To fully install this patch, do a
'make update_zebra_conf'.

Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-03-05 08:13:54 -06:00
Galen Charlton
2247eee155 DOM authority indexing fixes (partial fix for 1906)
To fully install this patch, the following steps
are required:

1. perl Makefile.PL
2. make
3. make update_zebra_conf
4. restart zebrasrv
5. reindex authorities using rebuild_zebra.pl -a -r

Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-03-04 04:47:53 -06:00
Galen Charlton
ebeb3989ad DOM authority indexing: fix batch and online indexing
To fully install this patch, the following steps are
necessary:

1. perl Makefile.PL
2. make
3. make update_zebra_conf (or make upgrade)

Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-03-04 04:47:50 -06:00
Galen Charlton
741c10d911 authorities -- added CCL indexes for heading matching
Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-08 05:48:56 -06:00
Galen Charlton
6a26bcf517 authorities indexing: qualify indexes with ":w"
Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-08 05:48:54 -06:00
Galen Charlton
32cf2af700 authorities indexing - MAJOR changes
For DOM indexing, added index_matching_heading option
to create indexes for matching an entire authority
heading -- the index works by indexing a heading
such

150 $aCars$xElectric$zEngland$vScience fiction

as something like

"cars generalsubdiv electric geographicsubdiv england
formsubdiv science fiction"

Also started adjusting names of some indexes to conform
to the language used in the MARC21 and UNIMARC standards, e.g.,

"See" => "See-from"
"See-also" => "See-also-from"
"Conference-name-heading" => "Meeting-name-heading"

Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-08 05:48:52 -06:00
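The heading-matching string described in commit 32cf2af700 can be sketched in a few lines. The log indicates the real implementation is XSLT for Zebra's DOM filter, so this Python version only mirrors the example given there. `parse_subfields`, `heading_match_string`, and `SUBDIVISION_LABELS` are hypothetical names, and only the three subdivision labels that appear in the commit message are included.

```python
# Labels taken from the example in the commit message; other subfield
# codes are left unlabeled here because the message does not list them.
SUBDIVISION_LABELS = {
    "x": "generalsubdiv",
    "z": "geographicsubdiv",
    "v": "formsubdiv",
}


def parse_subfields(field: str):
    """Split '$aCars$xElectric' into [('a', 'Cars'), ('x', 'Electric')]."""
    return [(chunk[0], chunk[1:]) for chunk in field.split("$")[1:] if chunk]


def heading_match_string(subfields):
    """Build one lowercase phrase, inserting a label before each subdivision."""
    parts = []
    for code, value in subfields:
        label = SUBDIVISION_LABELS.get(code)
        if label:
            parts.append(label)
        parts.append(value.lower())
    return " ".join(parts)
```

Because the labels sit between the values, a heading with England as a geographic subdivision produces a different phrase from one where England is a general subdivision, which is what makes whole-heading matching possible.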
Galen Charlton
1c0401e867 authorities - enabled DOM indexing
All new authority features will be based on the DOM indexing.

To update an existing installation, do the following:

[1] run perl Makefile.PL
[2] make
[3] make update_zebra_conf
[4] copy the new koha-conf.xml to $KOHA_CONF

Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-08 05:48:51 -06:00
Galen Charlton
f9f246cb1e authorities: changed extension of authority-zebra-indexdefs.xml
Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-03 07:22:12 -06:00
Galen Charlton
cf8c3a84ca authorities: start of work on reindexing
Currently, MARC authorities are indexed (assuming Zebra
is used) with Zebra's GRS-1 module.  However, it does
not appear to be possible to index phrases that cross
subfield boundaries using the GRS-1 module's record.abs
config file's melm, elm, and xelm directives.

Since it is necessary to be able to efficiently search
an entire authority heading (e.g., to see if a given
bib heading is authorized), I'm proposing a switch
to Zebra's DOM XML filter module, which uses XSLT
to generate the words and phrases to be indexed from the
original MARC XML (or ISO2709) record.

The file authority-zebra-indexdefs.xml is an XSLT stylesheet
to implement the new indexing regime.  It is based on the
MARC21 authority record.abs with the following changes:

  * addition of 148/448/548
  * changed name of "see" indexes to "see-from"
  * changed name of "see-also" indexes to "see-also-from"
  * added index on the subject thesaurus based on
    the 008/11 and 040$f
  * added indexes on the full heading

authority-zebra-indexdefs.xml was generated from
authority-koha-indexdefs.xml via the XSL transform
koha-indexdefs-to-zebra.xsl.  authority-koha-indexdefs.xml
is the actual master version of the indexing definitions,
and was created to provide a much more compact syntax
over the raw XSLT that is to be passed to Zebra.

An experimental schema for Koha indexing definitions is
under way; my aim is to propose a simple format that can
be readily worked with, and perhaps even generated as
a serialization of indexing definitions that are set up
via administration settings in the Koha database itself.

Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-03 07:22:06 -06:00
Galen Charlton
8340c478fa start of big MARC21 authorities work
* Defined local field 942$a to store the authority type
  for MARC21 instead of 152$b
* Added 942$b to MARC21 authority framework.
* Added auth_header.authid and auth_header.authtypecode
  to appropriate subfields in MARC21 authority framework.
* Started work on two new modules:
    C4::AuthoritiesMarc::MARC21
    C4::AuthoritiesMarc::UNIMARC
  These modules will be used to extract MARC-format-specific
  behavior out of C4::AuthoritiesMarc
* Updated Zebra config for MARC21 to use only the 942$a
  for the authority type.
* For MARC21, added logic to move 152$b to 942$a for
  existing authority records.  Specifically, AddAuthority
  now does this move when a record is saved, while
  GetAuthority and GetAuthorityXML do this when
  extracting a record for other use.  This logic
  is temporary, and can hopefully be removed later, once
  use of 152$b in MARC21 authorities is confirmed to be
  absent for Koha users.  I will also create a batch
  job to do this update in one fell swoop.

Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-04 18:42:40 -06:00
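The temporary 152$b to 942$a move described in commit 8340c478fa can be sketched as below. This is not the code in C4::AuthoritiesMarc. The record is modeled as a plain dict of tag to subfield dict, which ignores repeatable fields, and `move_authtype` is a hypothetical name. Keeping an existing 942$a rather than overwriting it is an assumption.

```python
def move_authtype(record):
    """Return a copy of record with 152$b moved to 942$a.

    record: {tag: {subfield_code: value}}, a simplified stand-in for MARC.
    Assumption: an existing 942$a wins over the legacy 152$b value.
    """
    updated = {tag: dict(subs) for tag, subs in record.items()}
    legacy = updated.get("152", {}).pop("b", None)
    if legacy is not None:
        updated.setdefault("942", {}).setdefault("a", legacy)
        if not updated["152"]:
            del updated["152"]  # drop the field once it is empty
    return updated
```

The function is idempotent, so it can run both when a record is saved and when it is read, as the commit describes, without changing records that were already migrated.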
Joshua Ferraro
bbd043f155 adding three new variables for installation:
'ZEBRA_SRU_HOST'    => 'localhost',
  'ZEBRA_SRU_BIBLIOS_PORT'    => '9998',
  'ZEBRA_SRU_AUTHORITIES_PORT'    => '9999',

Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-03 21:28:27 -06:00
Joshua Ferraro
c6c82fb2a5 Fix Genre-form and Subject-topical for MARC21
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-03 08:28:04 -06:00
Joshua Ferraro
6ba5ddd76e fixing a couple mappings for SRU CQL server
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-03 03:01:14 -06:00
Joshua Ferraro
6d924e69ab s/__DB_HOST__/__WEBSERVER_HOST__/
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-03 02:10:13 -06:00
Paul POULAIN
5dc5967801 synch'ing marc21 and unimarc where applicable
Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-03 00:55:11 -06:00
Joshua Ferraro
aabea3417b WARNING: Updates to Index Defs for Authorities
This commit is a partial fix to authority control for MARC21,
and better authority handling in general (for UNIMARC too).
Before this patch, authority searching, editing, saving, was
not functioning, or was extremely buggy.

WARNING: You will need to re-index your authority database after
applying this commit.

The following changes have been made:

  * Normalizing record.abs index names (in both MARC21 and UNIMARC)
  * Synching authorities/bib1.att, ccl.properties, AuthoritiesMarc.pm
    with new indexes (UNIMARC too)
  * Clean up biblios/bib1.att (remove duplicate att defs)
  * Clean up authorities-* templates to conform to new styles
  * Fixed search failure when using Default framework (now searches
    All)

Also included are several fixes to the built-in SRU server for
Authority and Biblio; it's recommended that you update your
koha-conf.xml file:

  * adding explain-authorities.xml and explain-biblios.xml
  * adding necessary info to koha-conf.xml to enable SRU/W
  * adding several example XSLT stylesheets, that can be used
    for SRU on-the-fly transformations (to MODS, DC, RDF, etc.)

Still remaining for 3.0 are the following tasks:

  * update MARC21 frameworks (authority and cross-reference bib)
  * update display code/templates in authority results list
  * update search code/templates to utilize index points
  * implement 'grouping' of authtypes for searching (Name, Title, Subject)
  * repair utility to import auths and perform matching
  * repair bibliographic import to match auths and warn if no match

Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-01-03 00:28:40 -06:00