Test this patch with the previous one.
Signed-off-by: Frédéric Demians <f.demians@tamil.fr>
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Brendan Gallagher <brendan@bywatersolutions.com>
The option value given on the command line was never used; 'biblioitems'
was used instead.
Test plan:
1. git checkout master
2. perl misc/migration_tools/rebuild_zebra.pl -b -t items --where "price = 42"
3. You should see errors printed on screen about an unknown column
4. Apply patch
5. perl misc/migration_tools/rebuild_zebra.pl -b -t items --where "price = 42"
6. No errors \o/
Signed-off-by: Frédéric Demians <f.demians@tamil.fr>
Signed-off-by: Jonathan Druart <jonathan.druart@bugs.koha-community.org>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
If the table given as a parameter is not in the whitelist, the script
should die rather than fall back to a default value.
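A minimal sketch of the kind of check intended, in Perl (the option
parsing and message wording here are illustrative, not the script's
actual code):

    use strict;
    use warnings;

    # Hypothetical value parsed from -t|--table on the command line
    my $table = 'items';

    # Accept only whitelisted tables; die instead of silently falling
    # back to 'biblioitems'
    my @whitelist = qw( biblioitems items );
    die "Invalid value '$table' for --table (allowed: @whitelist)\n"
        unless grep { $_ eq $table } @whitelist;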
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@theke.io>
Currently the --where parameter only allows specifying a condition on
fields in the biblioitems table.
For some needs it would be useful to specify a condition on fields in
the items table.
The use case is the following: you want to reindex biblios with items
modified since a specific timestamp.
Test plan:
1/ Pick an item randomly in your catalogue
2/ Edit it and save
3/ Note that the items.timestamp has been set to today but not the
biblioitems.timestamp
4/ launch rebuild_zebra without the new parameter
perl misc/migration_tools/rebuild_zebra.pl -b -v --where
"timestamp >= XXX"
where XXX is today's date (e.g. "2014-06-05 00:00:00").
Note that the biblio has not been indexed.
5/ launch rebuild_zebra using the new parameter:
perl misc/migration_tools/rebuild_zebra.pl -b -v -t items --where
"timestamp >= XXX"
Note the biblio has been indexed.
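A rough sketch of the query this enables; the actual SQL built by
rebuild_zebra.pl may be shaped differently, and the DSN and credentials
are placeholders:

    use strict;
    use warnings;
    use DBI;

    my $dbh = DBI->connect( 'dbi:mysql:koha', 'user', 'pass',
        { RaiseError => 1 } );

    # With -t items, the --where condition applies to the items table
    # and the matching biblionumbers are collected for reindexing
    my $where         = q{timestamp >= '2014-06-05 00:00:00'};
    my $biblionumbers = $dbh->selectcol_arrayref(
        "SELECT DISTINCT biblionumber FROM items WHERE $where");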
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@theke.io>
Koha needs a script to automate the importing of Lexile score data for
titles that have available scores but are not currently in the title's
record.
This script will take a CSV file of Lexile scores, and locate any
matching records in the Koha database ( by ISBN ). If the record already
has a score, it will be updated. If not, the Lexile score field will be
created.
Test Plan:
1) Apply this patch
2) Catalog a record for each of the following ISBNs:
0789170191
9780673779410
3) Download the file LexileTitlesTruncated.txt attached
to this bug report
4) Run the script from the command line:
./misc/migration_tools/import_lexile.pl -v --file /path/to/LexileTitlesTruncated.txt
5) View those records in Koha
6) Note those records now have valid Lexile scores
7) Edit the Lexile score ( 521$a ) and change the value to something else
8) Repeat step 4
9) Note the original Lexile score has been restored
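The update logic amounts to something like the following MARC::Record
sketch (the helper name is made up, and the indicator handling is an
assumption):

    use strict;
    use warnings;
    use MARC::Record;
    use MARC::Field;

    # Hypothetical helper: create or restore the Lexile score in 521$a
    sub set_lexile_score {
        my ( $record, $score ) = @_;
        if ( my $field = $record->field('521') ) {
            $field->update( a => $score );    # existing score: overwrite
        }
        else {
            $record->append_fields(
                MARC::Field->new( '521', ' ', ' ', a => $score ) );
        }
    }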
Signed-off-by: Mirko Tietgen <mirko@abunchofthings.net>
Signed-off-by: Tomas Cohen Arazi <tomascohen@unc.edu.ar>
Most of them were found and fixed using codespell.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Signed-off-by: Jonathan Druart <jonathan.druart@koha-community.org>
Signed-off-by: Tomas Cohen Arazi <tomascohen@theke.io>
Signed-off-by: Chris Nighswonger <cnighswonger@foundations.edu>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
Signed-off-by: Katrin Fischer <katrin.fischer@bsz-bw.de>
http://bugs.koha-community.org/show_bug.cgi?id=9987
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
Add logging of errors.
Signed-off-by: Magnus Enger <magnus@enger.priv.no>
More errors are indeed showing up in the log.
(I took the liberty of changing the commit message a little bit.)
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
A minor QA comment.
::: misc/migration_tools/bulkmarcimport.pl
@@ +271,5 @@
> my ( $error, $results, $totalhits ) = C4::Search::SimpleSearch( $query, 0, 3, [$server] );
> + # changed to warn so able to continue with one broken record
> + if ( defined $error ) {
> + warn "unable to search the database for duplicates : $error";
> + next;
For consistency with the rest of the script, should this perhaps be:
next RECORD;
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
GetFrameworkCode was incorrectly spelt as GetFrameworkcode on line 401.
Signed-off-by: Katrin Fischer <katrin.fischer.83@web.de>
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
At the 23 July development meeting it was decided to formally deprecate
the GRS-1 indexing mode for Zebra. This patch makes the code fall back
to DOM in the remaining places. No behaviour change should be noticed,
as DOM has been the default for a while.
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Passes tests and QA script.
Also checked running Makefile.PL
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
Since nobody is currently working on the Zebra layer introduced by bug
8233, Solr will never work.
Some code was introduced in 3.10 to prove that several search engines
can coexist in Koha, but no help or funding has been found to go ahead.
It is useless to keep this code and maintain an ambiguous situation.
The indexes configuration page could be restored later if someone else
introduces a new search engine into Koha.
Test plan:
Look at the code introduced by bug 8233 and verify all is removed.
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
This patch turns off the AuthoritiesLogging syspref when running the
bulkmarcimport.pl script.
It also temporarily disables the syspref caching which will have
been making the CataloguingLogging handling ineffectual. (That is,
updating the CataloguingLogging syspref in the script wouldn't
have an effect as the original cached value would be used anyway.)
_TEST PLAN_
0) Turn on "AuthoritiesLogging" syspref
1) Load an authority record using bulkmarcimport.pl
2) Note a new Authorities entry in action_logs
3) Apply the patch
4) Repeat Step 1
5) Note that no new entry is made in action_logs
(Bonus points: Do the same thing with CataloguingLogging and a
bibliographic record.)
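A sketch of the approach, assuming C4::Context exposes
disable_syspref_cache and set_preference as the message implies:

    use strict;
    use warnings;
    use C4::Context;

    # Bypass the cache so the change below actually takes effect, then
    # silence authority logging for the duration of the import
    C4::Context->disable_syspref_cache();
    my $saved = C4::Context->preference('AuthoritiesLogging');
    C4::Context->set_preference( 'AuthoritiesLogging', 0 );

    # ... import records here ...

    # Restore the administrator's setting afterwards
    C4::Context->set_preference( 'AuthoritiesLogging', $saved );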
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Tested with biblio and auth imports.
Works as described, no koha-qa errors.
Note: If you begin to load a big file, get impatient and hit ^C, it
seems that the current syspref value is lost...
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Passes tests and QA script.
Patch copies what was already done for the CataloguingLog; no problems found.
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
The initial patch for this bug did not include a specific command line
option for customization. If a module LocalChanges.pm existed, it would
be used without asking.
This patch adds a command line option enabling the customization option
and offering the extra possibility of using another module name. If no file
name is passed, we default to LocalChanges.
Without the -custom option, behavior is as it was.
Also some POD lines are added to document the feature.
Test plan:
[1] Make a LocalChanges.pm in migration_tools. Verify that it is not
used if you do not enable the -cust parameter.
[2] Run the script again with -cust. Verify that it is called now.
[3] Copy LocalChanges.pm to Whatever.pm. Make some change. Run with
-cust Whatever and verify that the new module is used.
[4] Copy Whatever.pm to another dir, make some change. Run with -cust and the
full name. Verify that the latest change was used.
[5] Run without any option. Check the pod documentation.
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch makes two adjustments:
[1] For the verbose option, verbose level 2 now means print the
formatted version of each record.
[2] If a module LocalChanges.pm is found in misc/migration_tools, the
routine "customize" in this module is called for each marc record.
This allows you to make local changes to these marc records before
importing them.
Test plan:
[1] Test the verbose option: a single -v for medium verbosity and two
-v to dump a human-readable version of the record to standard output.
(Do not yet copy LocalChanges.pm in the folder.)
You may use the attached example file on Bugzilla:
perl misc/migration_tools/bulkmarcimport.pl -file zztest01.xml -v -v -b -m XML -t | more
Note the option t for test; no records will be imported.
[2] Copy LocalChanges.pm in the migration_tools folder. You may use the
example provided on Bugzilla (in a patch). If you use the example module,
check the contents of 001, 005 and 590 fields. (The -v -v option allows
you to easily check that.)
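For illustration, a minimal LocalChanges.pm; the only interface the
patch describes is a "customize" routine called with each MARC record,
so the body below is an invented example:

    package LocalChanges;

    use strict;
    use warnings;
    use MARC::Field;

    # bulkmarcimport.pl calls this for every record before importing it
    sub customize {
        my ($record) = @_;
        # Example local change: add a 590 note marking the import
        $record->append_fields(
            MARC::Field->new( '590', ' ', ' ', a => 'Locally customized' ) );
        return $record;
    }

    1;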
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch expands and reformats the help text displayed
when running remove_unused_authorities.pl -h.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
remove_unused_authorities.pl previously required that --aut be supplied
to specify one or more authority types to check for unlinked authority
records. If --aut was omitted, it would default to searching for
records of authority type NC, which is not present in many (or any?)
Koha databases.
Now, if --aut is omitted, unlinked authority records of any type
are removed.
To test it:
Parse only PERSO_NAME authorities:
misc/migration_tools/remove_unused_authorities.pl -aut PERSO_NAME
Parse all authorities:
misc/migration_tools/remove_unused_authorities.pl
Signed-off-by: Nicolas Legrand <nicolas.legrand@bulac.fr>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Magnus Enger <digitalutvikling@gmail.com>
Keeps current behaviour as default.
The -append option is described in the POD and works as expected.
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Works as described.
Adding a date/time to the output might
be good, to make it easier to find the entry you were looking for.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch improves rebuild_zebra.pl's usage help
by explaining when --skip-deletes should be considered
and noting that it should be used in conjunction with
a cronjob to process deletions after hours.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
It seems that record deletions can cause extreme slowdowns for Koha
installations with extremely large numbers of records. It would be
helpful to be able to skip record deletions when processing the
zebraqueue with rebuild_zebra.pl so the deletions can be processed with
a lower frequency.
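In zebraqueue terms the idea is roughly the following (the real query
in rebuild_zebra.pl may differ; connection details are placeholders):

    use strict;
    use warnings;
    use DBI;

    my $dbh = DBI->connect( 'dbi:mysql:koha', 'user', 'pass',
        { RaiseError => 1 } );
    my $skip_deletes = 1;    # --skip-deletes

    # Leave recordDelete entries at done = 0 so a lower-frequency
    # cronjob can process them later
    my $sql = q{SELECT id, biblio_auth_number, operation
                FROM zebraqueue WHERE done = 0};
    $sql .= q{ AND operation <> 'recordDelete'} if $skip_deletes;
    my $entries = $dbh->selectall_arrayref( $sql, { Slice => {} } );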
Test Plan:
1) Disable any zebra indexing cronjobs you may have
2) Delete a record
3) Note the operation recordDelete in the zebraqueue table having done = 0
4) Run misc/migration_tools/rebuild_zebra.pl -b -z --skip-deletes
5) Note the delete still has done = 0
6) Run misc/migration_tools/rebuild_zebra.pl -b -z
7) Note the delete now has done = 1
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Passes all tests and QA script.
Also tested for authorities, no problems found.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
RM note: this is at best a work-around, and I will emphasize that
--skip-deletes should be used only when absolutely necessary.
I hope that --skip-deletes can go away at some point soon, but
that may depend on changes to Zebra.
- fix a couple of typos in comments
- replace a "$i" with a more descriptive variable name
- style some of the new code
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
The original patch creates a lockfile in the ZEBRA_LOCKDIR.
It can fall back to /var/lock or even /tmp.
If the create fails, it dies. This can be considered very exceptional.
This follow-up adjusts the fallback location in /var/lock or /tmp
slightly. It appends the database name to the folder in order to
prevent interference between multiple Koha instances. Creation of the
lockfile has been moved to a subroutine, extending directory and file
creation testing.
In the very unlikely case that we cannot create the lockfile (after
three separate tries), this follow-up allows you to continue instead
of dying. This is just as we did before we had file locking here.
Skipping a reindex every time could cause more harm than continuing
and hitting the race condition once in a while.
Test plan:
Test adding and removing lockdir from your koha-conf.xml. Check fallback.
Note that fallback in /var/lock or /tmp must contain database name.
Remove the lockdir config line and remove permissions from fallback. In
this case the reindex should continue but with a warning.
Signed-off-by: Marcel de Rooy <m.de.rooy@rijksmuseum.nl>
Tested with daemon and one-off invocation simultaneously.
Tested new wait parameter.
Tried all variations of lock directory (changing permissions etc.)
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch adds locking to rebuild_zebra.pl to ensure that simultaneous
changes are prevented (as one is likely to overwrite the other).
Incremental updates in daemon mode will be skipped if the lock is busy,
and they will be picked up on the next pass. Non-daemon mode
invocations will also exit immediately if they cannot get the lock,
unless the new flag -wait-for-lock is specified, in which case they
will wait until they get the lock and then proceed.
Supporting changes made to Makefile.PL and templates for the new
locking directory (paralleling the other zebra lock directories).
We stash the zebra_lockdir in koha-conf.xml so rebuild_zebra.pl
can find it.
To address earlier QA concerns we:
1. added code to check if flock is available and ignore locking if
it's missing (from M. de Rooy)
2. changed default for adhoc invocations to abort if they cannot
obtain the lock. Added option -wait-for-lock if the user prefers
to wait until the lock is free, and then continue processing.
3. added missing entry to t/db_dependent/zebra_config.pl
4. added a fallback locking directory of /tmp
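The locking itself boils down to something like this sketch (the path
and option handling are simplified):

    use strict;
    use warnings;
    use Fcntl qw( :flock );

    my $lockfile      = '/var/lock/koha/rebuild_zebra.lock';  # illustrative
    my $wait_for_lock = 0;    # set by -wait-for-lock

    open my $lock_fh, '>', $lockfile or die "Cannot open $lockfile: $!";
    my $flags = $wait_for_lock ? LOCK_EX : LOCK_EX | LOCK_NB;
    flock( $lock_fh, $flags )
        or die "Another rebuild_zebra.pl holds the lock, aborting\n";

    # ... export and index ...

    close $lock_fh;    # releases the lock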
Signed-off-by: Marcel de Rooy <m.de.rooy@rijksmuseum.nl>
Doug merged the original patch with the QA changes.
Just for the record, noting here that the original patch was tested
extensively too by Martin Renvoize.
I have added a followup for some exceptional cases.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch makes Koha <-> Zebra use MARCXML for the serialization when
using DOM, and USMARC for GRS-1.
* The following functions are modified to set the Zebra record syntax
according to the current sysprefs and configuration:
- C4::Context->Zconn
- C4::Context->_new_Zconn
* A new function 'new_record_from_zebra' is introduced, which checks the
context we are in, and creates the MARC::Record object using the right
constructor.
The following packages get touched to make use of the new function:
- C4::Search
- C4::AuthoritiesMarc
and the same happens to the UI scripts that make use of them (both in
the OPAC and STAFF interfaces).
* Calls to the unsafe ZOOM::Record->render()[1] method are removed.
Due to this last change, the code for building facets was rewritten.
For performance of the facets creation I pushed higher version
dependencies for MARC::File::XML and MARC::Record (we rely on
MARC::Field->as_string).
* Calls to MARC::Record->new_from_xml and MARC::Record->new_from_usmarc
are wrapped with eval for catching problems [2].
* As of bug 3087, UNIMARC uses the 'unimarc' record syntax. This case
is correctly handled.
* As of bug 7818 misc/migration_tools/rebuild_zebra.pl behaves like:
- bib_index_mode (defaults to 'grs1' if not specified)
- auth_index_mode (defaults to 'dom')
here we do exactly the same.
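A rough sketch of what new_record_from_zebra might look like; the
index-mode lookup is reduced to a variable here:

    use strict;
    use warnings;
    use MARC::Record;
    use MARC::File::XML;    # provides MARC::Record->new_from_xml

    sub new_record_from_zebra {
        my ( $server, $raw ) = @_;
        my $index_mode = 'dom';    # would be read from the configuration
        my $record     = eval {
            $index_mode eq 'dom'
                ? MARC::Record->new_from_xml($raw)
                : MARC::Record->new_from_usmarc($raw);
        };
        if ( $@ || !$record ) {
            warn "Skipping record that cannot be parsed: $@";    # see [2]
            return;
        }
        return $record;
    }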
To test:
- prove t/db_dependent/Search.t should pass.
- Searching should remain functional.
- Indexing and searching for a big record should work (that's what the
unit tests do).
- Test an index scan search (on the staff interface):
Search > More options > Check "Scan indexes".
- Enable 'itemBarcodeFallbackSearch' and try to circulate any word, it
shouldn't break.
- Searching for a biblio in a new subscription shouldn't break.
- Running bulkmarcimport.pl shouldn't break.
- And so on... for the rest of the .pl files.
[1] http://search.cpan.org/~mirk/Net-Z3950-ZOOM/lib/ZOOM.pod#render()
[2] a record that cannot be parsed by MARC::Record is simply skipped (bug 10684)
Sponsored-by: Universidad Nacional de Cordoba
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
bulkmarcimport.pl can crash when searching for duplicates if the 005
field from the incoming or local record is not defined. This patch
fixes it.
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Test plan
1/ Create a record with no 005 field
2/ Try to import it checking for duplicates; notice it crashes
3/ Try with a record that has a 005 field while the one in Koha is
missing it; it still crashes
4/ Apply patch
5/ No more crash
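The guard amounts to something like this (the real matching logic in
bulkmarcimport.pl is more involved; the helper name is invented):

    use strict;
    use warnings;
    use MARC::Record;

    # Only compare 005 timestamps when both records actually have one
    sub incoming_is_newer {
        my ( $incoming, $local ) = @_;
        my $new_005 = $incoming->field('005');
        my $old_005 = $local->field('005');
        return 1 unless $new_005 && $old_005;    # nothing to compare
        return $new_005->data ge $old_005->data;
    }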
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Passes all tests and QA script.
Patch fixes the problem described for importing authorities
with the bulkmarcimport.pl when trying to match with existing
records.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
The -munge-config switch has been deprecated for years, and
trying to use it would either not work at all or, if it did "work",
almost certainly damage one's Zebra configuration for Koha.
This patch removes this switch.
To test:
[1] Run rebuild_zebra.pl and verify that no mention is made
of -munge-config.
[2] Run rebuild_zebra.pl to index records in one's test database
and verify that there are no regressions.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Removing a really dangerous option
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Passes all tests and QA script.
Ran rebuild_zebra.pl with various options and confirmed
that data was reindexed successfully.
No regressions found.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch adds support for the --test option, as well as a
short message telling the user the script is running in test mode.
Test plan :
- Launch the script with -h to see the help
- Launch the script with --test and --aut with an authtypecode
that is used in your instance
- Make sure it does the same thing as launching it with -t
- Launch the script for real and make sure it still works as
expected, deleting unused authorities.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This patch follows up on the previous patch by moving the
check for whether authority and/or biblio indexing have been
specified so that -daemon has a chance to set those modes.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Based on feedback, make daemon mode imply -z -a -b and abort
on startup if flags incompatible with an incremental update daemon
are used. Update documentation to match.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
This change adds code to check the zebraqueue table with a cheap SQL query
and a daemon loop that checks for new entries and processes them incrementally
before sleeping for a controllable number of seconds. The default is 5 seconds
which provides a near realtime search index update. This is desirable particularly
for libraries that are doing active catalogue updating. The query is adjusted
based on whether -a, -b, or -a -b are specified.
Help text updated. Tested against a live 3.12 system.
Note that this fix will benefit from the fix to lack of locking (bug 11078)
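Conceptually the daemon loop is this simple (helper name and query
shape are illustrative):

    use strict;
    use warnings;
    use DBI;

    my $dbh = DBI->connect( 'dbi:mysql:koha', 'user', 'pass',
        { RaiseError => 1 } );
    my $sleep = 5;    # seconds between passes

    while (1) {
        # Cheap check: anything new on the queue since the last pass?
        my ($pending) = $dbh->selectrow_array(
            q{SELECT COUNT(*) FROM zebraqueue WHERE done = 0});
        run_incremental_update() if $pending;    # hypothetical helper
        sleep $sleep;
    }

    sub run_incremental_update { warn "indexing pending changes\n"; }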
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
To test:
0) Don't apply the patch yet.
1) Have the CataloguingLog system preference set to 'Log'.
2) Import a file of bibliographic records with bulkmarcimport.pl.
3) Check the state of CataloguingLog system preference -- it will be
set to 'Don't log'.
4) Apply the patch.
5) Repeat steps 1-3. The CataloguingLog system preference
will be 'Log'.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
http://bugs.koha-community.org/show_bug.cgi?id=8745
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
1) Does not run as root.
2) Runs as root with -run-as-root.
3) Runs as the normal koha user.
Note: Maybe the message should be clearer about why
running as root is bad and which user you should
be running the script with?
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Added a check to warn users of execution as root user.
Added a 'runas-root' switch to allow users to force execution as root user.
Signed-off-by: Mason James <mtj@kohaaloha.com>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
All tests and QA script pass.
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
When in DOM index mode, files exported by `rebuild_zebra.pl -x` are
wrapped in a '<collection></collection>' tag.
This is a problem because splitting files produces invalid files.
This is fixed by adding the missing <collection> tags in each generated
file.
Another problem was that the wrong zebra configuration file was used.
The script now uses C4::Context->zebraconfig($server)->{config} to know
which configuration file has to be used.
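The fix amounts to making every split file a complete XML document,
along these lines (the helper and namespace handling are illustrative):

    use strict;
    use warnings;

    # Each generated chunk gets its own <collection> wrapper so that it
    # stays valid XML on its own
    sub write_chunk {
        my ( $path, @marcxml ) = @_;
        open my $fh, '>:encoding(UTF-8)', $path
            or die "Cannot write $path: $!";
        print {$fh} qq{<?xml version="1.0" encoding="UTF-8"?>\n};
        print {$fh} qq{<collection xmlns="http://www.loc.gov/MARC21/slim">\n};
        print {$fh} "$_\n" for @marcxml;
        print {$fh} qq{</collection>\n};
        close $fh;
    }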
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
This avoids indexing failures due to "bad offset" or "bad length"
errors with the ISO2709 format
+ minor improvements:
- the --length parameter is optional. If not given, the script executes
the right SQL query to find the number of records to index
- new parameter --reset-index. If set, the index is reset before indexing
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Comment: Works as described. No errors.
Test: Edit a record to make it longer than 9999 bytes. Without the patch
rebuild_zebra_sliced fails. With the patch it works.
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Test plan:
Clear the zebra queue (run rebuild). Update one biblio.
Rebuild zebra (again) with -z. Check zebra log: note 2 exported records.
Now apply patch, and repeat: You will see 1 exported record.
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Works as described.
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
When using rebuild_zebra to index all records, skip over
bibliographic or authority records that don't come out
as valid XML. Also, strip extraneous XML declarations when
using --nosanitize.
Test plans
----------
Note that both plans assume that DOM indexing is turned on.
Test plan #1
============
[1] Run rebuild_zebra.pl with the -x -nosanitize options. Without
the patch, zebraidx should terminate early and complain
about invalid XML.
[2] With the patch, rebuild_zebra.pl should work without error.
Test plan #2
============
[1] Intentionally make a MARCXML record invalid, e.g., by running
the following SQL:
UPDATE biblioitems SET marcxml = CONCAT(marcxml, 'junk')
WHERE biblionumber = 123;
[2] Run rebuild_zebra.pl -b -x -r
[3] Without the patch, only part of the database will be indexed.
[4] With the patch, rebuild_zebra.pl will not export the bad
record and will give an error message saying so, but will
successfully index the rest of the records.
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Larry Baerveldt <larry@bywatersolutions.com>
Signed-off-by: Mason James <mtj@kohaaloha.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
License and copyright statement added.
Thanks to Bernardo Gonzalez Kriegel for reminding me about this.
http://bugs.koha-community.org/show_bug.cgi?id=5608
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Comment: add license information. No errors.
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
With the MARC21 standard moving from the 440 tag to the 490, this tool
is meant to help libraries make the move. It switches any information in
440 tags to 490 tags, and any information in 490 tags to 440 tags. That
seemed like the best way to go to me. There is also an option to create
830 tags for any 440 information, like authorities, that can't be
represented in the 490 tag.
To Test:
locate some biblios with 440 or 490 tags filled.
run bin/migration_tools/switch_marc21_series_info.pl -c
observe that the information in the biblios has switched 4xx tags.
http://bugs.koha-community.org/show_bug.cgi?id=5608
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Comment: Works as described. No errors.
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
With the MARC21 standard moving from the 440 tag to the 490, this tool
is meant to help libraries make the move. It switches any information in
440 tags to 490 tags, and any information in 490 tags to 440 tags. That
seemed like the best way to go to me.
To Test:
locate some biblios with 440 or 490 tags filled.
run bin/migration_tools/switch_marc21_series_info.pl -c
observe that the information in the biblios has switched 4xx tags.
http://bugs.koha-community.org/show_bug.cgi?id=5608
Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel@gmail.com>
Comment: Works as described. No errors.
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
See the script's documentation for more details
New parameters are:
- authtypes
- filter
- insert
- update
- all
Signed-off-by: Pascale Nalon <pascale.nalon@gmail.com>
This patch is live in Mines ParisTech since 2012-07-24.
Signing off
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
- Moved the sign-off from bugzilla to the commit message.
- All tests and QA script pass.
- Amended commit message to list new parameters.
- Verified this patch works on a UNIMARC installation.
- Verified normal import still works correct on a MARC21
installation.
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Removed NoZebra vestiges. This comprises several code blocks that depend on the NoZebra syspref and NZ-related functions/methods.
C4::Biblio->
GetNoZebraIndexes
_DelBiblioNoZebra
_AddBiblioNoZebra
C4::Search->
NZgetRecords
NZanalyse
NZoperatorAND
NZoperatorOR
NZoperatorNOT
NZorder
C4::Installer->
set_indexing_engine
Sponsored-by: Universidad Nacional de Córdoba
Signed-off-by: Julian Maurice <julian.maurice@biblibre.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
With the inclusion of this patch, all searches will (try) to use
QueryParser for handling queries for both the bibliographic and authority
databases if UseQueryParser is enabled. If QueryParser is unavailable,
UseQueryParser is disabled, or the search uses CCL indexes, the old
search code will be used.
To test:
1) Apply patch.
2) Run the unit test with `prove t/QueryParser.t`
3) Enable the UseQueryParser syspref.
4) Try searches that should return results in the following places:
* OPAC (simple search)
* OPAC (advanced search)
* OPAC (authorities)
* Staff client (header search)
* Staff client (advanced search)
* Staff client (cataloging search)
* Staff client (authorities)
* Staff client (importing a batch using a match point)
* Staff client (searching for an item for adding to a label)
* Staff client (acquisitions)
* Staff client (searching for a record to create a serial)
* ANYWHERE ELSE I HAVE FORGOTTEN
5) Disable the UseQueryParser syspref. Repeat at least some of the
searches you did above.
6) If all searches worked, sign off.
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Elliott Davis <elliott@bywatersolions.com>
Searching still works as expected in various places.
The QueryParser syspref seemed to be enabled by default.
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
<dcook> Then bulkauthimport.pl?
<jcamins> bulkauthimport should not be used ever.
<eythian> it probably should be deleted
<jcamins> It should be.
Signed-off-by: David Cook <dcook@prosentient.com.au>
I've poked around in bulkmarcimport.pl and it certainly seems to have the functionality that Mason (and Jared and Robin) mention.
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Replace \r with \n for newline in output for bulkmarcimport.pl
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Due to a limitation of Zebra, the register must be cleared *before*
doing shadow indexing if you want to reset the indexes. In light of
that, it does not make sense to do shadow indexing at all when
rebuild_zebra.pl is run with the -r switch. This patch makes -r (reset)
imply -n (no shadow).
To test:
1) Run `rebuild_zebra.pl -b -r -v -v -v`
2) Note that the script never runs the merge phase
Without the patch I see log lines referring to the shadow cache
(enabling shadow spec=/home/koha/koha-dev/var/lib/zebradb/biblios/shadow:20G).
With the patch I don't see anything in the logs about shadow; I do
however see lines about merging. I think it could just be a
misunderstanding of the logs.
Signed-off-by: wajasu <matted-34813@mypacks.net>
Signed-off-by: Elliott Davis <elliott@bywatersolutions.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
This allows the --framework option to be specified when running
bulkmarcimport. This option allows a framework code to be specified for
the records being imported.
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
All tests pass, perlcritic fails before and after.
Tested
- imported records with -framework FA, FA framework is used
- imported records without -framework, default framework is used
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Previously we used the "delete" command in zebraidx, which fails when
you try to delete a record that doesn't exist in the index. By changing
to the "adelete" command, we can reduce the likelihood of a failed
delete causing ghost records. A symptom of this problem is the warning
message occasionally encountered when indexing from the zebraqueue,
"[warn] cannot delete record above (seems new)."
To test:
1) Add a recordDelete action for a record that does not exist to
zebraqueue in MySQL:
INSERT INTO zebraqueue (biblio_auth_number, operation, server) \
VALUES (999999999, 'recordDelete', 'biblioserver');
2) Run `rebuild_zebra.pl -b -z -v [-x]`.
3) Note that you do not get the message "[warn] cannot delete record
above (seems new)".
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Passed-QA-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
indexing not indexation
some minor grammatical changes
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
This patch adds the Koha::Indexer::RecordNormalizer and
Koha::Indexer::MARC::RecordNormalizer::EmbedSeeFromHeadings packages
to enable the inclusion of alternate forms of headings in bibliographic
searches. When the new syspref IncludeSeeFromInSearches is turned on
(default is off) rebuild_zebra.pl will insert see from headings from
authority records into bibliographic records when indexing, so that a
search on an obsolete term will turn up relevant records.
To test:
1) Enable IncludeSeeFromInSearches
2) Add a heading that has an alternate form to a record (for example,
"Cooking" has the alternate form "Cookery," if you have authority
records from LC)
3) Index the zebraqueue (or reindex if you haven't indexed your system
yet)
4) Confirm that if you search for "Cookery" you get the record you
just modified
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Rebased on master 5 August 2012
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Rebased on master 11 September 2012
Signed-off-by: Katrin Fischer <Katrin.Fischer.83@web.de>
Also checked:
- Verified database update works correctly
- Checked system preference and its description
- Checked staff/opac detail pages with feature on/off
- Checked staff/opac search facets
- Downloaded and tested records in various formats
- Tried different searches for 'see from' entries of authorities
- Ran all unit tests
No problems found.
Fix syntax errors preventing the scripts misc/translator/text-extract2.pl
and misc/cronjobs/thirdparty/TalkingTech_itiva_inbound.pl from compiling.
Remove misc/migration_tools/build6xx.pl entirely since it refers to
columns that no longer exist in the Koha database, and has seemingly
had broken encoding since Koha switched from CVS to git (or before!).
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Small script that checks if each biblio record in the DB is properly indexed.
Use -h to learn more.
(MT #6389)
Signed-off-by: Robin Sheat <robin@catalyst.net.nz>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Complete rewrite of rebuild_zebra_sliced.zsh (renamed to .sh). Main
improvements are:
- both biblio and authority records are handled
- records are exported only once
It also adds an option --skip-index to rebuild_zebra.pl that permits
using rebuild_zebra.pl as an 'export only' script.
Description:
Index Koha records by chunks. It is useful when some record causes
errors and stops the indexing process. With this script, if indexing of
one chunk fails, the chunk is split into 2 (or 3) chunks, and indexing
continues on those chunks.
rebuild_zebra.pl is called only once to export records.
Splitting and indexing are handled by this script (using yaz-marcdump
and zebraidx).
Signed-off-by: Martin Renvoize <martin.renvoize@ptfs-europe.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
In rebuild_zebra.pl, if we are in "unimarc" ("marcflavour" syspref), the sub "fix_unimarc_100" is called and checks if the 100$a length is equal to 35.
If it is not the case, the sub inserts the localtime and more, so we lose the data on reindexing.
The standard length is 36.
I have just changed 35 to 36.
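In outline the corrected check looks like this (the rebuild logic is
elided; only the length test matters here):

    use strict;
    use warnings;
    use MARC::Record;

    sub fix_unimarc_100 {
        my ($record) = @_;
        my $sub_a = $record->subfield( '100', 'a' );
        # A well-formed UNIMARC 100$a is 36 characters, not 35; leave
        # valid data alone instead of overwriting it with localtime
        return if defined $sub_a && length($sub_a) == 36;
        # ... rebuild 100$a here ...
    }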
Signed-off-by: Sophie Meynieux <sophie.meynieux@biblibre.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
One consequence is that the -x and -a options are no longer
mutually exclusive.
Also, because of the way that the GRS-1 SGML filter works, if you're
indexing multiple documents, you can't just wrap them in a document
element, but the DOM filter *requires* it. Consequently, two
new config settings in koha-conf.xml are added to indicate the
Zebra filter in use so that the -x option of rebuild_zebra.pl
knows whether to wrap the exported records or not:
- bib_index_mode (defaults to 'grs1' if not specified)
- auth_index_mode (defaults to 'dom')
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Simple command-line client which can authorize itself to Koha,
get a MARC XML record based on a biblio number, and update the record.
This script can also be used as a module (via require "koha-svc.pl")
from other scripts which can implement MARC XML creation or parsing.
This is follow up version which now uses Content-type: text/xml
header when using POST method to be in sync with documentation at
http://wiki.koha-community.org/wiki/Koha_/svc/_HTTP_API
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
This adds the -dedupbarcode option that allows bulkmarcimport to erase
the barcode but keep the item for any items it finds with duplicate
barcodes.
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
New sql tables:
- oai_sets: contains the list of sets, described by a spec and a name
- oai_sets_descriptions: contains a list of descriptions for each set
- oai_sets_mappings: conditions on marc fields to match for biblio to be
in a set
- oai_sets_biblios: list of biblionumbers for each set
New admin page: allows sets to be configured:
- Creation, deletion, modification of spec, name and descriptions
- Define mappings which will be used for building oai sets
Implements OAI Sets in opac/oai.pl:
- ListSets, ListIdentifiers, ListRecords, GetRecord
New script misc/migration_tools/build_oai_sets.pl:
- Retrieve marcxml from all biblios and test if they belong to defined
sets. The oai_sets_biblios table is then updated accordingly
New system preference OAI-PMH:AutoUpdateSets. If on, update sets
automatically when a biblio is created or updated.
Use OPACBaseURL in oai_dc xslt
This patch reimplements a feature that is on biblibre/master for Koha-community/master.
It adds 4 parameters:
* offset = the offset of records. Say 1000 to start rebuilding at the 1000th record of your database
* length = how many records to export. Say 400 to export only 400 records
* where = add a where clause to rebuild only a given itemtype, or anything you want to filter on
Another improvement resulting from the offset & length limits is rebuild_zebra_sliced.zsh,
which will be submitted in another patch.
rebuild_zebra_sliced will slice your whole database into small chunks and, if something goes wrong for a given slice, will slice the slice, and repeat, until it reaches a slice size of 1, showing which record in your database is broken.
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Removed mention of -l option for limiting number of items exported, as requested
by QA manager. This can be re-added in a later patch.
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Use :encoding(UTF-8) rather than :utf8 for stricter
encoding.
Marking output as ':utf8' only flags the data as utf8;
using ':encoding(UTF-8)' also checks it as valid UTF-8.
See binmode in perlfunc for more details.
In accordance with the robustness principle, input
filehandles have not been changed, as code may make
the undocumented assumption that invalid UTF-8 is present
in the input.
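The difference in one example:

    use strict;
    use warnings;

    # ':utf8' only flags the stream; ':encoding(UTF-8)' also validates
    # the bytes and warns on malformed data
    open my $out, '>:encoding(UTF-8)', '/tmp/example.txt'
        or die "Cannot open: $!";
    print {$out} "café\n";
    close $out;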
Fixes errors reported by t/00-testcritic.t
Where feasible some filehandles have been made lexical rather than
reusing global filehandle vars
Signed-off-by: Jonathan Druart <jonathan.druart@biblibre.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Currently, the -v option resets Zebra log output to default system values.
This produces the amount of logging specified by the system defaults,
which is usually too low for debugging.
This change explicitly forces all Zebra log output, which creates much
more chatter, so it is triggered at verbosity level 2.
Test scenario:
1. pick koha site to reindex
2. use -v -v options to rebuild_zebra.pl to see additional output
Signed-off-by: Liz Rea <wizzyrea@gmail.com>
Verified help corrections and loglevel 2 output vs. loglevel 1 output. No issues found.
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Sometimes zebra needs a tmp dir in order to work. This ensures that it
is created both by koha-create-dirs in the packages, and by
rebuild_zebra when it runs.
--
tested ok, signing off
Signed-off-by: Mason James <mtj@kohaaloha.com>
This patch properly handles items containing extended characters and
sends valid XML records to zebraidx.
Signed-off-by: Julian Maurice <julian.maurice@biblibre.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Pref MergeAuthoritiesOnUpdate does not exist; should be dontmerge
(AuthoritiesMarc.pm).
Instead of the folder modified_authorities, a table is now introduced for
this purpose: need_merge_authorities. This eliminates several permissions and
security issues. This change applies to AuthoritiesMarc.pm and
merge_authority.pl.
POD lines added for ModAuthority. Deprecated parameter $merge removed.
Test this patch by applying the db revision first from the second patch.
August 4, 2011: Rebased.
Signed-off-by: Frédéric Demians <f.demians@tamil.fr>
Thanks Marcel. It works as advertised. Both modes are functional
(again):
- Immediate with dontmerge=0: After modifying an authority record, its
linked biblios are immediately modified. This isn't the case in 3.4.5.
- Delayed with dontmerge=1: After modifying an authority record, its
linked biblios are not modified. But an entry is added to
need_merge_authority new table and 'merge_authorities.pl -b' script
updates biblios.
Comment: need_merge_authority, like zebraqueue, should be cleared
from time to time.
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
This patch fixes an issue whereby biblios with many items (often > 500) would index,
but not the biblionumber itself, resulting in search results with a) inaccurate item counts
and b) no biblionumber to use in the link to the details page. This is due to Net::Z3950::ZOOM not providing
a mechanism for specifying different connection attributes; the maximumRecordSize ZOOM connection attribute,
if not specified, defaults to 1MB, which is less than the size of a MARC record with many, many 952 fields. Since
it is unlikely we can fix Net::Z3950::ZOOM in a timely fashion, this patch aims to build a workaround on the Koha end.
This patch changes EmbedItemsInMarcBiblio to use append_fields instead of insert_fields_ordered,
so the 999$c will come before the item records. It's VERY unlikely we will encounter more than 1MB of biblio-level MARC
content, as this would break the ISO-2709 standard by a large factor.
To this end, it also moves the fix_biblio_ids portion of get_corrected_marc_record out of rebuild_zebra.pl,
and makes it a part of GetMarcBiblio (right before EmbedItemsInMarcBiblio, so the 952s still come last). fix_biblio_ids
is kept as a subroutine for the deletion portion of rebuild_zebra.pl, which still uses it.
It also uses the subroutine parameter in GetMarcBiblio to do the EmbedItemsInMarcBiblio action, rather than having
rebuild_zebra.pl perform it on the itemless record returned from GetMarcBiblio. Simpler and cleaner that way.
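In MARC::Record terms, the ordering guarantee looks like this sketch
(field contents are invented):

    use strict;
    use warnings;
    use MARC::Record;
    use MARC::Field;

    my $record = MARC::Record->new();
    $record->append_fields(
        MARC::Field->new( '245', '0', '0', a => 'Example title' ),
        MARC::Field->new( '999', ' ', ' ', c => '123', d => '123' ),
    );

    # append_fields keeps the 952s *after* the 999$c, so the biblio-level
    # fields stay within the 1MB maximumRecordSize window
    $record->append_fields(
        map { MARC::Field->new( '952', ' ', ' ', p => "BC$_" ) } 1 .. 700 );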
To verify bug issue:
1. Find a biblio with over 700 items (or enough that the resulting MARCXML is greater than 1MB)
2. search for this biblio (in a search that would return multiple results, not just this title). You should get the title in
the results list
3. attempt to click the link to this biblio's details page; the biblionumber should be blank, leading to a 404
To test solution:
1. Apply patch
2. modify the biblio slightly (click the 005 for example) and save
OR manually add the biblio to zebraqueue for reindexing
3. after rebuild_zebra.pl -z -b -x runs, use the same search as above. The title should still appear.
4. click the link, and find yourself on the biblio detail page as desired
Signed-off-by: D Ruth Bavousett <ruth@bywatersolutions.com>
Signed-off-by: Paul Poulain <paul.poulain@biblibre.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Display links to parent biblios, show linked items in holdings, allow holds on
linked items. This uses MARC to maintain relationships.
Sponsored by the Mississippi Department of Archives and History and RapidRadio
Solution. Originally developed by Savitra Sirohi and Amit Gupta at OSSLabs, with
UNIMARC support added by Zeno Tajoli. Commits squashed and merge conflicts
resolved by Chris Cormack from Catalyst. Respect for NORMARC and some small
framework portability fixes made by Jared Camins-Esakov of C & P Bibliography
Services.
IMPORTANT NOTE: A bug in the 773 coding for MARC21 was corrected from the
original OSS Labs code. The 773s generated by the pre-release code did not have
the first indicator set to '0', which means that they were not supposed to
display. Going forward, the first indicator will be set correctly, but existing
records created with this code will no longer appear (they appeared before only
due to another bug). To correct this, you could globally (or, to make sure you
only modify records created with the Analytics tool, for records with 773$0)
change the first indicator of the 773 from blank to '0'.
== Background ==
An analytic record for an item is a more detailed, monographic biblio for an
item attached to a serial record. This is often used for special issues of a
journal that are released as books on their own (assigned an ISBN, as well as an
ISSN/volume/issue). It is important for researchers to be able to search for
these items both as issues of the serial, and as monographs. It is equally
important for the library to not have duplicate item records for the item in
question to have to keep synchronized.
== Establishing relationships ==
Analytical records are connected to items belonging to parent or host
bibliographic records. This can be accomplished by:
* From an analytical bibliographic record linking to a host item by providing
the item barcode as input
* From a host item by using option "analyze", this creates a new empty
bibliographic record with field 773 (MARC21) populated
* Running a new CLI script that establishes a relationship between the
analytical record and the host item identified by the barcode in the
analytical record's 773$o (MARC21)
== Connecting Records ==
The relationships are maintained in the MARC records, we have not used database
tables at all.
== MARC Representation ==
In MARC21/NORMARC we have used:
* 773$9 to store the Koha item number of the host item
* 773$0 to store the Koha biblio number of the host bibliographic record
The above fields are used to display the relationships in various screens in the
OPAC and the staff interface. Additionally, when populating field 773 with the
host item's details, we have used the following MARC 21 mapping:
* 'a' <= 100/110/111 $a (author main)
* 'b' <= 250$a (edition)
* 'd' <= 260$a, 260$b, 260$c (place, publisher, year)
* 'o' <= barcode
* 't' <= 245$a (title)
* 'w' <= (003)001 --> if no 001 is available, we can populate biblionumber
* 'x' <= 022$a (issn)
* 'z' <= 020$a (isbn)
In UNIMARC, this code uses:
* 461$9 to store the Koha item number of the host item
* 461$0 to store the Koha biblio number of the host bibliographic record
When populating field 461 in UNIMARC, the following mapping is used:
* 't' <= 200$a (title)
== Treatment of Holds ==
A key requirement was to allow holds to be placed on host items from the
analytical record. We have accomplished this by allowing holds on specific
copies only. Biblio level holds are not allowed. This ensures that holds are
placed on specific items that are relevant to the analytical record.
== Deleting host items with linked analytical records ==
As we have not used database tables to maintain relationships, we had to use
search to find out if any linked analytical records are present. If 1 or more
analytical records are present, we do not allow deletion of items. This is similar to
what we see when we try to delete authority records.
== Importing analytical records ==
Analytical records can be imported using bulkmarcimport or the GUI tools. The
new CLI script can be executed after the import to establish relationships with
host items. The script will establish relationships using the host item's
barcode, the barcode must be present in 773$o of the analytical record.
== What if there are two or more copies of the host item? ==
The current design will require that there be two host (773) fields, one for
each copy.
== What if there is no barcode available for the host item? ==
It is still possible to establish a relationship, by populating 773$9 with the
host's item number. However the CLI script uses barcode in 773$o to establish
relationships, so it won't work where barcodes are unavailable. Also, from an
analytical record it is possible to establish a relationship to a host item by
providing the barcode as input; this option will not be available either.
Commits that added the following features were squashed by Chris Cormack (this
is not a list of every commit):
* Display links to host records from biblio detail screens
* Support for UNIMARC, respecting the system preference 'marcflavor'
* Support holds from the OPAC
* Ability to link to items belonging to host records from an analytical record
* Display items belonging to host records in the moredetail page
* Ability to edit items belonging to host records, also ability to delink from
them
* Move get host items code into a C4 routine, also calling the new routine in
related perl scripts
* Move host field population to a C4 routine, all changes in pl files to call
new routine
* Allow only specific copy holds for analytical records plus changes to use new
C4 routines
* Support for holds on items linked via host records
* Storing bibnumber and itemnumber in subfields 0 and 9, plus other mapping
changes
* New command line script that establishes relationships between analytical
records and host items and bibs. The script looks for host field (MARC21 773)
in records, and based on barcode in subfield 'o' populates host bibnumber in
subfield '0' and host itemnumber in subfield '9'. The script can be run after
an import of analytical records, it can also be run in the crontab to maintain
the relationships
* Ability to create analytical records from items, to view linked analytics, and
prevent deletion of items that have linked analytics
* New template for catalogue/detail.pl (NOTE: not a new template file, just a
new way of displaying analytics), template displays linked analytics and
allows creation of analytical records
* New zebra index for item number in host fields. This index will be used to
display links to analytical records from host records
* Display title of host record instead of the phrase host record
* Using detail.tmpl for analytics tab instead of a new template file
* Improved qualification info preparation in Prephostmarcfield
* Check for linked analytics before deleting item
* Display link to host record and more meaningful anchor text for edit item link
* Analytical record: Unimarc index in record.abs and help in
create_analytical_rel.pl
* Adding a sys pref that controls display of options to create analytical
relationships
* Add host entry in XSLT stylesheet in staff item detail
* Added host record support to OPAC detail XSLT
* Adding 773$0 and 773$9 to all frameworks
* Adding 773 subfields 0 and 9 to default marc framework via updatedatabase.pl
* Display create analytics and used in links in catalog detail
* Fixed problem where analytical records not showing in OPAC search results
because GetMarcBiblio now needs a flag to add item records
* Fixed problem where analytics count was set to 1 for all records, not just
those with analytics
* Fixed catalogue detail page not to show analytics counts if count is 0
Conflicts:
installer/data/mysql/updatedatabase.pl
koha-tmpl/intranet-tmpl/prog/en/modules/cataloguing/addbiblio.tt
kohaversion.pl
Co-author: Savitra Sirohi <savitra.sirohi@osslabs.biz>
Co-author: Zeno Tajoli <tajoli@cilea.it>
Signed-off-by: Jared Camins-Esakov <jcamins@cpbibliography.com>
Signed-off-by: Ian Walls <ian.walls@bywatersolutions.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
This both adds a bit of a failsafe to get_raw_biblio, and prevents
records that have been deleted from being updated by the same instance
of rebuild_zebra.
Minor amendment to remove duplication of 6433
Signed-off-by: MJ Ray <mjr@phonecoop.coop>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Fixes bug where a bib record imported by bulkmarcimport.pl
could become unindexable by ensuring that ModBiblioMarc()
is always called by bulkmarcimport.pl to finalize saving the
bib record (as it was initially created by AddBiblio with the
defer_marc_save option).
Also introduces a utility routine, C4::Biblio::_strip_item_fields.
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Adds a new routine, C4::Biblio::EmbedItemsInMarcBiblio, to
embed the items in the bib record when necessary:
* cataloging/additem.pl
* rebuild_zebra.pl
Signed-off-by: Galen Charlton <gmc@esilibrary.com>
Signed-off-by: Claire Hernandez <claire.hernandez@biblibre.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
This is a squash of four patches by Henri-Damien Laurent
starting work on removing the copy of item record information
in the 9XX field of bibliographic records. The reason
for doing this is primarily to improve performance, in particular,
the expense of having to add/modify the bib record whenever an
item changes. Now, whenever an item changes, the bib record is
put in the queue to be reindexed; when the bib is indexed, the 9XX
fields are inserted into the version of the bib that Zebra indexes.
Since rebuild_zebra.pl runs in a separate process, the processing of the
bib record will not delay (e.g.) circulation.
As part of upgrading to 3.4, the following batch script should be run:
misc/maintenance/remove_items_from_biblioitems.pl --run
This should be followed by a complete reindexing of the bib records, e.g.,
misc/migration_tools/rebuild_zebra.pl -b -r
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
Signed-off-by: Claire Hernandez <claire.hernandez@biblibre.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
The import script shouldn't remove information present in incoming biblio
records. With this patch, by default, ISBNs are no longer cleared.
[2011.04.12] Rebased on HEAD
DOCUMENTATION: There is a new parameter --isbn|--noisbn
Signed-off-by: Colin Campbell <colin.campbell@ptfs-europe.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Remove some unnecessary checks where checking the error is
sufficient. Make the order in some cases more logical.
Should remove some possibilities of runtime warning noise.
Although some calls belong to the 'nothing could
ever go wrong' school, some warnings have been added.
Signed-off-by: Christophe Croullebois <christophe.croullebois@biblibre.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Reimplements support for -r, as well as for -reset.
Signed-off-by: D Ruth Bavousett <ruth@bywatersolutions.com>
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
If the zebra server directories don't exist, zebra will spit the dummy.
This makes rebuild_zebra.pl smart enough to create them if they're not
there. If that fails, it'll scream loudly so you know zebra isn't
reindexing.
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
This prevents it leaving files lying around in /tmp
Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
Adding some new options to bulkmarcimport:
-k idtagsubfield in order to store the id of the record from the file in another field
-match tagsubfield,index
-a to import authorities
-l logfilename to store logs
Bug fixing: C4/Charset.pm
Charset was incorrect for UNIMARC Authorities
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
Numbers in Perl with leading zeros are interpreted as octal.
Ensure that comparisons are done using string operators,
or where appropriate use the MARC::Field method.
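A two-line illustration of the pitfall:

    use strict;
    use warnings;

    # A numeric literal with a leading zero is octal: 010 is 8
    print 010 == 8 ? "octal\n" : "decimal\n";    # prints "octal"

    # So compare tag-like values as strings, not numbers
    my $tag = '010';
    print "is 010\n" if $tag eq '010';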
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
mergeauthority and ModAuthority were working on two separate directories,
so no authority would ever be merged via cronjob or command-line script
when MergeAuthoritiesOnUpdate is disabled.
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
The standard license statement in the header is fine; please
don't confuse things by doing anything different.
Signed-off-by: Galen Charlton <gmcharlt@gmail.com>
With this patch, rebuild_zebra can re-index a whole Koha DB
quickly:
rebuild_zebra -r -b -nosanitize
Biblio (authority) records are dumped directly to a file
from the marcxml field without being transformed into
MARC::Record objects and corrected.
DOCUMENTATION:
rebuild_zebra.pl new parameter:
-nosanitize  export biblio/authority records directly from the DB
             marcxml field without sanitizing records. It speeds up
             the dump process but could fail if the DB contains badly
             encoded records. Currently works only with -x and -b
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
This patch adds the MARC21 subdivision record tags (18x) to the
block which recognizes and assigns authtypecodes to imported
authority records.
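The addition is roughly of this shape (the authtypecodes shown are
Koha's usual defaults and should be treated as illustrative):
    my %subdivision_authtypes = (
        '180' => 'GEN_SUBDIV',   # general subdivision
        '181' => 'GEO_SUBDIV',   # geographic subdivision
        '182' => 'CHRON_SUBD',   # chronological subdivision
        '185' => 'FORM_SUBD',    # form subdivision
    );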
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
Fixes a hang of the staging import tool when it
attempts to process a MARC21 record that claims
that it's UTF-8 when it is not. The staging import
will now attempt to fix the character encoding of such
records.
Also added a FIXME to bulkmarcimport.pl, which, because of its use
of MARC::Batch, will skip over such records -
better than the original hang of the staging import, but
worse than the staging import's new ability to fix such
records.
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
Adds three new switches:
-idmap <filename> - optional output file of
map of source record ID numbers
to Koha biblionumber
-x - if idmap is supplied, MARC tag
to get source record ID from
-y - if idmap is supplied, MARC subfield
to get source record ID from
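For example (hypothetical paths and tag choice; note that these -x/-y
switches belong to this import script, not to rebuild_zebra.pl):
    misc/migration_tools/bulkmarcimport.pl -b -file records.mrc \
        -idmap /tmp/idmap.txt -x 035 -y a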
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
This includes part of a patch from Henri-Damien Laurent
that could not be applied because Chris's and Joe's patches
happened to win the race.
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
Add "use warnings", remove unused variables and unnecessary finish/disconnect
at the end. This script could be improved to run only on tables that need to
be altered instead of touching all of them. It should also probably contain
warnings to the effect that it does not rescue your DATA that was forced into
whatever encoding the table used previously.
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
Add the phrase 'if ( $verbose_logging )' to the two print statements
concerning the skipping of biblio or authority records.
I recently had to split biblio and authority index updating in my cron
script (I had some really big records, so I had to add the -x switch,
which should only be used on biblios according to the help). So I
noticed that rebuild_zebra.pl printed messages that it was skipping
biblios or authorities.
This patch is to conditionalize those prints based on the verbose
logging switch.
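That is, the two prints now fire only under verbose logging (message
text illustrative):
    print "Skipping biblios\n"     if ( $verbose_logging );
    print "Skipping authorities\n" if ( $verbose_logging );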
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
This reduces the output of the script and zebraidx, and adds a -v
command line switch which restores the logging to its former
verbosity.
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
Merge now works on the fly. But for an obscure reason,
merge_authority.pl fails to update the database when launched from the
command line. Adds one table to LOCK for the noZebra UPDATE in
Biblio.pm. C4::Search should be removed from merge_authority.pl.
Signed-off-by: Galen Charlton <galen.charlton@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
Prior to this patch, rebuild_zebra.pl -z was effectively
hanging on to a lock on the zebraqueue table, preventing
other scripts from inserting new entries into the table.
This had the effect of causing circulation operations
to time out.
Refactored by having rebuild_zebra.pl pull the active
queue into memory, then mark entries done by zebraqueue.id.
Consequently, rebuild_zebra.pl should no longer
block adding new entries into zebraqueue.
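A minimal sketch of the new pattern (column names per the zebraqueue
table; error handling omitted):
    # Pull the active queue into memory, releasing any long-held lock:
    my $ids = $dbh->selectcol_arrayref(
        'SELECT id FROM zebraqueue WHERE done = 0');
    # ... export and index the corresponding records ...
    # Mark each processed entry done individually, by primary key:
    my $sth = $dbh->prepare('UPDATE zebraqueue SET done = 1 WHERE id = ?');
    $sth->execute($_) for @$ids;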
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
Note: the 995 field for items is hardcoded, so it's really for UNIMARC only. The script exits if your marcflavour is not UNIMARC.
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
rebuild_zebra.pl will now mark all zebraqueue entries
of the affected record type(s) done when run in
normal mode to index all records (as opposed to running
it with -z to just process the zebraqueue). This prevents
any running zebraqueue_daemon processes from attempting
to reindex the same records, redundantly.
The new -y switch overrides this new behavior; in other words, if
running rebuild_zebra.pl without -z, you can specify
-y to *not* mark zebraqueue done.
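For example:
    # Full reindex, but leave zebraqueue for zebraqueue_daemon to process:
    misc/migration_tools/rebuild_zebra.pl -b -a -y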
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
* Add a new parameter -o to begin importing the input file after
skipping n records.
* Enclose input file reading in an eval directive to avoid aborting the
import if a few records are corrupted: they are now skipped.
* Help formatting.
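The reading loop is wrapped roughly like this ($batch is an assumed
MARC::Batch object):
    while (1) {
        my $record = eval { $batch->next() };
        if ($@) {
            warn "Skipping corrupted record: $@";   # keep importing
            next;
        }
        last unless $record;    # end of input
        # ... import $record ...
    }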
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
Accidentally introducing a circular reference in a
MARC::Record object does not lead to goodness, particularly
if you export lots and lots of them.
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
The -z option, when used in conjunction with -a and/or -b,
selects the records to reindex from the zebraqueue table.
Both record updates and record deletes are handled.
-z cannot be used with -s or -r: the updated records
must always be freshly exported, and if zebraqueue
is to be processed, it's assumed that you don't want
to drop the Zebra index first.
This means that rebuild_zebra.pl -b -a -z can be
used as a cronjob to update the indexes periodically; it
is believed that this will offer much better indexing
performance on some setups as compared to zebraqueue_daemon.pl,
which uses Z39.50 extended services to send record updates
to Zebra.
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
At the moment, using both -a (index authorities) and
-x (export records as MARC XML) is not allowed -
if the Zebra authority database is using the DOM
filter, zebraidx will not be able to process the
exported records correctly.
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
1. Logic to fix up record IDs, UNIMARC 100 field,
and record leader now in separate functions.
2. Removed (incorrect) logic to save corrected record
in database.
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
The version of MARC::Batch->new() distributed with version
2.0.0 of MARC::Record, if given a file name, will
open it using the ':utf8' layer. This results in an
incorrect character conversion when processing records
in the MARC-8 character encoding.
To avoid this, batch jobs that use MARC::Batch now
open the file themselves, then pass the file handle
to MARC::Batch->new().
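That is, roughly (file name and MARC flavour assumed):
    # Open the file ourselves with no ':utf8' layer, so MARC-8 bytes
    # pass through untouched, then give MARC::Batch the handle:
    open my $fh, '<', $filename or die "cannot open $filename: $!";
    my $batch = MARC::Batch->new('USMARC', $fh);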
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
* IsStringUTF8ish - determine if scalar contains a string in UTF8
* MarcToUTF8Record - convert MARC blob or MARC::Record to UTF8
* SetMarcUnicodeFlag - set appropriate MARC21 or UNIMARC field to
indicate that record is in UTF-8.
Design points of this module include:
* No dependencies on other C4 modules, making it easier to add
more test cases
* All character conversion code in one place
* Single entry point for doing a character conversion on a
MARC record
* Capture of errors and warnings produced by Text::Iconv
and MARC::Charset
* Start of support for guessing the source character set of
a MARC record.
Several functions were moved from other scripts
or modules to C4::Charset:
* C4::Koha->FixEncoding (expanded and renamed
MarcToUTF8Record)
* C4::Koha->char_decode5426
* fMARC8ToUTF8 from bulkmarcimport.pl (renamed
_marc_marc8_to_utf8)
Several batch jobs were adjusted to use MarcToUTF8Record instead of
FixEncoding.
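A hedged usage sketch (argument and return details may differ from the
module's final POD):
    use C4::Charset;
    my ($marc_record, $converted_from, $errors) =
        MarcToUTF8Record($marc_blob, 'MARC21');
    warn "charset: $_" for @$errors;    # captured conversion warnings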
Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>