Koha-community/Koha - Koha: The world's first free and open source library system

Author	SHA1	Message	Date
Galen Charlton	df1f46f9da	bug 2253: improve rebuild_zebra's handling of zebraqueue Prior to this patch, rebuild_zebra.pl -z was effectively hanging on to a lock on the zebraqueue table, preventing other scripts from inserting new entries into the table. This had the effect of causing circulation operations to time out. Refactored by having rebuld_zebra.pl pull the active queue into memory, then mark entries done by zebraqueue.id. Consequently, rebuild_zebra.pl should no longer block adding new entries into zebraqueue. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-06-19 09:49:06 -05:00
Paul POULAIN	feae120738	BUGFIX : script to fix & fill onloan field in items table. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-05-12 09:24:43 -05:00
Galen Charlton	a78b115d35	kohabug 2076 - make biblioitems.marc longblob during upgrade Change to match 3.0 definition of that column. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-05-11 05:37:18 -05:00
Paul POULAIN	8e1844d495	missing ) Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-05-05 05:39:13 -05:00
Paul POULAIN	e7209ed02a	UNIMARC specific rebuild items correctly note 995 for items is hardcoded, so it's really for UNIMARC only. The script exit if you're not UNIMARCflavour Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-04-22 17:34:41 -05:00
Galen Charlton	3109d5820e	rebuild_zebra.pl - add -y option rebuild_zebra.pl will now mark all zebraqueue entries of the affected record type(s) done when run in normal mode to index all records (as opposed to running it with -z to just process the zebraqueue). This prevents any running zebraqueue_daemon processes from attempting to reindex the same records, redundantly. The new -y swtich overrides this new behavior; in other words, if running rebuild_zebra.pl without -z, you can specify -y to not mark zebraqueue done. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-04-21 11:17:29 -05:00
Frederic Demians	004524584b	Tweak bullmarcimport.pl * Add a new parameter -o to begin importing input file after skiping n records. * Enclose input file reading in an eval directive to avoid abording import if few records are corrupted: they are now skipped. * Help formating. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-04-17 05:52:53 -05:00
Galen Charlton	e2c1f11715	fixed memory leak I introduced Accidentally introducing a circular reference in a MARC::Record object does not lead to goodness, particularly if you export lots and lots of them. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-04-01 06:46:05 -05:00
Galen Charlton	4f001186b6	still more rebuild_zebra refactoring Merged duplicate code for indexing bibs and authorities into a single index_records() function. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-25 07:58:03 -05:00
Galen Charlton	a5576b8dfe	IMPORTANT: added -z option to rebuild_zebra.pl The -z option, when used in conjunction with -a and/or -b, selects the records to reindex from the zebraqueue table. Both record updates and record deletes are handled. -z is cannot be used with -s or -r: the updated records must always be freshly exported, and if zebraqueue is to be processed, it's assumed that you don't want to drop the Zebra index first. This means that rebuild_zebra.pl -b -a -x can be used as a cronjob to update the indexes periodically; it is believed that this will offer much better indexing performance on some setups as compared to zebraqueue_daemon.pl, which uses Z39.50 extended services to send record updates to Zebra. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-25 07:58:01 -05:00
Galen Charlton	57d128f727	rebuild_zebra: exit if both -a and -x specified At moment using both -a (index authorities) and -x (export records as MARC XML) is not allowed - if the Zebra authority database is using the DOM filter, zebraidx will not be able to process the exported records correctly. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-25 07:57:44 -05:00
Galen Charlton	f0d5da7448	more rebuild_zebra.pl refactoring 1. Logic to fix up record IDs, UNIMARC 100 field, and record leader now in separate functions. 2. Removed (incorrect) logic to save corrected record in database. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-25 07:57:43 -05:00
Galen Charlton	f98c27a8bc	refactor rebuild_zebra: new routine for invoking zebraidx Created a routine for calling zebraidx, replacing separate invocations for bibs and authorities. Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-25 07:57:42 -05:00
Galen Charlton	ae8a76dacc	rebuild_zebra.pl: removed disused $limit option Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-25 07:57:41 -05:00
Galen Charlton	b4f39e5c58	do not let MARC::Batch open MARC files The version of MARC::Batch->new() distributed with version 2.0.0 of MARC::Record, if given a file name, will open it using the ':utf8' layer. This results in an incorrect character conversion when processing records in the MARC-8 character encoding. To avoid this, batch jobs that use MARC::Batch now open the file themselves, then pass the file handle to MARC::Batch->new(). Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-21 21:46:39 -05:00
Galen Charlton	ad0639e548	remove some unneeded use statements Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-21 21:46:29 -05:00
Galen Charlton	4e95689287	bulkmarcimport.pl: XML input option documented Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-03 13:01:00 -06:00
Galen Charlton	d49873cc2f	bulkauthimport.pl - various improvements Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-03-03 13:00:59 -06:00
Mason James	5057e74914	more \Q...\E wrapping on regexes, to handle occassionally problematic strings. Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-28 07:58:45 -06:00
Mason James	d451f072f9	corrections to host-item, shelf_loc and collection-code indexes Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-28 07:58:43 -06:00
Mason James	be14507658	oops, removing un-needed $dbh->commit() calls Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-20 20:16:42 -06:00
Mason James	b57c146b26	setting $dbh->{AutoCommit} = 0, and adding a new --commit arg. Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-20 20:16:41 -06:00
Paul POULAIN	0e2b065219	NoZebra : removing . and : before indexing Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-19 20:27:36 -06:00
Paul POULAIN	bcf36122a6	speeding a lot rebuild_nozebra by using autocommit OFF feature Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-19 20:27:24 -06:00
Mason James	bdd2afc747	a little speed tweak here, setting "SET FOREIGN_KEY_CHECKS = 0" before clearing bib/bi/items tables. Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-18 22:07:09 -06:00
Ryan Higgins	71dd69d5ac	add option to export and index xml to rebuild_zebra Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-15 08:25:46 -06:00
Mason James	3cb4ea7ecf	added 440* and 490* 'series' indexes Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-11 16:14:54 -06:00
Galen Charlton	60a98d258a	IMPORTANT - refactor MARC character set handling * IsStringUTF8ish - determine if scalar contains a string in UTF8 * MarcToUTF8Record - convert MARC blob or MARC::Record to UTF8 * SetMarcUnicodeFlag - set appropriate MARC21 or UNIMARC field to indicate that record is in UTF-8. Design points of this module include: * No dependencies on other C4 modules, making it easier to add more test cases * All character conversion code in one place * Single entry point for doing a character conversion on a MARC record * Capture of errors and warnings produced by Text::Iconv and MARC::Charset * Start of support for guessing the source character set of a MARC record. Several functions were moved from other scripts or modules to C4::Charset: * C4::Koha->FixEncoding (expanded and renamed MarcToUTF8Record) * C4::Koha->char_decode5426 * fMARC8ToUTF8 from bulkmarcimport.pl (renamed _marc_marc8_to_utf8) Several batch jobs were adjusted to use MarcToUTF8Record instead of FixEncoding. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-02-03 07:23:56 -06:00
Daniel BÃÂ¼nzli	78f3e56e2c	bulkauthimport fix Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Galen Charlton <galen.charlton@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-22 07:20:28 -06:00
Joshua Ferraro	2a37c19dac	Rudimentary import of MARC21 authorities Also adding support for ingesting format MARCXML in bulkmarcimport and bulkauthimport Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-04 21:30:17 -06:00
Joshua Ferraro	9c25d6368a	improvements to INSTALL.debian, adding Symbols for currencies adding \n to make bulkmarcimport.pl prettier Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 21:28:37 -06:00
Galen Charlton	8c60e82605	fixed variable masking warnings found by perl -w Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 20:23:59 -06:00
Galen Charlton	c2a0ed8077	item rework: replaced AddBiblioAndItems Replace C4::Biblio::AddBiblioAndItems with two things: * An option to C4::Biblio::AddBiblio to defer writing biblioitems.marc and biblioitems.marcxml. This option was created to give a significant speed boost to bulkmarcimport.pl, but is not recommended for general use. * C4::Items::AddItemBatchFromMarc This refactoring removes the need to have functions in C4::Biblio and C4::Items that call each other's private functions. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 16:26:16 -06:00
Galen Charlton	9d4d8897b2	item rework: various changes * Move CheckItemPreSave to C4::Items (from C4::Biblio) * Modified C4::Biblio::AddBiblioAndItems to use appropriate internal routines from C4::Items * Moved GetItemnumberFromBarcode to C4::Items * Removed duplicate C4::Biblio::_koha_new_items * Removed disused C4::Biblio::MARCitemchange Currently AddBiblioAndItems is a special routine that uses private subs from both C4::Biblio and C4::Items. This needs to be refactored. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 16:25:42 -06:00
Joshua Ferraro	5c23369af2	Fixing Database Definitions for Statuses PARTIAL Prior to this fix, the status fields had three 'off' values, NULL, "", and 0. I've reduced it to two in the db, removing the option for NULL, and setting the default value to 0, however, we need to verify that we don't ever write out as "" as this needlessly complicates the indexing process, critical for searching or limiting by status (e.g., availability). Also, queries that attempt to write a NULL value to one of these fields will fail (based on my tests). This patch includes the following changes: * Updated the database definition for notforloan, damaged, itemlost, and wthdrawn in kohastructure.sql to forbid NULL and default to 0; MySQL can't forbid other values (such as empty ""), so this has to be handled at the application layer and REQUIRES further patching. * Fixed the 'limit by availability' query node in Search.pm to use a much less confusing definition of 'available' * Added code to set values to 0 where they are NULL or empty ( "" ) for notforloan, damaged, itemlost or wthdrawn in both the MARC and the items table: * Biblio.pm -> AddBiblioAndItems * catalogue/updateitem.pl * SEE NOTE BELOW, REQUIRES UPDATE TO THE REST OF KOHA'S ITEM MGT! * Removed code in bulkmarcimport.pl that sets notforloan status depending on item-level or bib-level itemtype -- that flag is designed to be set only to override the notforloan setting for the item's (or bib's, depending on the syspref) assigned itemtype (it doesn't need to override to 'for loan', only to 'not for loan'). added $dbh->do("truncate zebraqueue"); when operation is 'delete' * I updated some notes in catalogue/updateitem.pl as to why ModItem can't be used -- we don't have _a_ place where we can change the item and marc :/ I've tested the following: bulkmarcimport.pl..........................MARC/items OK Staged Records Import......................NOT OK updateitem.pl (via moredetail.pl)..........MARC/items OK circulation.pl.............................NOT OK returns.pl.................................NOT OK addbiblio.pl...............................NOT OK additem.pl.................................NOT OK Basically, there isn't a single place to apply this patch that will update both item data and MARC data in one place ... a future patch needs to address this issue. Signed-off-by: Galen Charlton <galen.charlton@liblime.com> Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 16:23:04 -06:00
Chris Cormack	c7215e7a93	Escaping the $title in the regexes with \Q and \E to handle nested quantifiers Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 01:20:40 -06:00
Paul POULAIN	319a32b16e	rebuild_zebra : directories updated the unimarc stuff has been moved to marc_defs directory and the lang specific is in lang_defs Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-03 00:55:12 -06:00
Joshua Ferraro	554bbe1bda	s/Waited/Expected/ for serials statuses reformating rebuild_nozebra.pl indexes Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2008-01-01 12:59:28 -06:00
Joshua Ferraro	dd3f557f53	fixing nomenclature on files in misc/, adding a few new utilities Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-30 12:13:34 -06:00
Joshua Ferraro	c6ddddad98	adding a new option, -w, which disables shadow indexing for the current batch (faster indexing of large sets where ACID isn't critical) Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-30 12:13:27 -06:00
Galen Charlton	3508933c66	bulkmarcimport: enable MARC-8 to UTF-8 conversion Enabled automatic conversion of MARC-8 records to UTF-8. Record is converted if its Leader/09 contains a blank and the -s (skip) option hasn't been supplied on the command-line. Any record that cannot be converted to UTF-8 is skipped. Also now use Unicode Normalization Form C (NFC) for records converted from MARC-8. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-25 09:08:38 -06:00
Galen Charlton	d426a91d0e	removed extraneous comments Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-25 09:08:35 -06:00
Galen Charlton	cb6cf680bc	improved error detection in AddBiblioAndItems Introduced new C4::Biblio function CheckItemPreSave, which checks for duplicate barcodes and invalid branch codes. Not yet sure whether this function needs to be exported or whether it will just be used internally to C4::Bibli. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-25 09:08:34 -06:00
Galen Charlton	6b49df4c3f	removed superfluous comments Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-25 09:08:31 -06:00
Galen Charlton	7d47666f7e	bulk MARC record import - speed improved Changes to improve speed of MARC bib and item imports: [1] Turn off autocommit and commit database transactions in larger batches. [2] Introduce a new C4::Biblio function (AddBiblioAndItems) to combine AddBiblio and AddItems -- this is faster because we are not parsing the MARC XML of the biblio every time we add an item. [3] Introduce FasterTransformMarcToKoha, which is much faster than TransformMarcToKoha. The new version, which will replace the old one once it has been fully tested, scans through each field in the MARC record just once, instead of potentially dozens of times. [4] Remove code in bulkmarcexport that moved the item tags to separate MARC::Record objects. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-25 09:08:28 -06:00
Galen Charlton	4609608ccc	allow use of older version of File::Temp Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-22 22:58:12 -06:00
Galen Charlton	93beb943c0	bug 1661: rebuild_zebra.pl changes [1] Use File::Temp to create and manage export directory if -d is not specified. [2] Added usage message. [3] Code that attempts to fix up Zebra configuration files changed so that it is invoked only if --munge-config option is supplied; this code will ultimately either be removed or moved to a separate script -- the sorts of errors that it tries to fix should no longer be appearing in a standard install. [4] Fixed Win32 portability problem when removing temporary directory. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-20 19:19:43 -06:00
Henri-Damien LAURENT	bdade9bc9d	Adding rebuild field 100$a for Unimarc Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-20 18:40:43 -06:00
Henri-Damien LAURENT	c9fb20928b	Generating index for authorities on AUTHtypecode from table auth_header Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-12-20 18:30:50 -06:00
Galen Charlton	b8a58c4934	installer: command-line scripts improve finding C4 modules Command-line scripts now use a new SCRIPT_DIR/kohalib.pl to put installed location of Koha's Perl modules into @INC.	2007-12-17 09:13:54 -06:00

1 2 3

120 commits