Koha-community/Koha - Koha: The world's first free and open source library system

Author	SHA1	Message	Date
Paul POULAIN	1cd11f4d54	fixes in NoZebra search & indexing - the quotemeta was wrong (and introduced some bugs in diacritics) - fixing some bugs that appear only sometimes : the union was done including weight, which is wrong & resulted in missing some results (when various weighting) Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-31 05:53:36 -05:00
Paul POULAIN	fa26bcc037	rebuild_unimarc_100 : better handling of unusual cases If 100$a repeated, the scripts failed to handle that correctly Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-24 17:08:56 -05:00
Paul POULAIN	cd8a565a6a	temp Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-24 17:08:40 -05:00
Paul POULAIN	837e5c5e94	less verbose Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-24 17:06:36 -05:00
Joshua Ferraro	9d29ce5d58	improvements to zebra configuration files Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-21 19:14:12 -05:00
Paul POULAIN	1ac38782a1	#1474 : Bulkmarcimport croaks when Log is ON set to 0 and restore at the end of the import Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-11 14:53:59 -05:00
Paul POULAIN	057d654a5b	skipping wrong XMLs when rebuilding nozebra indexes Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-09 19:11:47 -05:00
Paul POULAIN	49ef1df969	Adding a new option to rebuildzebra : noxml This option uses the iso2709 version of the MARC record instead of the XML one (biblioitems.marc vs biblioitems.marcxml) No change if the parameter is not set. Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-09 19:07:36 -05:00
Joshua Ferraro	827d27111f	adding barcode index Signed-off-by: Chris Cormack <crc@liblime.com> Signed-off-by: Joshua Ferraro <jmf@liblime.com>	2007-10-06 21:46:02 -05:00
Paul POULAIN	375d2f1158	(minor) updating doc & removing warn Signed-off-by: Chris Cormack <crc@liblime.com>	2007-10-03 14:57:12 -05:00
Chris Catalfo	502487e2ba	Added basic MARC21 index definitions. Signed-off-by: Chris Cormack <crc@liblime.com>	2007-10-02 15:38:32 -05:00
Paul POULAIN	6f7efca7e1	BUGFIX for browser and nozebra tables - adding browser and nozebra table definition to kohastructure & updatedatabase - bumping to 3.00.00.005 Signed-off-by: Chris Cormack <crc@liblime.com>	2007-10-02 04:35:49 -05:00
Joshua Ferraro	ae34e8f45a	changing the name of the zebra password file to passwd Signed-off-by: Chris Cormack <crc@liblime.com>	2007-10-01 23:14:47 -05:00
Joshua Ferraro	b87d4924b9	commenting out set_service_options, but also removes commit op Signed-off-by: Chris Cormack <crc@liblime.com>	2007-10-01 17:40:31 -05:00
Ryan Higgins	c44efe7b84	fix bad call to GetMarcFromKohaField in bulkmarcimport, and add -fk param, allowing disabling of fk constraints during import. Signed-off-by: Chris Cormack <crc@liblime.com>	2007-09-30 21:16:50 -05:00
Paul POULAIN	0d7a4aafd0	BUGFIX : NoZebra indexing was wrong for accented words Signed-off-by: Chris Cormack <crc@liblime.com>	2007-09-26 05:28:37 -05:00
Paul POULAIN	623ac80330	BUGFIXES : 3 (marc_biblio, check biblionumber, ModMarcBiblio API) - use biblio instead of marc_biblio, - better check that biblionumber is correctly stored - fix an buggy API call when ModMarcBiblio Signed-off-by: Chris Cormack <crc@liblime.com>	2007-09-13 17:18:50 -05:00
Paul POULAIN	ec7bd0b2ff	(unimarc specific) BUGFIX : if 100$a exist but is not 35 char long, MARC::File::XML may fail So, add blanks if needed... Signed-off-by: Chris Cormack <crc@liblime.com>	2007-09-13 17:17:56 -05:00
tipaul	1399945a75	eval() on getAuthority & getBiblio to avoid a script failure	2007-08-01 09:20:03 +00:00
toins	5e7b171686	adding an eval to don't die if an error occurs	2007-07-19 09:48:22 +00:00
tipaul	23427c51b9	some fixes (and only fixes)	2007-06-15 13:44:44 +00:00
toins	6dfb0dca36	next if there is an error getting the biblio.	2007-06-11 15:22:59 +00:00
toins	4728830e34	it's faster to 'truncate' instead of using 'delete from'...	2007-06-08 09:41:14 +00:00
tipaul	5dd3f0229a	bugfixes (various), handling utf-8 without guessencoding (as suggested by joshua, fixing some zebra config files -for french but should be interesting for other languages-	2007-06-06 13:08:35 +00:00
btoumi	68bcf35387	delete space in beggining of the script to accept script launch	2007-05-25 10:00:54 +00:00
tipaul	0569dccd5f	some changes to default zebra config for better searches	2007-05-25 09:34:30 +00:00
tipaul	651b075197	small script to check XML parser. Remember that PurePerl Parser is buggued and can t handle utf8 correctly	2007-05-25 09:33:58 +00:00
tipaul	5ff7fcffa4	Bugfixes & improvements (various and minor) : - updating templates to have tmpl_process3.pl running without any errors - adding a drupal-like css for prog templates (with 3 small images) - fixing some bugs in circulation & other scripts - updating french translation - fixing some typos in templates	2007-05-22 09:13:54 +00:00
tipaul	ca201e36af	Koha NoZebra : - support for authorities - some bugfixes in ordering and "CCL" parsing - support for authorities <=> biblios walking Seems I can do what I want now, so I consider its done, except for bugfixes that will be needed i m sure !	2007-05-10 14:45:15 +00:00
tipaul	e1d907c688	various bugfixes on parameters modules + adding default NoZebraIndexes systempreference if it's empty	2007-05-04 16:24:08 +00:00
tipaul	3e85c9e97f	NoZebra SQL index management : * adding 3 subs in Biblio.pm - GetNoZebraIndexes, that get the index structure in a new systempreference (added with this commit) - _DelBiblioNoZebra, that retrieve all index entries for a biblio and remove in a variable the biblio reference - _AddBiblioNoZebra, that add index entries for a biblio. Note that the 2 _Add and _Del subs work only in a hash variable, to speed up things in case of a modif (ie : delete+add). The effective SQL update is done in the ModZebra sub (that existed before, and dealed with zebra index). I think the code has to be more deeply tested, but it works at least partially.	2007-05-02 16:44:31 +00:00
tipaul	4213b6ec98	improving NOzebra search : - changing nozebra table to have biblionumber,title-ranking; (; is the entry separator. Now, if a value is several times in an index, it is stored only once, with a higher ranking (the ranking is the number of times the word appeard for this index) - improving search to have ranking value (default order). The ranking is the sum of ranking of all terms. The list is ordered by ranking+title, from most to lower	2007-05-02 11:57:11 +00:00
hdl	097fef712a	Removing $dbh from GetMarcFromKohaField (dbh is not used in this function.)	2007-04-27 14:00:48 +00:00
tipaul	b53be9cdaf	Koha 3.0 nozebra 1st commit : the script misc/migration_tools/rebuild_nozebra.pl build the nozebra table, and, if you set NoZebra to Yes, queries will be done through zebra. TODO : - add nozebra table management on biblio editing - the index table content is hardcoded. I still have to add some specific systempref to let the library update it - manage pagination (next/previous) - manage facets WHAT works : - NZgetRecords : has exactly the same API & returns as zebra getQuery, except that some parameters are unused - search & sort works quite good - CQL parser is better that what I thought I could do : title="harry and sally" and publicationyear>2000 not itemtype=LIVR should work fine	2007-04-25 16:26:42 +00:00
tipaul	6b201757c1	some bugfixes for this script that automatically build zebra DB from default config files	2007-04-17 08:50:33 +00:00
tipaul	eba2552086	Code cleaning of Biblio.pm (continued) All subs have be cleaned : - removed useless - merged some - reordering Biblio.pm completly - using only naming conventions Seems to have broken nothing, but it still has to be heavily tested. Note that Biblio.pm is now much more efficient than previously & probably more reliable as well.	2007-03-29 16:45:53 +00:00
tipaul	a481fad4b7	Code cleaning : == Biblio.pm cleaning (useless) == * some sub declaration dropped * removed modbiblio sub * removed moditem sub * removed newitems. It was used only in finishrecieve. Replaced by a Koha2Marc+AddItem, that is better. * removed MARCkoha2marcItem * removed MARCdelsubfield declaration * removed MARCkoha2marcBiblio == Biblio.pm cleaning (naming conventions) == * MARCgettagslib renamed to GetMarcStructure * MARCgetitems renamed to GetMarcItem * MARCfind_frameworkcode renamed to GetFrameworkCode * MARCmarc2koha renamed to TransformMarcToKoha * MARChtml2marc renamed to TransformHtmlToMarc * MARChtml2xml renamed to TranformeHtmlToXml * zebraop renamed to ModZebra == MARC=OFF == * removing MARC=OFF related scripts (in cataloguing directory) * removed checkitems (function related to MARC=off feature, that is completly broken in head. If someone want to reintroduce it, hard work coming...) * removed getitemsbybiblioitem (used only by MARC=OFF scripts, that is removed as well)	2007-03-29 13:30:31 +00:00
tipaul	f8e9fb6445	rel_3_0 moved to HEAD (introducing new files)	2007-03-09 15:34:17 +00:00
tipaul	a3999812e6	rel_3_0 moved to HEAD	2007-03-09 14:52:58 +00:00
thd	ad657e71eb	For MARC 21, instead of deleting the whole subfield when a character does not translate properly from MARC8 into UTF-8, only the problem characters are deleted.	2006-09-01 17:11:53 +00:00
toins	eac83ccd45	Head & rel_2_2 merged	2006-07-04 15:02:42 +00:00
rangi	10b2315eb3	Fixing the problem that all items were getting biblioitem=1 set	2006-04-01 22:10:50 +00:00
kados	44b4d37b54	removed Zconns, no need for them anymore with new Context.pm setup	2006-02-27 01:06:30 +00:00
kados	fafe0896d6	minor bugfix with 'commit' option	2006-02-25 23:40:59 +00:00
kados	77abbe2caf	A bulkmarcimport.pl that is based on the new Biblio.pm Zebra routines. It now responds to: -n : the number of records to import. -commit : the number of records to wait before performing a 'commit' operation ALSO: IMPORTANT: I took out the char_encoding as this should be handled by MARC::File::XML now, unless I'm mistaken.	2006-02-25 21:53:48 +00:00
tipaul	f74823bf1b	OK, this time it seems to work. The last blocking problem was... a space in recordId: (bib1,Identifier-standard) just after the comma. Adam agreed it was a bug, and it should be solved soon. But now we are aware, we can avoid putting the space ! In this commit you have all what is needed to setup a working zebra DB in Unimarc : * collection.abs is UNIMARC specific and must be rewritten for MARC21, in marc21 directory * pdf.properties is to be copied unmodified in the marc21 directory (can also be put somewhere else) * rebuild_zebra.pl is SLOW, but 1 step reindexing tool, using ZOOM * rebuild_zebra_idx is FAST, but 2 step reindexing tool, and does not use zebra. run it, it will create all biblios XML files in /zebra/biblios directory, then zebraidx update biblios in your zebra directory * zebra.cfg is the zebra config file ;-) * test_cql2rpn.pl is a script that will query the database and show the results. Works for me, just change the query at the beginning to get answers you expect. What has to be done : * benchmarking : it seems the zebraidx update is faster than lightning (400biblios/sec : 10 000biblios in 25seconds), while ZOOM indexing is slow (something like 25biblios/second) More benchmarking could be done. * completing collection.abs for UNIMARC. I'll take care of it. * modifying Biblio.pm to use ZOOM instead of the "zebraidx through exec" running actually. I'll take care of it also. * modify the search API & tools & screens. I'll let the ball to someone else (chris ?) for this. I agree SearchMarc.pm can be dropped and replaced by something else (maybe a new-and-clean Search.pm package)	2006-02-09 10:59:34 +00:00
tipaul	369ee65d94	new version of rebuild_zebra. Should work with Perl-ZOOM, but DOES NOT WORK for me. I get : ZOOM error 10002 "Encoding failed" from diag-set 'ZOOM' help expected from indexdata...	2006-01-10 17:03:32 +00:00
tipaul	d5938493d7	synch'ing head and rel_2_2 (from 2.2.5, including npl templates) Seems not to break too many things, but i'm probably wrong here. at least, new features/bugfixes from 2.2.5 are here (tested on some features on my head local copy) - removing useless directories (koha-html and koha-plucene)	2006-01-06 16:39:37 +00:00
tipaul	dba37f38e7	This script can be use to rebuild the zebra DB. It stores all koha MARC records in iso2709, in the bilbios directory. After that, you just have to "zebraidx update biblios" I tried on a 9900 DB, here are the results : [paul@bureau migration_tools]$ ./rebuild_zebra.pl -c 9900 9903 MARC record done in 37.9104120731354 seconds [paul@bureau zebra]$ zebraidx update biblios <snip> 18:31:24-11/08 zebraidx(20348) [log] Iterations . . . 144575 18:31:24-11/08 zebraidx(20348) [log] Distinct words . 39891 18:31:24-11/08 zebraidx(20348) [log] Updates. . . . . 46 18:31:24-11/08 zebraidx(20348) [log] Deletions. . . . 2 18:31:24-11/08 zebraidx(20348) [log] Insertions . . . 39843 18:31:24-11/08 zebraidx(20348) [log] zebra_register_close p=0x8104cf8 18:31:25-11/08 zebraidx(20348) [log] Records: 9887 i/u/d 9881/6/0 18:31:25-11/08 zebraidx(20348) [log] user/system: 531/145 18:31:25-11/08 zebraidx(20348) [log] zebra_stop 18:31:25-11/08 zebraidx(20348) [log] zebraidx times: 11.33 5.31 1.45	2005-08-11 16:35:54 +00:00
tipaul	c52e5b61dd	synch'ing 2.2 and head	2005-08-04 14:10:52 +00:00
tipaul	64cd740d2b	synch'ing 2.2 and head	2005-05-04 08:58:30 +00:00
tipaul	93ff09d081	merging 2.2 branch with head. Sorry for not making it before, many many commits done here	2005-03-01 13:40:35 +00:00
tipaul	51e204fa23	moving bulkmarcimport script to migration_tools directory	2005-01-03 15:25:50 +00:00
tipaul	cd6f87a689	Auto-build LANG authorized values	2005-01-03 12:59:49 +00:00

1 2 3 4 5

204 commits