Commit graph

423 commits

Author SHA1 Message Date
kados
a1d59e6633 New MARC21 collection.abs based on marc21.abs 2006-02-16 00:46:10 +00:00
kados
1f2014e73f Fixing zebra.cfg for usmarc 2006-02-14 22:49:44 +00:00
kados
2017a41f0b adding some config stuff for usmarc 2006-02-14 22:49:08 +00:00
tipaul
f74823bf1b OK, this time it seems to work. The last blocking problem was... a space in
recordId: (bib1,Identifier-standard) just after the comma. Adam agreed it was a bug, and it should be solved soon. But now we are aware, we can avoid putting the space !

In this commit you have all what is needed to setup a working zebra DB in Unimarc :
* collection.abs is UNIMARC specific and must be rewritten for MARC21, in marc21 directory
* pdf.properties is to be copied unmodified in the marc21 directory (can also be put somewhere else)
* rebuild_zebra.pl is SLOW, but 1 step reindexing tool, using ZOOM
* rebuild_zebra_idx is FAST, but 2 step reindexing tool, and does not use zebra. run it, it will create all biblios XML files in /zebra/biblios directory, then zebraidx update biblios in your zebra directory
* zebra.cfg is the zebra config file ;-)
* test_cql2rpn.pl is a script that will query the database and show the results. Works for me, just change the query at the beginning to get answers you expect.

What has to be done :
* benchmarking : it seems the zebraidx update is faster than lightning (400biblios/sec : 10 000biblios in 25seconds), while ZOOM indexing is slow (something like 25biblios/second) More benchmarking could be done.
* completing collection.abs for UNIMARC. I'll take care of it.
* modifying Biblio.pm to use ZOOM instead of the "zebraidx through exec" running actually. I'll take care of it also.
* modify the search API & tools & screens. I'll let the ball to someone else (chris ?) for this. I agree SearchMarc.pm can be dropped and replaced by something else (maybe a new-and-clean Search.pm package)
2006-02-09 10:59:34 +00:00
tipaul
369ee65d94 new version of rebuild_zebra. Should work with Perl-ZOOM, but DOES NOT WORK for me.
I get  :
ZOOM error 10002 "Encoding failed" from diag-set 'ZOOM'

help expected from indexdata...
2006-01-10 17:03:32 +00:00
tipaul
d8e4043f88 some modifs in zebra unimarc config files 2006-01-10 17:01:55 +00:00
tipaul
d5938493d7 synch'ing head and rel_2_2 (from 2.2.5, including npl templates)
Seems not to break too many things, but i'm probably wrong here.
at least, new features/bugfixes from 2.2.5 are here (tested on some features on my head local copy)

- removing useless directories (koha-html and koha-plucene)
2006-01-06 16:39:37 +00:00
kados
8b2316640b Fixed wrong path for tab file. 2005-12-11 04:09:56 +00:00
kados
efbd8b407c Fixing wrong mapping for biblionumber should be 090$c not 090$9 2005-12-11 04:09:29 +00:00
kados
67134b5feb Adding usmarc zebra configuration files. 2005-12-11 00:57:08 +00:00
tipaul
e82eca42c9 see mail on koha-devel : code cleaning on Search.pm + normalizing API + use of biblionumber everywhere (instead of bn, biblio, ...) 2005-09-22 10:01:45 +00:00
tipaul
11337d7ef4 minor change in UNIMARC zebra.cfg 2005-08-12 13:58:08 +00:00
tipaul
dba37f38e7 This script can be use to rebuild the zebra DB. It stores all koha MARC records in iso2709, in the bilbios directory. After that, you just have to "zebraidx update biblios"
I tried on a 9900 DB, here are the results :

[paul@bureau migration_tools]$ ./rebuild_zebra.pl -c
9900
9903 MARC record done in 37.9104120731354 seconds

[paul@bureau zebra]$ zebraidx update biblios
<snip>
18:31:24-11/08 zebraidx(20348) [log] Iterations . . . 144575
18:31:24-11/08 zebraidx(20348) [log] Distinct words .  39891
18:31:24-11/08 zebraidx(20348) [log] Updates. . . . .     46
18:31:24-11/08 zebraidx(20348) [log] Deletions. . . .      2
18:31:24-11/08 zebraidx(20348) [log] Insertions . . .  39843
18:31:24-11/08 zebraidx(20348) [log] zebra_register_close p=0x8104cf8
18:31:25-11/08 zebraidx(20348) [log] Records:    9887 i/u/d 9881/6/0
18:31:25-11/08 zebraidx(20348) [log] user/system: 531/145
18:31:25-11/08 zebraidx(20348) [log] zebra_stop
18:31:25-11/08 zebraidx(20348) [log] zebraidx times: 11.33  5.31  1.45
2005-08-11 16:35:54 +00:00
tipaul
9a508fe3df Playing with the zebra...
* go to koha cvs home directory
* in misc/zebra there is a unimarc directory. I suggest that marc21 libraries create a marc21 directory
* put your zebra.cfg files here & create your database.
* from koha cvs home directory, ln -s misc/zebra/marc21 zebra (I mean create a symbolic link to YOUR zebra directory)
* now, everytime you add/modify a biblio/item your zebra DB is updated correctly.

NOTE :
* this uses a system call in perl. CPU consumming, but we are waiting for indexdata Perl/zoom
* deletion still not work
* UNIMARC zebra config files are provided in misc/zebra/unimarc directory. The most important line being :
in zebra.cfg :
recordId: (bib1,Local-number)
storeKeys:1

in .abs file :
elm 090            Local-number            -
elm 090/?          Local-number            -
elm 090/?/9        Local-number            !:w

(090$9 being the field mapped to biblio.biblionumber in Koha)
2005-08-11 16:12:47 +00:00
tipaul
c52e5b61dd synch'ing 2.2 and head 2005-08-04 14:10:52 +00:00
tipaul
86438403bd some parameter files for zebra for UNIMARC. Just for testing & sharing purposes for instance. 2005-06-27 13:13:17 +00:00
slef
30a17190d9 perldoc and prototype updates 2005-06-24 12:19:34 +00:00
tipaul
6b0ee5525b synch'ing 2.2 and head 2005-06-20 14:10:00 +00:00
tipaul
4cdb643e3e new version, doing tag by tag. Should be slower, but requires a lot less memory 2005-06-01 18:55:08 +00:00
tipaul
b79bc88f4b new version, doing tag by tag. Should be slower, but requires a lot less memory 2005-06-01 18:47:38 +00:00
slef
113a1f8f50 possible fix for shell characters in kohaadmin user and database names 2005-05-27 13:33:42 +00:00
tipaul
41c209a4bd proof of concept for inverted index tables for search
how it works :
* create the table marc_Tword with the following structure :
CREATE TABLE `marc_Tword` (
  `word` varchar(80) NOT NULL default '',
  `usedin` text NOT NULL,
  `tagsubfield` varchar(4) NOT NULL default '',
  PRIMARY KEY  (`word`,`tagsubfield`)
) TYPE=MyISAM;
* open a console & type export PERL5LIB & export KOHA_CONF as usual.
* fill this table with misc/build_marc_Tword.pl. Warning, this script uses a very very consumming but very fast method to fill the table : it does everything in memory, then write everything. Another method is provided (& commented), but it's 100x times slower (really !)
* open opac-search.pl and replace use C4::SearchMarc; by use C4::SearchMarcTest; as the API hasn't changed, it will work immediatly.
* go to opac-search (advanced search) & search whatever you want. Should work fine.

LIMITS :
* build_marc_Tword has problem with extended chars (accented ones mainly). So don't be afraid if you get sql errors. They are not a problem for a POC
* search works always order by title, whatever you choose.
* search works only search WORDA and WOARDB, not yet WORDA or WORDB or WORDA except WORDB.
2005-05-27 09:30:23 +00:00
tipaul
9d21434743 changing dewey to varchar 2005-05-06 15:56:06 +00:00
joshferraro
fd9122cdba produces a comma separated list of currently available reserves, with item and borrower details.
NOTE: it's NPL-specific because of hardcoded branch info -- this could be
fixed with a query for branches.
2005-05-04 14:01:45 +00:00
joshferraro
ce775cce9c Fixes priority of reserves it also e-mails a list of 'problem' reserves to me at the library 2005-05-04 13:58:35 +00:00
joshferraro
82b9f92bd9 set itemlost status to '2' ("long overdue") on items more than 90 days overdue. 2005-05-04 13:54:01 +00:00
joshferraro
8810f62ad4 Changed the status of a Juvinile patron to Adult on his/her 18th birthday 2005-05-04 13:52:03 +00:00
joshferraro
effd10ecdf Minor change to generalize the script 2005-05-04 13:45:18 +00:00
joshferraro
0bd7d912b8 Files relevant to LibLime's spellcheck and suggest features are placed
in this directory.
2005-05-04 13:41:36 +00:00
tipaul
64cd740d2b synch'ing 2.2 and head 2005-05-04 08:58:30 +00:00
tipaul
93ff09d081 merging 2.2 branch with head. Sorry for not making it before, many many commits done here 2005-03-01 13:40:35 +00:00
acli
0fbc036392 Some changes from rel_2_2:
Make the generated pot file (i.e., result of "create") look more "real",
but using msgmerge to reformat the output

Script failed to create intermediate directories if the directory of the
target does not exist and the parent of that directory does not exist
either. This should fix that.
2005-02-04 03:49:33 +00:00
tipaul
0eb7623640 *** empty log message *** 2005-01-05 14:41:49 +00:00
tipaul
056c211411 minor update just before releasing official 2.2.0 2005-01-05 14:31:06 +00:00
tipaul
8664309215 updating doc & adding a script to rebuild officially supported languages 2005-01-03 16:30:13 +00:00
tipaul
ae197cb5dd removing .po in the wrong place (is in po/ now) 2005-01-03 16:12:28 +00:00
tipaul
51e204fa23 moving bulkmarcimport script to migration_tools directory 2005-01-03 15:25:50 +00:00
tipaul
cd6f87a689 Auto-build LANG authorized values 2005-01-03 12:59:49 +00:00
tipaul
453a01a391 adding an index 2005-01-03 12:55:22 +00:00
tipaul
f02a496ab0 * rolling back lock_tables_privs = 'Y' during install stage (too many problems) The option is shown in bulkmacimport instead. 2005-01-03 11:11:10 +00:00
acli
2dfe2c917a *** empty log message *** 2004-12-30 10:29:28 +00:00
acli
6fe93e5363 *** empty log message *** 2004-12-30 09:37:19 +00:00
acli
80dbe83160 Preliminary, incomplete update 2004-12-30 09:20:41 +00:00
acli
b8604171cb Some progress 2004-12-30 08:01:43 +00:00
acli
f04e3656d2 Oops, specified the wrong English template directory
The css translation file is only about 33% done.
2004-12-30 07:15:53 +00:00
acli
2cce962b98 Preliminary update (based on old default_opac_zh_TW.po file) 2004-12-30 07:05:04 +00:00
acli
78c09f8e7b Recommend relative paths for English template location, to make PO files
somewhat easier to read
2004-12-30 06:58:19 +00:00
acli
5c84b3411e Minor documentation correction 2004-12-30 06:53:13 +00:00
acli
52e5918b91 Minor documentation updates 2004-12-30 06:48:19 +00:00
acli
9638aa4638 Moved the old po files to the new standard location 2004-12-30 06:17:21 +00:00