Koha/misc/migration_tools
Galen Charlton 60a98d258a IMPORTANT - refactor MARC character set handling
* IsStringUTF8ish - determine if scalar contains a string in UTF8
* MarcToUTF8Record - convert MARC blob or MARC::Record to UTF8
* SetMarcUnicodeFlag - set appropriate MARC21 or UNIMARC field to
  indicate that record is in UTF-8.

Design points of this module include:

* No dependencies on other C4 modules, making it easier to add
  more test cases
* All character conversion code in one place
* Single entry point for doing a character conversion on a
  MARC record
* Capture of errors and warnings produced by Text::Iconv
  and MARC::Charset
* Start of support for guessing the source character set of
  a MARC record.

Several functions were moved from other scripts
or modules to C4::Charset:

* C4::Koha->FixEncoding (expanded and renamed
  MarcToUTF8Record)
* C4::Koha->char_decode5426
* fMARC8ToUTF8 from bulkmarcimport.pl (renamed
  _marc_marc8_to_utf8)

Several batch jobs were adjusted to use MarcToUTF8Record instead of
FixEncoding.

Signed-off-by: Chris Cormack <crc@liblime.com>
Signed-off-by: Joshua Ferraro <jmf@liblime.com>
2008-02-03 07:23:56 -06:00
..
22_to_30 Adding rebuild field 100$a for Unimarc 2007-12-20 18:40:43 -06:00
build6xx.pl C4 and misc: permissions fixes 2007-12-13 19:00:34 -06:00
buildCOUNTRY.pl synch'ing 2.2 and head 2005-05-04 08:58:30 +00:00
buildEDITORS.pl rel_3_0 moved to HEAD 2007-03-09 14:52:58 +00:00
buildLANG.pl synch'ing 2.2 and head 2005-08-04 14:10:52 +00:00
bulkauthimport.pl bulkauthimport fix 2008-01-22 07:20:28 -06:00
bulkmarcimport.pl IMPORTANT - refactor MARC character set handling 2008-02-03 07:23:56 -06:00
merge_authority.pl fixing nomenclature on files in misc/, adding a few new utilities 2007-12-30 12:13:34 -06:00
rebuild_nozebra.pl fixed variable masking warnings found by perl -w 2008-01-03 20:23:59 -06:00
rebuild_zebra.pl rebuild_zebra : directories updated 2008-01-03 00:55:12 -06:00