perl-unicode

Re: Caseless and accentless string comparisons

2003-05-13 19:30:05


On Tue, May 13, 2003 at 11:32:57PM +0900, SADAHIRO Tomoyuki wrote:

I'm sorry, reasons why allkeys.txt is not included in the package are:

(1) its huge file size.

Well it is less than 1 mb...  But if size is a big problem then I
could provide it as a separate "module".  Basically it would be
Unicode::Collate::allkeys and allkeys.pm would be as simple as
possible to tell the perl tools that the module was installed...

I think the Makefile.pm should offer to get the latest version from
the unicode web site though to make upgrading the file easy.  Although
since it appears not to change much that is probably overkill and I
can just keep the pm in sync with their file.

If this seems reasonable I can implement it if you want.
 
(2) overwrite of updated allkeys.txt may break
    someone's tailoring for old allkeys.txt.

Ah.. valid point.  Perhaps a 2 step approach would make sense?  You
could have allkeys.txt and optionally customkeys.txt.  Then the
ordering would be:
  - User specified always wins
  - If present use customkeys.txt for site tailoring
  - Fall back to allkeys.txt

That way we can safely upgrade allkeys.txt.  If this seems reasonable
let me know and I can provide a patch.

(3) I'm not sure of the way to install a file
    with a suffix other than .pm
    (hmmm, is it ok only by writing MANIFEST like this?)
[snip]

Well the sample keys.txt ends up installed so that must be right.

Thanks for all the help (and a cool module).  I am a little reluctant
to make my module use this until I can get the allkeys.txt as an
install dependency.

                -ben