perl-unicode

Re: Encode; Should we aggregate all EUCs?

2002-02-05 05:44:45
On Tue, Feb 05, 2002 at 08:38:28AM +0000, Nick Ing-Simmons wrote:
Dan Kogai <dankogai(_at_)dan(_dot_)co(_dot_)jp> writes:

Perhaps we make "Build CJK encodings?" a Configure question?
We could determine default based on locale, or (as I once
did for a UK/USA paper size choice) by TZ.

107853 bytes (112%) saved spotting duplicates

Probably worth keeping.

22801 bytes (23.6%) saved using substrings

That is where the time goes - there is a loop which uses index()
on all existing strings to see if it can re-use one.
It saves 22K but is that worth while?

Then surely this extra searching becomes the configure question?

  Try harder to compress CJK encodings (this will slow your build considerably)?
  [no]


Unless we find a more efficient algorithm to search for common substrings.

Nicholas Clark
-- 
EMCFT http://www.ccl4.org/~nick/CV.html