nmh-workers
[Top] [All Lists]

Re: mhfixmsg character set conversion

2022-02-09 19:27:45
I think Steven says he's running Manjaro which is an Arch Linux spin off, and
Archers prefer to pass on upstream code unaltered where possible.

Except that par has been altered?

Not by me, at any rate.


I use this version, unaltered:
$ par version
par 1.53.0

   $ par version
   1.52-i18n.4

   $ pacman -Qi par
   Name            : par
   Version         : 1.52-8
   Description     : Paragraph reformatter
   Architecture    : x86_64
   URL             : http://www.nicemice.net/par/
   Licenses        : custom
   Groups          : None
   Provides        : None
   Depends On      : None
   Optional Deps   : None
   Required By     : None
   Optional For    : None
   Conflicts With  : None
   Replaces        : None
   Installed Size  : 98.90 KiB
   Packager        : Unknown Packager
   Build Date      : Mon 06 Jan 2020 12:53:58 AM
   Install Date    : Mon 06 Jan 2020 12:54:19 AM
   Install Reason  : Explicitly installed
   Install Script  : No
   Validated By    : None


Do you have any idea where the following warning comes from?

My money's on par(1) given

    
https://inbox.vuxu.org/voidlinux-github/20191027084150.NZqC6wHlZkyQJ7AkACI7juvuCp0AD_u_IIwftMlDmKs@z/T/

That sure looks like it.

Perhaps, but it isn't.


Steven, to confirm, try

    egrep -l 'seems not configured' /usr/bin/par

   $ egrep -l 'seems not configured' /usr/bin/par
   $ echo $?
   1


Steven, I would try removing par from the end of your mhbuild-convert-text/html
entry.

The problem with that is that it's not there in the first place:

   $ grep par ~/.mh_profile
   $ echo $?
   1

In fact,

   $ grep mhbuild ~/.mh_profile
   mhbuild:        -maxunencoded 500

   $ grep html ~/.mh_profile
   #: mhshow-show-text/html:       %pmime_helper %F %s %{name}
   mhshow-show-text/html:       html_to_text %F | cat -s
   mhshow_in_browser-show-text/html:  %pmime_helper %F %s "%{name}"
   mhfixmsg-format-text/html:      html_to_text < '%F'

   $ grep -w par ~/bin/html_to_text
   $ echo $?
   1

I'll append the full text of the script in case you'd like to see it, but
I'm pretty sure it's not implicated here.

In fact there are no invocations of par anywhere in my ~/bin directory; the
only occurrences of the word are in some old data files:

   $ grep -lrisw par ~/bin
   /home/smw/bin/mars/reports/data/FMARS/jrn/text/20070729
   /home/smw/bin/mars/reports/data/FMARS/jrn/text/20070718
   /home/smw/bin/mars/reports/data/FMARS/jrn/text/20070719
   /home/smw/bin/mars/reports/data/FMARS/jrn/raw/20070719

...and these files have nothing to do with nmh in any way.

I'm reminded of an old Jackie Mason routine, in which he describes a visit
to a psychiatrist.  After a fair bit of dialog which I won't repeat here,
this snippet occurs:

   psychiatrist:  I see your problem.  You hate your sister.

   Jackie Mason:  I haven't got a sister.

   psychiatrist:  I can't help you if you won't cooperate.

...so I feel a need to apologize for being uncooperative :-/, but I'm at a
loss here.

     - Steven


8<-----------------------------   cut here   ---------------------------->8
#!/bin/sh
#
#  html_to_text -- convert HTML to plain text
#
#  Steven Winikoff
#  2010/04/28
#
#  note:  this script uses links
#         [ http://atrey.karlin.mff.cuni.cz/~clock/twibright/links ]
#         because it seems to be the only program available which
#         renders tables reasonably
# 
#         alternatives (lynx and vilistextum) both show tables one
#         column at a time instead of row by row!
#
#
#         UPDATE, 2018/08/22:
#
#         switched from links to elinks, because links fails when invoked
#         via procmail if the source HTML code contains invalid characters
#         (as in a file in Windows character encoding which isn't labelled
#         as such) -- the symptom is that a properly structured message
#         will be converted into one which has an empty HTML part, which
#         is a problem if (and only if :-) the HTML part needs to be viewed
#         in a graphical browser (see ~smw/bin/view_html_message, as called
#         from ~smw/bin/mhread)
#
#--------------------------------------------------------------------------

if [ ! -z "${1}" ]
then
   html="${1}"
else
   # links (as of April 2010, at least) refuses to read standard 
   # input with -dump

   html="/tmp/html_to_text.`date +%Y%m%d.%H%M%S`.$$"
   trap "rm -f ${html} >/dev/null 2>&1; exit 1" 1 2 3 15
   cat > ${html}
fi

elinks -force-html -dump -dump-charset utf-8  ${html} | sed 's/^   //;s/[   
]*$//'
##     | cat -s

#------------------------------------------------------------------------
#  w3m -I utf8 -T text/html -dump -s -o display_link_number=1 \
#      -o color=1 -graph ${html} | sed 's/^   //;s/[   ]*$//'
#------------------------------------------------------------------------
#  elinks -force-html -dump -dump-charset utf-8  ${html} | \
#         sed 's/^   //;s/[   ]*$//'
#------------------------------------------------------------------------
#  links -force-html -dump -codepage utf-8 ${html} | sed 's/^   //;s/[   ]*$//'
#------------------------------------------------------------------------
#  links -force-html -no-g -width 75 -html-margin 0 -codepage us-ascii \
#        -html-tables 1 -html-table-order 0 -html-frames 1 \
#        -html-numbered-links 1 -dump ${html} | sed 's/[        ]*$//'
#------------------------------------------------------------------------

[ -z "${1}" ] && rm -f ${html} >/dev/null 2>&1
8<-----------------------------   cut here   ---------------------------->8
-- 
___________________________________________________________________________
Steven Winikoff      |
Montreal, QC, Canada | "It's amazing how much 'mature wisdom'
smw@smwonline.ca     |  resembles being too tired."
http://smwonline.ca  |
                     |                      - Robert Heinlein

<Prev in Thread] Current Thread [Next in Thread>