Re: Convert Hebrew Text Between Codesets

From: Tzafrir Cohen (tzafrir_at_nonexisting.hamakor.org.il)
Date: Wed 16 Jun 2004 - 13:01:35 IDT


On Wed, Jun 16, 2004 at 10:58:34AM +0200, Ilan Aisic wrote:
> Anyone aware of a utility/program to convert hebrew text between the different codesets and representations?

iconv(1) is a general-purpose charset convertor.

recode is aanother one. More tolerant of its input.

>
> In particular, I'm interested in changing mutlibyte Hebrew to and from
> HTML characters.
> where:
> <Alef><Beit>...<Taf> would be:
> Mulitbye hebrew in hex (e0 is Alef, 2e is '.'):
> e0e12e2e2efa
> The same in HTML ("&#1488;" is Alef, '.' is '.'):
> &#1488;&#1489;...&#1514;

For such html conversions, use a web browser.

$ echo '&#1488;&#1489;...&#1514;' |lynx -dump -stdin

   àá...ú

If this does not produce the required output, check lynx's various
charset settings.

A different approach is a simple sed script.
I once wrote a simple perl script for converting between different
Hebrew representations in LaTeX. You can find in the the 'util/'
subdirectory of the ivritex source distribution.

-- 
Tzafrir Cohen                       +---------------------------+
http://www.technion.ac.il/~tzafrir/ |vim is a mutt's best friend|
mailto:tzafrir_at_technion.ac.il       +---------------------------+
=================================================================
To unsubscribe, send mail to linux-il-request_at_linux.org.il with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail linux-il-request_at_linux.org.il


This archive was generated by hypermail 2.1.7 : Wed 16 Jun 2004 - 13:16:14 IDT