Hebrew filenames from a Windows (XP) zip file.

From: Amir Hardon (hardon_at_nonexisting.hamakor.org.il)
Date: Wed 25 Aug 2004 - 09:57:43 IDT


I'm trying to extract a zip file with Hebrew file names that was created with
WinZip on a Windows XP machine.
There seems to be an encoding problem, but a weird one.

Just to test the encoding, I listed the file names into a text file ('unzip
-l > file.txt') and tried to convert it to different encodings using iconv.
But iconv always failed (no matter which encoding I tried),
with the following message:
iconv: illegal input sequence at position 112
The first byte that is supposed to be Hebrew is at position 112;
its value is 0xEA, which is "Kaf sofit" in ISO-8859-8.
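
A quick way to see what a byte such as 0xEA means under different encodings is to decode it under each candidate. The sketch below is only an illustration: the helper name try_decodings and the two-byte sample (0xEA followed by 0xEB) are assumptions, not taken from the real listing. It also shows one possible, unconfirmed reason for the iconv error: if the input is treated as UTF-8 (for example because no source encoding was given and the locale is UTF-8), the very first of these bytes is already an illegal sequence.

```python
# Hypothetical helper: decode the same raw bytes under several encodings.
def try_decodings(raw, encodings):
    results = {}
    for enc in encodings:
        try:
            results[enc] = raw.decode(enc)
        except UnicodeDecodeError as err:
            # err.start is the offset of the first offending byte
            results[enc] = f"error at byte {err.start}"
    return results

# Assumed sample: 0xEA followed by 0xEB, as two Hebrew-range bytes in a row.
sample = bytes([0xEA, 0xEB])
results = try_decodings(sample, ["iso8859_8", "cp1255", "utf-8"])
for enc, text in results.items():
    print(enc, ascii(text))
```

Under ISO-8859-8 and Windows-1255 both bytes decode to Hebrew letters (Kaf sofit, Kaf), while the UTF-8 attempt fails on the first byte.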

Anyway I just opened the text file with Mozilla and tried to view it using
every Hebrew or Unicode encoding it supports, but none of them worked.

My last resort was to calculate the difference between the values of the
letter I get and the letter it should be. The first two letters have the same
difference (subtract two to get the original letter), but the third letter has
a different one (add five to get the original letter).
That is strange!
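
The offsets described above can be reproduced with a few lines of Python. This is a minimal sketch that assumes both strings are compared by their ISO-8859-8 byte values; the function name offsets is hypothetical, and the letters are the first three of the two strings quoted in this message, written as Unicode escapes.

```python
# First three letters of the string I get and of the string it should be.
received = "\u05da\u05db\u05d0"  # Kaf sofit, Kaf, Alef
expected = "\u05d8\u05d9\u05d5"  # Tet, Yod, Vav

def offsets(got, want, encoding="iso8859_8"):
    """Per-letter difference (wanted byte minus received byte)."""
    return [w - g for g, w in zip(got.encode(encoding), want.encode(encoding))]

diffs = offsets(received, expected)
print(diffs)
```

The differences are not constant, so a fixed shift cannot map one string to the other; the mapping looks more like a table lookup between two code pages.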

(List's Hebrew haters, please forgive the next paragraph)
Just for the record here is the string I get:
"ךכאמ ִחאפ מחאפ"
Which should be:
"טיול מחוף לחוף"
(Both strings are in logical order)
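
One guess that fits these two strings, offered as an assumption and not as a confirmed diagnosis: the names were stored in the DOS Hebrew code page (CP862), and somewhere on the way they were treated as CP850 and converted to ISO-8859-1. The sketch below undoes that chain on the string quoted above. It further assumes that the garbled string is what Windows-1255 displays for the raw bytes; the variable names are hypothetical.

```python
# The two strings from this message, written as Unicode escapes.
garbled = "\u05da\u05db\u05d0\u05de \u05b4\u05d7\u05d0\u05e4 \u05de\u05d7\u05d0\u05e4"
wanted = "\u05d8\u05d9\u05d5\u05dc \u05de\u05d7\u05d5\u05e3 \u05dc\u05d7\u05d5\u05e3"

# Assumption: the garbled text is the raw bytes viewed as Windows-1255.
raw = garbled.encode("cp1255")

# Assumed chain to undo: CP862 bytes were read as CP850 and written as ISO-8859-1.
repaired = raw.decode("latin-1").encode("cp850").decode("cp862")

print(repaired == wanted)
```

If the same chain applies to the real archive listing, each file name could be passed through the same three steps. That has only been checked against the three words quoted here.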

So I have two questions:
1. (The simple one) What's the problem with iconv?
2. What can I do with the Hebrew filenames?

Thanks!
 -Amir.
