Compile Dictionaries for Different Character Set
Argyriou Andreas
aargy at ote.gr
Mon Jan 18 03:54:17 CST 2010
Hello Dhore,
I am not sure Dhore is your first name, apologies if not. Mine is Andreas (Argyriou is the last name).
Now to the point.
I assume the clause UPSHIFT HEXADECIMAL 'E0C0' means that HEX E0 is a lowercase character and the corresponding uppercase character is HEX C0.
In this case the piece of code you want to include in your system options is as follows (after the ; I give you the names of the greek letters):
UPSHIFT &
HEXADECIMAL 'E1C1' & ;alpha
HEXADECIMAL 'E2C2' & ;beta
HEXADECIMAL 'E3C3' & ;gamma
HEXADECIMAL 'E4C4' & ;delta
HEXADECIMAL 'E5C5' & ;epsilon
HEXADECIMAL 'E6C6' & ;zeta
HEXADECIMAL 'E7C7' & ;eta
HEXADECIMAL 'E8C8' & ;theta
HEXADECIMAL 'E9C9' & ;iota
HEXADECIMAL 'EACA' & ;kappa
HEXADECIMAL 'EBCB' & ;lambda
HEXADECIMAL 'ECCC' & ;mi
HEXADECIMAL 'EDCD' & ;ni
HEXADECIMAL 'EECE' & ;xi
HEXADECIMAL 'EFCF' & ;omikron
HEXADECIMAL 'F0D0' & ;pi
HEXADECIMAL 'F1D1' & ;rho
HEXADECIMAL 'F2D3' & ;sigma final
HEXADECIMAL 'F3D3' & ;sigma
HEXADECIMAL 'F4D4' & ;tay
HEXADECIMAL 'F5D5' & ;ypsilon
HEXADECIMAL 'F6D6' & ;phi
HEXADECIMAL 'F7D7' & ;chi
HEXADECIMAL 'F8D8' & ;psi
HEXADECIMAL 'F9D9' & ;omega
HEXADECIMAL 'DCA2' & ;alpha with accent
HEXADECIMAL 'DDB8' & ;epsilon with accent
HEXADECIMAL 'DEB9' & ;eta with accent
HEXADECIMAL 'DFBA' & ;iota with accent
HEXADECIMAL 'FCBC' & ;omikron with accent
HEXADECIMAL 'FDBE' & ;ypsilon with accent
HEXADECIMAL 'FEBF' & ;omega with accent
HEXADECIMAL 'FADA' & ;iota with diaeresis
HEXADECIMAL 'FBDB' & ;ypsilon with diaeresis
HEXADECIMAL 'C0DA' & ;iota with accent and diaeresis
HEXADECIMAL 'E0DB' & ;ypsilon with accent and diaeresis
DOWNSHIFT &
HEXADECIMAL 'C1E1' & ;alpha
HEXADECIMAL 'C2E2' & ;beta
HEXADECIMAL 'C3E3' & ;gamma
HEXADECIMAL 'C4E4' & ;delta
HEXADECIMAL 'C5E5' & ;epsilon
HEXADECIMAL 'C6E6' & ;zeta
HEXADECIMAL 'C7E7' & ;eta
HEXADECIMAL 'C8E8' & ;theta
HEXADECIMAL 'C9E9' & ;iota
HEXADECIMAL 'CAEA' & ;kappa
HEXADECIMAL 'CBEB' & ;lambda
HEXADECIMAL 'CCEC' & ;mi
HEXADECIMAL 'CDED' & ;ni
HEXADECIMAL 'CEEE' & ;xi
HEXADECIMAL 'CFEF' & ;omikron
HEXADECIMAL 'D0F0' & ;pi
HEXADECIMAL 'D1F1' & ;rho
HEXADECIMAL 'D3F3' & ;sigma
HEXADECIMAL 'D4F4' & ;tay
HEXADECIMAL 'D5F5' & ;ypsilon
HEXADECIMAL 'D6F6' & ;phi
HEXADECIMAL 'D7F7' & ;chi
HEXADECIMAL 'D8F8' & ;psi
HEXADECIMAL 'D9F9' & ;omega
HEXADECIMAL 'A2DC' & ;alpha with accent
HEXADECIMAL 'B8DD' & ;epsilon with accent
HEXADECIMAL 'B9DE' & ;eta with accent
HEXADECIMAL 'BADF' & ;iota with accent
HEXADECIMAL 'BCFC' & ;omikron with accent
HEXADECIMAL 'BEFD' & ;ypsilon with accent
HEXADECIMAL 'BFFE' & ;omega with accent
HEXADECIMAL 'DAFA' & ;iota with diaeresis
HEXADECIMAL 'DBFB' & ;ypsilon with diaeresis
The above covers all the greek letters used in ISO 8859-7 including letters with accent and diaeresis. I have also consulted the relevant Wikipedia page http://en.wikipedia.org/wiki/ISO/IEC_8859-7 where I found one difference, namely uppercase alpha with accent is listed there as hex B6, while in our systems the value is hex A2. I believe A2 is the correct value.
Also, please notice that there are 3 cases where two characters upshift to the same character:
1. sigma, sigma final
2. iota with diaeresis, iota with accent and diaeresis
3. ypsilon with diaeresis, ypsilon with accent and diaeresis
For this reason the downshift lines are less than the upshift ones. As a result downshifting is not very dependable, especially in the case of the sigma final, which is frequent.
I hope this answers your question. If you need more info, please ask me.
Regards,
Andreas Argyriou
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.sowder.com/pipermail/powerh-l/attachments/20100118/8002b342/attachment.htm
More information about the powerh-l
mailing list