Compile Dictionaries for Different Character Set

Argyriou Andreas aargy at ote.gr
Mon Jan 18 03:54:17 CST 2010


Hello Dhore,



I am not sure Dhore is your first name, apologies if not. Mine is Andreas (Argyriou is the last name).



Now to the point.



I assume the clause UPSHIFT HEXADECIMAL 'E0C0' means that HEX E0 is a lowercase character and the corresponding uppercase character is HEX C0.



In this case the piece of code you want to include in your system options is as follows (after the ; I give you the names of the greek letters):



  UPSHIFT              &

    HEXADECIMAL 'E1C1' &  ;alpha

    HEXADECIMAL 'E2C2' &  ;beta

    HEXADECIMAL 'E3C3' &  ;gamma

    HEXADECIMAL 'E4C4' &  ;delta

    HEXADECIMAL 'E5C5' &  ;epsilon

    HEXADECIMAL 'E6C6' &  ;zeta

    HEXADECIMAL 'E7C7' &  ;eta

    HEXADECIMAL 'E8C8' &  ;theta

    HEXADECIMAL 'E9C9' &  ;iota

    HEXADECIMAL 'EACA' &  ;kappa

    HEXADECIMAL 'EBCB' &  ;lambda

    HEXADECIMAL 'ECCC' &  ;mi

    HEXADECIMAL 'EDCD' &  ;ni

    HEXADECIMAL 'EECE' &  ;xi

    HEXADECIMAL 'EFCF' &  ;omikron

    HEXADECIMAL 'F0D0' &  ;pi

    HEXADECIMAL 'F1D1' &  ;rho

    HEXADECIMAL 'F2D3' &  ;sigma final

    HEXADECIMAL 'F3D3' &  ;sigma

    HEXADECIMAL 'F4D4' &  ;tay

    HEXADECIMAL 'F5D5' &  ;ypsilon

    HEXADECIMAL 'F6D6' &  ;phi

    HEXADECIMAL 'F7D7' &  ;chi

    HEXADECIMAL 'F8D8' &  ;psi

    HEXADECIMAL 'F9D9' &  ;omega

    HEXADECIMAL 'DCA2' &  ;alpha with accent

    HEXADECIMAL 'DDB8' &  ;epsilon with accent

    HEXADECIMAL 'DEB9' &  ;eta with accent

    HEXADECIMAL 'DFBA' &  ;iota with accent

    HEXADECIMAL 'FCBC' &  ;omikron with accent

    HEXADECIMAL 'FDBE' &  ;ypsilon with accent

    HEXADECIMAL 'FEBF' &  ;omega with accent

    HEXADECIMAL 'FADA' &  ;iota with diaeresis

    HEXADECIMAL 'FBDB' &  ;ypsilon with diaeresis

    HEXADECIMAL 'C0DA' &  ;iota with accent and diaeresis

    HEXADECIMAL 'E0DB' &  ;ypsilon with accent and diaeresis

  DOWNSHIFT            &

    HEXADECIMAL 'C1E1' &  ;alpha

    HEXADECIMAL 'C2E2' &  ;beta

    HEXADECIMAL 'C3E3' &  ;gamma

    HEXADECIMAL 'C4E4' &  ;delta

    HEXADECIMAL 'C5E5' &  ;epsilon

    HEXADECIMAL 'C6E6' &  ;zeta

    HEXADECIMAL 'C7E7' &  ;eta

    HEXADECIMAL 'C8E8' &  ;theta

    HEXADECIMAL 'C9E9' &  ;iota

    HEXADECIMAL 'CAEA' &  ;kappa

    HEXADECIMAL 'CBEB' &  ;lambda

    HEXADECIMAL 'CCEC' &  ;mi

    HEXADECIMAL 'CDED' &  ;ni

    HEXADECIMAL 'CEEE' &  ;xi

    HEXADECIMAL 'CFEF' &  ;omikron

    HEXADECIMAL 'D0F0' &  ;pi

    HEXADECIMAL 'D1F1' &  ;rho

    HEXADECIMAL 'D3F3' &  ;sigma

    HEXADECIMAL 'D4F4' &  ;tay

    HEXADECIMAL 'D5F5' &  ;ypsilon

    HEXADECIMAL 'D6F6' &  ;phi

    HEXADECIMAL 'D7F7' &  ;chi

    HEXADECIMAL 'D8F8' &  ;psi

    HEXADECIMAL 'D9F9' &  ;omega

    HEXADECIMAL 'A2DC' &  ;alpha with accent

    HEXADECIMAL 'B8DD' &  ;epsilon with accent

    HEXADECIMAL 'B9DE' &  ;eta with accent

    HEXADECIMAL 'BADF' &  ;iota with accent

    HEXADECIMAL 'BCFC' &  ;omikron with accent

    HEXADECIMAL 'BEFD' &  ;ypsilon with accent

    HEXADECIMAL 'BFFE' &  ;omega with accent

    HEXADECIMAL 'DAFA' &  ;iota with diaeresis

    HEXADECIMAL 'DBFB' &  ;ypsilon with diaeresis



The above covers all the greek letters used in ISO 8859-7 including letters with accent and diaeresis. I have also consulted the relevant Wikipedia page http://en.wikipedia.org/wiki/ISO/IEC_8859-7 where I found one difference, namely uppercase alpha with accent is listed there as hex B6, while in our systems the value is hex A2. I believe A2 is the correct value.



Also, please notice that there are 3 cases where two characters upshift to the same character:

1. sigma, sigma final

2. iota with diaeresis, iota with accent and diaeresis

3. ypsilon with diaeresis, ypsilon with accent and diaeresis



For this reason the downshift lines are less than the upshift ones. As a result downshifting is not very dependable, especially in the case of the sigma final, which is frequent.



I hope this answers your question. If you need more info, please ask me.



Regards,

Andreas Argyriou


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.sowder.com/pipermail/powerh-l/attachments/20100118/8002b342/attachment.htm 


More information about the powerh-l mailing list