• UTF8 to CP850 translation table for Golded

    From Michiel van der Vlist@2:280/5555 to All on Sun Apr 30 10:23:37 2017
    Hello All,


    I have probably published this before, but here it is again for those who cannot read my umlauts and accents when I post something in UTF-8.

    This translation table is incomplete. It just takes care of the sz and umlauted and accented characters as found in Dutch and German.

    This table is for Golded. It may work for other readers as well. Your mileage may vary.


    === Cut ===
    ; file utf8_850.chs
    ;
    ; By Michiel van der Vlist, 2:280/5555
    ;
    ; This file is a Golded+ charset conversion module in text form.
    ;
    ; This module converts UTF-8 characters to IBM CP850 characters.
    ; For now only the characters in the range u+00C0 - u+00FF
    ;
    ; Characters outside this range are displayed as a rectangle followed by
    ; one, two or three nonsense characters.
    ;
    ;
    ; Format: ID, version, level,
    ; from charset, to charset,
    ; 128 entries: first & second byte
    ; "END"
    ; Lines beginning with a ";" or a ";" after the entries are comments
    ;
    ;
    ; cedilla = , ; dieresis = .. ; acute = '
    ; grave = ` ; circumflex = ^ ; ring = o
    ; tilde = ~ ; caron = v
    ; All of these are above the character, apart from the cedilla which is below.
    ;
    ; \ is the escape character: \0 means decimal zero,
    ; \dnnn where nnn is a decimal number, is the ordinal value of the character
    ; \xnn where nn is a hexadecimal number
    ; e.g.: \d32 is the ASCII space character
    ; Two \\ is the character "\" itself.
    ;
    0 ; ID number
    0 ; version number
    ;
    4 2 ; level number to display, level number of table
    ;
    UTF-8 ; from set
    CP850 ; to set
    ;
    \0 \xB7 ; 192 00C0 latin capital letter a with grave
    \0 \xB5 ; 193 00C1 latin capital letter a with acute
    \0 \xB6 ; 194 00C2 latin capital letter a with circumflex
    \0 \xC7 ; 195 00C3 latin capital letter a with tilde
    \0 \x8E ; 196 00C4 latin capital letter a with diaeresis
    \0 \x8F ; 197 00C5 latin capital letter a with ring above
    \0 \x92 ; 198 00C6 latin capital letter ae
    \0 \x80 ; 199 00C7 latin capital letter c with cedilla
    \0 \xD4 ; 200 00C8 latin capital letter e with grave
    \0 \x90 ; 201 00C9 latin capital letter e with acute
    \0 \xD2 ; 202 00CA latin capital letter e with circumflex
    \0 \xD3 ; 203 00CB latin capital letter e with diaeresis
    \0 \xDE ; 204 00CC latin capital letter i with grave
    \0 \xD6 ; 205 00CD latin capital letter i with acute
    \0 \xD7 ; 206 00CE latin capital letter i with circumflex
    \0 \xD8 ; 207 00CF latin capital letter i with diaeresis
    \0 \xD1 ; 208 00D0 latin capital letter eth (icelandic)
    \0 \xA5 ; 209 00D1 latin capital letter n with tilde
    \0 \xE3 ; 210 00D2 latin capital letter o with grave
    \0 \xE0 ; 211 00D3 latin capital letter o with acute
    \0 \xE2 ; 212 00D4 latin capital letter o with circumflex
    \0 \xE5 ; 213 00D5 latin capital letter o with tilde
    \0 \x99 ; 214 00D6 latin capital letter o with diaeresis
    \0 \x9E ; 215 00D7 multiplication sign
    \0 \x9D ; 216 00D8 latin capital letter o with stroke
    \0 \xEB ; 217 00D9 latin capital letter u with grave
    \0 \xE9 ; 218 00DA latin capital letter u with acute
    \0 \xEA ; 219 00DB latin capital letter u with circumflex
    \0 \x9A ; 220 00DC latin capital letter u with diaeresis
    \0 \xED ; 221 00DD latin capital letter y with acute
    \0 \xE8 ; 222 00DE latin capital letter thorn (icelandic)
    \0 \xE1 ; 223 00DF latin small letter sharp s (german)
    \0 \x85 ; 224 00E0 latin small letter a with grave
    \0 \xA0 ; 225 00E1 latin small letter a with acute
    \0 \x83 ; 226 00E2 latin small letter a with circumflex
    \0 \x9C ; 227 00E3 English Pound sign
    ;\0 \xC6 ; 227 00E3 latin small letter a with tilde
    \0 \x84 ; 228 00E4 latin small letter a with diaeresis
    ;\0 \xBE ; 229 00E5 Yen sign
    \0 \x86 ; 229 00E5 latin small letter a with ring above
    \0 \x91 ; 230 00E6 latin small letter ae
    \0 \x87 ; 231 00E7 latin small letter c with cedilla
    \0 \x8A ; 232 00E8 latin small letter e with grave
    \0 \x82 ; 233 00E9 latin small letter e with acute
    \0 \x88 ; 234 00EA latin small letter e with circumflex
    \0 \x89 ; 235 00EB latin small letter e with diaeresis
    \0 \x8D ; 236 00EC latin small letter i with grave
    \0 \xA1 ; 237 00ED latin small letter i with acute
    \0 \x8C ; 238 00EE latin small letter i with circumflex
    \0 \x8B ; 239 00EF latin small letter i with diaeresis
    \0 \xD0 ; 240 00F0 latin small letter eth (icelandic)
    \0 \xA4 ; 241 00F1 latin small letter n with tilde
    \0 \x95 ; 242 00F2 latin small letter o with grave
    \0 \xA2 ; 243 00F3 latin small letter o with acute
    \0 \x93 ; 244 00F4 latin small letter o with circumflex
    \0 \xE4 ; 245 00F5 latin small letter o with tilde
    \0 \x94 ; 246 00F6 latin small letter o with diaeresis
    \0 \xF6 ; 247 00F7 division sign
    \0 \x9B ; 248 00F8 latin small letter o with stroke
    \0 \x97 ; 249 00F9 latin small letter u with grave
    \0 \xA3 ; 250 00FA latin small letter u with acute
    \0 \x96 ; 251 00FB latin small letter u with circumflex
    \0 \x81 ; 252 00FC latin small letter u with diaeresis
    \0 \xEC ; 253 00FD latin small letter y with acute
    \0 \xE7 ; 254 00FE latin small letter thorn (icelandic)
    \0 \x98 ; 255 00FF latin small letter y with diaeresis
    \0 \xFE ; 192 C0
    \0 \xFE ; 193
    \0 \0 ; 194 C2
    \0 \0 ; 195 C3
    \0 \xFE ; 196 C4
    \0 \xFE ; 197
    \0 \xFE ; 198
    \0 \xFE ; 199
    \0 \xFE ; 200
    \0 \xFE ; 201
    \0 \xFE ; 202
    \0 \xFE ; 203
    \0 \xFE ; 204
    \0 \xFE ; 205
    \0 \xFE ; 206
    \0 \xFE ; 207
    \0 \xFE ; 208 D0
    \0 \xFE ; 209
    \0 \xFE ; 210
    \0 \xFE ; 211
    \0 \xFE ; 212
    \0 \xFE ; 213
    \0 \xFE ; 214
    \0 \xFE ; 215
    \0 \xFE ; 216
    \0 \xFE ; 217
    \0 \xFE ; 218
    \0 \xFE ; 219
    \0 \xFE ; 220
    \0 \xFE ; 221
    \0 \xFE ; 222
    \0 \xFE ; 223
    \0 \xDF ; 224 E0
    \0 \xDF ; 225
    \0 \xDF ; 226
    \0 \xDF ; 227
    \0 \xDF ; 228
    \0 \xDF ; 229
    \0 \xDF ; 230
    \0 \xDF ; 231
    \0 \xDF ; 232
    \0 \xDF ; 233
    \0 \xDF ; 234
    \0 \xDF ; 235
    \0 \xDF ; 236
    \0 \xDF ; 237
    \0 \xDF ; 238
    \0 \xDF ; 239
    \0 \xDB ; 240 F0
    \0 \xDB ; 241
    \0 \xDB ; 242
    \0 \xDB ; 243
    \0 \xDB ; 244
    \0 \xDB ; 245
    \0 \xDB ; 246
    \0 \xDB ; 247
    \0 \xDB ; 248
    \0 \xDB ; 249
    \0 \xDB ; 250
    \0 \xDB ; 251
    \0 \xDB ; 252
    \0 \xDB ; 253
    \0 \xDB ; 254
    \0 \xDB ; 255
    END
    === Cut ===
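
The escape rules in the header comments can be checked with a few lines of Python. This is only a sketch of those rules as the comments describe them, not Golded's own parser; the function names `parse_token` and `parse_entry` are made up for illustration, and a literal ";" used as a table value is not handled.

```python
def parse_token(tok):
    """Return the byte value of one table token, following the header comments."""
    if tok == "\\\\":            # two backslashes: the "\" character itself
        return ord("\\")
    if tok == "\\0":             # \0 means decimal zero
        return 0
    if tok.startswith("\\d"):    # \dnnn: decimal ordinal value
        return int(tok[2:], 10)
    if tok.startswith("\\x"):    # \xnn: hexadecimal ordinal value
        return int(tok[2:], 16)
    if len(tok) == 1:            # a plain character stands for itself
        return ord(tok)
    raise ValueError("unrecognised token: %r" % tok)


def parse_entry(line):
    """Split one table line into its (first byte, second byte) pair."""
    body = line.split(";", 1)[0]   # everything after ";" is a comment
    first, second = body.split()
    return parse_token(first), parse_token(second)


print(parse_entry(r"\0 \x84 ; 228 00E4 latin small letter a with diaeresis"))
```

Feeding it the line for the a with diaeresis gives the pair (0, 0x84), that is, a zero first byte and the CP850 code for that letter.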



    Cheers, Michiel

    --- GoldED+/W32-MSVC 1.1.5-b20170303
    * Origin: http://www.vlist.eu (2:280/5555)
  • From Richard Menedetter@2:310/31 to Michiel Van Der Vlist on Sun Apr 30 10:45:16 2017
    Hi Michiel!

    30 Apr 2017 10:23, from Michiel van der Vlist -> All:

    I have probably published this before, but here it is again for those
    that can not read my Umlauts and accents when I post something in
    UTF-8.

    Thanx for posting!

    I guess you did not do the exercise for Latin-1?

    CU, Ricsi

    --- GoldED+/LNX
    * Origin: Don't drink and derive! (2:310/31)
  • From Michiel van der Vlist@2:280/5555 to Richard Menedetter on Sun Apr 30 11:09:39 2017
    Hello Richard,

    On Sunday April 30 2017 10:45, you wrote to me:

    I have probably published this before, but here it is again for
    those that can not read my Umlauts and accents when I post
    something in UTF-8.

    Thanx for posting!

    I guess you did not do the exercise for Latin-1?

    Sorry, no. I have no environment with Latin-1 as the native set, so I have no UTF-8 to Latin-1 table. There may be some floating around in Fidonet, but I can't help you there. You will have to make one yourself. I have translation tables between CP850 and Latin-1. You could compose a UTF-8 to Latin-1 table from those...
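
One possible way to do that composition, sketched in Python with its built-in `cp850` and `latin-1` codecs instead of Golded's own CP850/Latin-1 table files. The helper names `cp850_byte_to_latin1` and `convert_entry` are hypothetical, and the "?" fallback for CP850 characters that Latin-1 lacks (such as the block characters used as placeholders) is an assumption, not something Golded prescribes.

```python
def cp850_byte_to_latin1(value, fallback=ord("?")):
    """Map one CP850 byte to the Latin-1 byte for the same character."""
    ch = bytes([value]).decode("cp850")
    try:
        return ch.encode("latin-1")[0]
    except UnicodeEncodeError:
        # The character does not exist in Latin-1 (e.g. box/block characters).
        return fallback


def convert_entry(line):
    """Rewrite the second byte of a '\\0 \\xNN ; comment' table line for Latin-1."""
    body, sep, comment = line.partition(";")
    tokens = body.split()
    if len(tokens) != 2 or not tokens[1].startswith("\\x"):
        return line                      # comment lines, "\0 \0", "END", ...
    new_value = cp850_byte_to_latin1(int(tokens[1][2:], 16))
    return f"{tokens[0]} \\x{new_value:02X} ;{comment}"


print(convert_entry(r"\0 \x84 ; 228 00E4 latin small letter a with diaeresis"))
```

The "from set" and "to set" lines of the resulting file would still have to be adjusted by hand, and the result tested in a reader that actually uses Latin-1.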


    Cheers, Michiel

    --- GoldED+/W32-MSVC 1.1.5-b20170303
    * Origin: Blijf Tønijn (2:280/5555)
  • From Richard Menedetter@2:310/31 to Michiel Van Der Vlist on Sun Apr 30 13:39:20 2017
    Hi Michiel!

    30 Apr 2017 11:09, from Michiel van der Vlist -> Richard Menedetter:

    I guess you did not do the exercise for Latin-1?
    Sorry no. I have no environment with Latin-1 as the native set.
    so I have no UTF-8 to Latin-1 table. There may be some floating around
    in Fidonet, but I can't help you there. You have to make one yourself.
    I have translation tables to and from CP850 to Latin-1.
    You could compose a UTF-8 to Latin-1 table from those...

    I am still gathering motivation to do so ;)

    If I get around to doing it sometime in the far future, I will post it here.

    CU, Ricsi

    --- GoldED+/LNX
    * Origin: Creativity is great, but plagiarism is faster. (2:310/31)
  • From Michiel van der Vlist@2:280/5555 to Richard Menedetter on Sun Apr 30 14:04:32 2017
    Hello Richard,

    On Sunday April 30 2017 13:39, you wrote to me:

    I guess you did not do the exercise for Latin-1?

    Sorry no. I have no environment with Latin-1 as the native set.
    so I have no UTF-8 to Latin-1 table. There may be some floating
    around in Fidonet, but I can't help you there. You have to make
    one yourself. I have translation tables to and from CP850 to
    Latin-1. You could compose a UTF-8 to Latin-1 table from those...

    I am still gathering motivation to do so ;)

    It is a good exercise to get to know how the translation mechanism works in Golded. If it is only for the German umlauts and the sz, I could do it for you; I have the experience. But I also need motivation, as I will not use it myself. It is only useful for those who have Latin-1 as the native character set.
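
As background for that exercise, the UTF-8 side of the arithmetic can be sketched in Python. This only shows how a two-byte UTF-8 sequence maps to a code point and then to a CP850 byte via Python's `cp850` codec; it makes no claim about how Golded applies the table internally, and the function names are made up for illustration.

```python
def decode_two_byte(lead, cont):
    """Return the code point encoded by a two-byte UTF-8 sequence."""
    if not (0xC2 <= lead <= 0xDF and 0x80 <= cont <= 0xBF):
        raise ValueError("not a valid two-byte UTF-8 sequence")
    # Five payload bits from the lead byte, six from the continuation byte.
    return ((lead & 0x1F) << 6) | (cont & 0x3F)


def utf8_pair_to_cp850(lead, cont):
    """Return the CP850 byte for a two-byte UTF-8 sequence."""
    return chr(decode_two_byte(lead, cont)).encode("cp850")[0]


# C3 A4 is the a with diaeresis (U+00E4), C3 9F the sharp s (U+00DF).
print(hex(utf8_pair_to_cp850(0xC3, 0xA4)), hex(utf8_pair_to_cp850(0xC3, 0x9F)))
```

Note that C2 A3 (the pound sign, U+00A3) and C3 A3 (a with tilde, U+00E3) share the same second byte, which is presumably why the table has to pick one of the two for that slot.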

    But maybe someone in here has already done it. Does anyone have the UTF-8 to Latin-1 equivalent of the table posted a few messages ago?

    If I get around to doing it sometime in the far future, I will post it
    here.

    Good.

    * Origin: Creativity is great, but plagiarism is faster. (2:310/31)

    Exactly! ;-)


    Cheers, Michiel

    --- GoldED+/W32-MSVC 1.1.5-b20170303
    * Origin: Blijf Tønijn (2:280/5555)
  • From Richard Menedetter@2:310/31 to Michiel Van Der Vlist on Sun Apr 30 18:07:02 2017
    Hi Michiel!

    30 Apr 2017 14:04, from Michiel van der Vlist -> Richard Menedetter:

    I am still gathering motivation to do so ;)
    It is a good exercise to get to know how the translation mechanism works
    in Golded. If it is only for the German umlauts and the sz, I could
    do it for you, I have the experience. But I also need motivation as I
    will not use it myself. It is only useful for those that have Latin-1
    as the native character set.

    No need for you to do it.
    I saved your 850 file ... when I have gathered enough motivation I will try it.
    That will take muuuuch time ;))

    * Origin: Creativity is great, but plagiarism is faster.
    Exactly! ;-)

    Sometimes I think my Golded+ has some kind of AI tagline selection algorithm built into it ;)

    CU, Ricsi

    --- GoldED+/LNX
    * Origin: Better to reign in hell than to serve in heaven. (2:310/31)