< prev index next >

test/jdk/java/lang/Character/SpecialCasing.txt

Print this page


   1 # SpecialCasing-10.0.0.txt
   2 # Date: 2017-04-14, 05:40:43 GMT
   3 # Copyright (c) 2017 Unicode, Inc.
   4 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
   5 # For terms of use, see http://www.unicode.org/terms_of_use.html
   6 #
   7 # Unicode Character Database
   8 #   For documentation, see http://www.unicode.org/reports/tr44/
   9 #
  10 # Special Casing
  11 #
  12 # This file is a supplement to the UnicodeData.txt file. It does not define any
  13 # properties, but rather provides additional information about the casing of
  14 # Unicode characters, for situations when casing incurs a change in string length
  15 # or is dependent on context or locale. For compatibility, the UnicodeData.txt
  16 # file only contains simple case mappings for characters where they are one-to-one
  17 # and independent of context and language. The data in this file, combined with
  18 # the simple case mappings in UnicodeData.txt, defines the full case mappings
  19 # Lowercase_Mapping (lc), Titlecase_Mapping (tc), and Uppercase_Mapping (uc).
  20 #
  21 # Note that the preferred mechanism for defining tailored casing operations is
  22 # the Unicode Common Locale Data Repository (CLDR). For more information, see the
  23 # discussion of case mappings and case algorithms in the Unicode Standard.


 104 1FC6; 1FC6; 0397 0342; 0397 0342; # GREEK SMALL LETTER ETA WITH PERISPOMENI
 105 1FD2; 1FD2; 0399 0308 0300; 0399 0308 0300; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND VARIA
 106 1FD3; 1FD3; 0399 0308 0301; 0399 0308 0301; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA
 107 1FD6; 1FD6; 0399 0342; 0399 0342; # GREEK SMALL LETTER IOTA WITH PERISPOMENI
 108 1FD7; 1FD7; 0399 0308 0342; 0399 0308 0342; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI
 109 1FE2; 1FE2; 03A5 0308 0300; 03A5 0308 0300; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND VARIA
 110 1FE3; 1FE3; 03A5 0308 0301; 03A5 0308 0301; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND OXIA
 111 1FE4; 1FE4; 03A1 0313; 03A1 0313; # GREEK SMALL LETTER RHO WITH PSILI
 112 1FE6; 1FE6; 03A5 0342; 03A5 0342; # GREEK SMALL LETTER UPSILON WITH PERISPOMENI
 113 1FE7; 1FE7; 03A5 0308 0342; 03A5 0308 0342; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI
 114 1FF6; 1FF6; 03A9 0342; 03A9 0342; # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
 115 
 116 # IMPORTANT-when iota-subscript (0345) is uppercased or titlecased,
 117 #  the result will be incorrect unless the iota-subscript is moved to the end
 118 #  of any sequence of combining marks. Otherwise, the accents will go on the capital iota.
 119 #  This process can be achieved by first transforming the text to NFC before casing.
 120 #  E.g. <alpha><iota_subscript><acute> is uppercased to <ALPHA><acute><IOTA>
 121 
 122 # The following cases are already in the UnicodeData.txt file, so are only commented here.
 123 
 124 # 0345; 0345; 0345; 0399; # COMBINING GREEK YPOGEGRAMMENI
 125 
 126 # All letters with YPOGEGRAMMENI (iota-subscript) or PROSGEGRAMMENI (iota adscript)
 127 # have special uppercases.
 128 # Note: characters with PROSGEGRAMMENI are actually titlecase, not uppercase!
 129 
 130 1F80; 1F80; 1F88; 1F08 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI
 131 1F81; 1F81; 1F89; 1F09 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND YPOGEGRAMMENI
 132 1F82; 1F82; 1F8A; 1F0A 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA AND YPOGEGRAMMENI
 133 1F83; 1F83; 1F8B; 1F0B 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA AND YPOGEGRAMMENI
 134 1F84; 1F84; 1F8C; 1F0C 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA AND YPOGEGRAMMENI
 135 1F85; 1F85; 1F8D; 1F0D 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA AND YPOGEGRAMMENI
 136 1F86; 1F86; 1F8E; 1F0E 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI
 137 1F87; 1F87; 1F8F; 1F0F 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
 138 1F88; 1F80; 1F88; 1F08 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI
 139 1F89; 1F81; 1F89; 1F09 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND PROSGEGRAMMENI
 140 1F8A; 1F82; 1F8A; 1F0A 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA AND PROSGEGRAMMENI
 141 1F8B; 1F83; 1F8B; 1F0B 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA AND PROSGEGRAMMENI
 142 1F8C; 1F84; 1F8C; 1F0C 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA AND PROSGEGRAMMENI
 143 1F8D; 1F85; 1F8D; 1F0D 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA AND PROSGEGRAMMENI
 144 1F8E; 1F86; 1F8E; 1F0E 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI


   1 # SpecialCasing-11.0.0.txt
   2 # Date: 2018-02-22, 06:16:47 GMT
   3 # Copyright (c) 2018 Unicode, Inc.
   4 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
   5 # For terms of use, see http://www.unicode.org/terms_of_use.html
   6 #
   7 # Unicode Character Database
   8 #   For documentation, see http://www.unicode.org/reports/tr44/
   9 #
  10 # Special Casing
  11 #
  12 # This file is a supplement to the UnicodeData.txt file. It does not define any
  13 # properties, but rather provides additional information about the casing of
  14 # Unicode characters, for situations when casing incurs a change in string length
  15 # or is dependent on context or locale. For compatibility, the UnicodeData.txt
  16 # file only contains simple case mappings for characters where they are one-to-one
  17 # and independent of context and language. The data in this file, combined with
  18 # the simple case mappings in UnicodeData.txt, defines the full case mappings
  19 # Lowercase_Mapping (lc), Titlecase_Mapping (tc), and Uppercase_Mapping (uc).
  20 #
  21 # Note that the preferred mechanism for defining tailored casing operations is
  22 # the Unicode Common Locale Data Repository (CLDR). For more information, see the
  23 # discussion of case mappings and case algorithms in the Unicode Standard.


 104 1FC6; 1FC6; 0397 0342; 0397 0342; # GREEK SMALL LETTER ETA WITH PERISPOMENI
 105 1FD2; 1FD2; 0399 0308 0300; 0399 0308 0300; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND VARIA
 106 1FD3; 1FD3; 0399 0308 0301; 0399 0308 0301; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA
 107 1FD6; 1FD6; 0399 0342; 0399 0342; # GREEK SMALL LETTER IOTA WITH PERISPOMENI
 108 1FD7; 1FD7; 0399 0308 0342; 0399 0308 0342; # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI
 109 1FE2; 1FE2; 03A5 0308 0300; 03A5 0308 0300; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND VARIA
 110 1FE3; 1FE3; 03A5 0308 0301; 03A5 0308 0301; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND OXIA
 111 1FE4; 1FE4; 03A1 0313; 03A1 0313; # GREEK SMALL LETTER RHO WITH PSILI
 112 1FE6; 1FE6; 03A5 0342; 03A5 0342; # GREEK SMALL LETTER UPSILON WITH PERISPOMENI
 113 1FE7; 1FE7; 03A5 0308 0342; 03A5 0308 0342; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI
 114 1FF6; 1FF6; 03A9 0342; 03A9 0342; # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
 115 
 116 # IMPORTANT-when iota-subscript (0345) is uppercased or titlecased,
 117 #  the result will be incorrect unless the iota-subscript is moved to the end
 118 #  of any sequence of combining marks. Otherwise, the accents will go on the capital iota.
 119 #  This process can be achieved by first transforming the text to NFC before casing.
 120 #  E.g. <alpha><iota_subscript><acute> is uppercased to <ALPHA><acute><IOTA>
 121 
 122 # The following cases are already in the UnicodeData.txt file, so are only commented here.
 123 
 124 # 0345; 0345; 0399; 0399; # COMBINING GREEK YPOGEGRAMMENI
 125 
 126 # All letters with YPOGEGRAMMENI (iota-subscript) or PROSGEGRAMMENI (iota adscript)
 127 # have special uppercases.
 128 # Note: characters with PROSGEGRAMMENI are actually titlecase, not uppercase!
 129 
 130 1F80; 1F80; 1F88; 1F08 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI
 131 1F81; 1F81; 1F89; 1F09 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND YPOGEGRAMMENI
 132 1F82; 1F82; 1F8A; 1F0A 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA AND YPOGEGRAMMENI
 133 1F83; 1F83; 1F8B; 1F0B 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA AND YPOGEGRAMMENI
 134 1F84; 1F84; 1F8C; 1F0C 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA AND YPOGEGRAMMENI
 135 1F85; 1F85; 1F8D; 1F0D 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA AND YPOGEGRAMMENI
 136 1F86; 1F86; 1F8E; 1F0E 0399; # GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI
 137 1F87; 1F87; 1F8F; 1F0F 0399; # GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
 138 1F88; 1F80; 1F88; 1F08 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI
 139 1F89; 1F81; 1F89; 1F09 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND PROSGEGRAMMENI
 140 1F8A; 1F82; 1F8A; 1F0A 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA AND PROSGEGRAMMENI
 141 1F8B; 1F83; 1F8B; 1F0B 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA AND PROSGEGRAMMENI
 142 1F8C; 1F84; 1F8C; 1F0C 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA AND PROSGEGRAMMENI
 143 1F8D; 1F85; 1F8D; 1F0D 0399; # GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA AND PROSGEGRAMMENI
 144 1F8E; 1F86; 1F8E; 1F0E 0399; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI


< prev index next >