List of non-unified Han characters
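Some characters are merely regional variants of the same character, but the source separation rule required them to be encoded separately. Several national and industry standards contain characters whose glyphs are very similar yet occupy distinct code positions: South Korea's KS X 1001:1998 (U+F900-U+FA0B, 268 characters), Taiwan's Big5 (U+FA0C-U+FA0D, 2 characters), Japan's IBM 32 (a CP932 variant; U+FA0E-U+FA2D, 32 characters), South Korea's KS X 1001:2004 (U+FA2E-U+FA2F, 2 characters), Japan's JIS X 0213 (U+FA30-U+FA6A, 59 characters), ARIB STD-B24 (U+FA6B-U+FA6D, 3 characters), and North Korea's KPS 10721-2000 (U+FA70-U+FAD9, 106 characters). The "Compatibility Ideographs" block was created to allow interchange with these standards. Notably, the source separation rule was abandoned from the point at which the Unicode Consortium decided to place non-orthodox forms in the Compatibility Ideographs block of the Basic Multilingual Plane, because the Taiwanese source (T-source, i.e. CNS 11643) contains too many characters with nearly identical glyphs that should be unified under the Unicode standard. For such characters, only the orthodox form is encoded in the regular character set (including the extension blocks), while the non-orthodox forms are encoded in the "Compatibility Ideographs Supplement" block in the Supplementary Ideographic Plane (Plane 2).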
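The code point ranges listed above can be captured in a small lookup table. The following is a minimal sketch, not an official mapping: the names `COMPAT_RANGES`, `compat_source`, and `range_sizes` are illustrative, and the ranges and labels are copied from the paragraph above rather than from the Unicode data files.

```python
# Ranges as listed in the paragraph above: (first, last, source label).
# The labels are informal and taken from the text, not from Unicode data files.
COMPAT_RANGES = [
    (0xF900, 0xFA0B, "KS X 1001:1998"),
    (0xFA0C, 0xFA0D, "Big5"),
    (0xFA0E, 0xFA2D, "IBM 32"),
    (0xFA2E, 0xFA2F, "KS X 1001:2004"),
    (0xFA30, 0xFA6A, "JIS X 0213"),
    (0xFA6B, 0xFA6D, "ARIB STD-B24"),
    (0xFA70, 0xFAD9, "KPS 10721-2000"),
]


def compat_source(ch):
    """Return the source label for a character in one of the ranges, else None."""
    cp = ord(ch)
    for first, last, label in COMPAT_RANGES:
        if first <= cp <= last:
            return label
    return None


def range_sizes():
    """Return the number of code points covered by each listed range."""
    return {label: last - first + 1 for first, last, label in COMPAT_RANGES}
```

Calling `range_sizes()` gives a quick way to check that each range spans exactly the number of characters stated in the text, for example 268 for the KS X 1001:1998 range.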
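The following lists all characters that appear in the ISO/IEC JTC1/SC2/WG2 source separation rule document. Some of these separations are nevertheless justified, because the different glyph forms carry different meanings.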
Unicode | Character | Unicode | Character | Unicode | Character |
---|---|---|---|---|---|
U+4E1F | 丟 | U+4E22 | 丢 | ||
U+4E48 | 么 | U+5E7A | 幺 | ||
U+4E89 | 争 | U+722D | 爭 | ||
U+4EDE | 仞 | U+4EED | 仭 | ||
U+4F75 | 併 | U+5002 | 倂 | ||
U+4FA3 | 侣 | U+4FB6 | 侶 | ||
U+4FC1 | 俁 | U+4FE3 | 俣 | ||
U+4FDE | 俞 | U+516A | 兪 | ||
U+4FF1 | 俱 | U+5036 | 倶 | ||
U+5024 | 値 | U+503C | 值 | ||
U+5077 | 偷 | U+5078 | 偸 | ||
U+507D | 偽 | U+50DE | 僞 | ||
U+514C | 兌 | U+5151 | 兑 | ||
U+514E | 兎 | U+5154 | 兔 | ||
U+5156 | 兖 | U+5157 | 兗 | ||
U+518A | 冊 | U+518C | 册 | ||
U+51C0 | 净 | U+51C8 | 凈 | ||
U+51E2 | 凢 | U+51E3 | 凣 | ||
U+5203 | 刃 | U+5204 | 刄 | ||
U+520A | 刊 | U+520B | 刋 | ||
U+5220 | 删 | U+522A | 刪 | ||
U+5225 | 別 | U+522B | 别 | ||
U+5238 | 券 | U+52B5 | 劵 | ||
U+5239 | 刹 | U+524E | 剎 | ||
U+524F | 剏 | U+5259 | 剙 | ||
U+525D | 剝 | U+5265 | 剥 | ||
U+5292 | 劒 | U+5294 | 劔 | ||
U+52FB | 勻 | U+5300 | 匀 | ||
U+5355 | 单 | U+5358 | 単 | ||
U+5373 | 即 | U+537D | 卽 | ||
U+5377 | 卷 | U+5DFB | 巻 | ||
U+53C1 | 叁 | U+53C2 | 参 | ||
U+53C3 | 參 | U+53C4 | 叄 | ||
U+5415 | 吕 | U+5442 | 呂 | ||
U+541E | 吞 | U+5451 | 呑 | ||
U+5433 | 吳 | U+5434 | 吴 | U+5449 | 呉 |
U+5436 | 吶 | U+5450 | 呐 | ||
U+543F | 吿 | U+544A | 告 | ||
U+5527 | 唧 | U+559E | 喞 | ||
U+55A9 | 喩 | U+55BB | 喻 | ||
U+5618 | 嘘 | U+5653 | 噓 | ||
U+568F | 嚏 | U+5694 | 嚔 | ||
U+56EF | 囯 | U+56FD | 国 | ||
U+5708 | 圈 | U+570F | 圏 | ||
U+570E | 圎 | U+5713 | 圓 | ||
U+5716 | 圖 | U+5717 | 圗 | ||
U+5759 | 坙 | U+5DE0 | 巠 | ||
U+57D2 | 埒 | U+57D3 | 埓 | ||
U+5848 | 塈 | U+588D | 墍 | ||
U+5861 | 塡 | U+586B | 填 | ||
U+5897 | 増 | U+589E | 增 | ||
U+58EE | 壮 | U+58EF | 壯 | ||
U+58FD | 壽 | U+5900 | 夀 | ||
U+5910 | 夐 | U+657B | 敻 | ||
U+5965 | 奥 | U+5967 | 奧 | ||
U+5968 | 奨 | U+596C | 奬 | U+734E | 獎 |
U+5986 | 妆 | U+599D | 妝 | ||
U+598D | 妍 | U+59F8 | 姸 | ||
U+59CD | 姍 | U+59D7 | 姗 | ||
U+5A1B | 娛 | U+5A2F | 娯 | U+5A31 | 娱 |
U+5A55 | 婕 | U+5AAB | 媫 | ||
U+5A7E | 婾 | U+5AAE | 媮 | ||
U+5AAA | 媪 | U+5ABC | 媼 | ||
U+5AAF | 媯 | U+5B00 | 嬀 | ||
U+5B0E | 嬎 | U+5B14 | 嬔 | ||
U+5B24 | 嬤 | U+5B37 | 嬷 | ||
U+5B73 | 孳 | U+5B76 | 孶 | ||
U+5BAB | 宫 | U+5BAE | 宮 | ||
U+5BDB | 寛 | U+5BEC | 寬 | ||
U+5BDC | 寜 | U+5BE7 | 寧 | ||
U+5BDD | 寝 | U+5BE2 | 寢 | ||
U+5C02 | 専 | U+5C08 | 專 | ||
U+5C06 | 将 | U+5C07 | 將 | ||
U+5C13 | 尓 | U+5C14 | 尔 | ||
U+5C19 | 尙 | U+5C1A | 尚 | ||
U+5C2A | 尪 | U+5C2B | 尫 | ||
U+5C36 | 尶 | U+5C37 | 尷 | ||
U+5C4F | 屏 | U+5C5B | 屛 | ||
U+5CE5 | 峥 | U+5D22 | 崢 | ||
U+5DD3 | 巓 | U+5DD4 | 巔 | ||
U+5E21 | 帡 | U+5E32 | 帲 | ||
U+5E2F | 帯 | U+5E36 | 帶 | ||
U+5E76 | 并 | U+5E77 | 幷 | ||
U+5EC4 | 廄 | U+5ECF | 廏 | ||
U+5F11 | 弑 | U+5F12 | 弒 | ||
U+5F37 | 強 | U+5F3A | 强 | ||
U+5F39 | 弹 | U+5F3E | 弾 | ||
U+5F50 | 彐 | U+5F51 | 彑 | ||
U+5F54 | 彔 | U+5F55 | 录 | ||
U+5F59 | 彙 | U+5F5A | 彚 | ||
U+5F5B | 彛 | U+5F5C | 彜 | ||
U+5F5D | 彝 | U+5F5E | 彞 | ||
U+5F65 | 彥 | U+5F66 | 彦 | ||
U+5FB3 | 徳 | U+5FB7 | 德 | ||
U+5FB4 | 徴 | U+5FB5 | 徵 | ||
U+6075 | 恵 | U+60E0 | 惠 | ||
U+6085 | 悅 | U+60A6 | 悦 | ||
U+609E | 悞 | U+60AE | 悮 | ||
U+60B3 | 悳 | U+60EA | 惪 | ||
U+6120 | 愠 | U+614D | 慍 | ||
U+613C | 愼 | U+614E | 慎 | ||
U+6229 | 戩 | U+622C | 戬 | ||
U+622F | 戯 | U+6231 | 戱 | ||
U+6236 | 戶 | U+6237 | 户 | U+6238 | 戸 |
U+623B | 戻 | U+623E | 戾 | ||
U+629B | 抛 | U+62CB | 拋 | ||
U+629C | 抜 | U+62D4 | 拔 | ||
U+6329 | 挩 | U+635D | 捝 | ||
U+633F | 挿 | U+63D2 | 插 | U+63F7 | 揷 |
U+634F | 捏 | U+63D1 | 揑 | ||
U+635C | 捜 | U+641C | 搜 | ||
U+63B2 | 掲 | U+63ED | 揭 | ||
U+63FA | 揺 | U+6416 | 搖 | U+6447 | 摇 |
U+63FE | 揾 | U+6435 | 搵 | ||
U+6483 | 撃 | U+64CA | 擊 | ||
U+654E | 敎 | U+6559 | 教 | ||
U+6553 | 敓 | U+655A | 敚 | ||
U+65E2 | 既 | U+65E3 | 旣 | ||
U+6602 | 昂 | U+663B | 昻 | ||
U+665A | 晚 | U+6669 | 晩 | ||
U+66A8 | 暨 | U+66C1 | 曁 | ||
U+66FD | 曽 | U+66FE | 曾 | ||
U+67B4 | 枴 | U+67FA | 柺 | ||
U+67E5 | 查 | U+67FB | 査 | ||
U+67F5 | 柵 | U+6805 | 栅 | ||
U+68B2 | 梲 | U+68C1 | 棁 | ||
U+6961 | 楡 | U+6986 | 榆 | ||
U+6982 | 概 | U+69EA | 槪 | ||
U+6985 | 榅 | U+69B2 | 榲 | ||
U+699D | 榝 | U+6A27 | 樧 | ||
U+69C7 | 槇 | U+69D9 | 槙 | ||
U+69D8 | 様 | U+6A23 | 樣 | ||
U+6A2A | 横 | U+6A6B | 橫 | ||
U+6B65 | 步 | U+6B69 | 歩 | ||
U+6B72 | 歲 | U+6B73 | 歳 | ||
U+6B7F | 歿 | U+6B81 | 殁 | ||
U+6BBB | 殻 | U+6BBC | 殼 | ||
U+6BC0 | 毀 | U+6BC1 | 毁 | ||
U+6BCE | 毎 | U+6BCF | 每 | ||
U+6C32 | 氲 | U+6C33 | 氳 | ||
U+6C5A | 汚 | U+6C61 | 污 | ||
U+6C92 | 沒 | U+6CA1 | 没 | ||
U+6D44 | 浄 | U+6DE8 | 淨 | ||
U+6D89 | 涉 | U+6E09 | 渉 | ||
U+6D97 | 涗 | U+6D9A | 涚 | ||
U+6D99 | 涙 | U+6DDA | 淚 | ||
U+6DE5 | 淥 | U+6E0C | 渌 | ||
U+6DF8 | 淸 | U+6E05 | 清 | ||
U+6E07 | 渇 | U+6E34 | 渴 | ||
U+6E29 | 温 | U+6EAB | 溫 | ||
U+6E88 | 溈 | U+6F59 | 潙 | ||
U+6E89 | 溉 | U+6F11 | 漑 | ||
U+6EDA | 滚 | U+6EFE | 滾 | ||
U+6F5B | 潛 | U+6FF3 | 濳 | ||
U+7028 | 瀨 | U+702C | 瀬 | ||
U+70BA | 為 | U+7232 | 爲 | ||
U+712D | 焭 | U+7162 | 煢 | ||
U+7155 | 煕 | U+7199 | 熙 | ||
U+7174 | 煴 | U+7185 | 熅 | ||
U+72B6 | 状 | U+72C0 | 狀 | ||
U+7464 | 瑤 | U+7476 | 瑶 | ||
U+74F6 | 瓶 | U+7501 | 甁 | ||
U+7522 | 產 | U+7523 | 産 | ||
U+75E9 | 痩 | U+7626 | 瘦 | ||
U+76A1 | 皡 | U+76A5 | 皥 | ||
U+771E | 眞 | U+771F | 真 | ||
U+773E | 眾 | U+8846 | 衆 | ||
U+7814 | 研 | U+784F | 硏 | ||
U+797F | 祿 | U+7984 | 禄 | ||
U+79BF | 禿 | U+79C3 | 秃 | ||
U+7A05 | 稅 | U+7A0E | 税 | ||
U+7A42 | 穂 | U+7A57 | 穗 | ||
U+7B5D | 筝 | U+7B8F | 箏 | ||
U+7BB3 | 箳 | U+7C08 | 簈 | ||
U+7BE1 | 篡 | U+7C12 | 簒 | ||
U+7CA4 | 粤 | U+7CB5 | 粵 | ||
U+7D55 | 絕 | U+7D76 | 絶 | ||
U+7DA0 | 綠 | U+7DD1 | 緑 | ||
U+7DD2 | 緒 | U+7DD6 | 緖 | ||
U+7DE3 | 緣 | U+7E01 | 縁 | ||
U+7DFC | 緼 | U+7E15 | 縕 | ||
U+7E48 | 繈 | U+7E66 | 繦 | ||
U+7FAE | 羮 | U+7FB9 | 羹 | ||
U+7FF6 | 翶 | U+7FFA | 翺 | ||
U+80FC | 胼 | U+8141 | 腁 | ||
U+812B | 脫 | U+8131 | 脱 | ||
U+817D | 腽 | U+8183 | 膃 | ||
U+8203 | 舃 | U+8204 | 舄 | ||
U+820D | 舍 | U+820E | 舎 | ||
U+8216 | 舖 | U+8217 | 舗 | ||
U+8358 | 荘 | U+838A | 莊 | ||
U+83D1 | 菑 | U+8458 | 葘 | ||
U+8480 | 蒀 | U+8495 | 蒕 | ||
U+848B | 蒋 | U+8523 | 蔣 | ||
U+848D | 蒍 | U+853F | 蔿 | ||
U+8570 | 蕰 | U+8580 | 薀 | ||
U+85AB | 薫 | U+85B0 | 薰 | ||
U+85F4 | 藴 | U+860A | 蘊 | ||
U+865A | 虚 | U+865B | 虛 | ||
U+86FB | 蛻 | U+8715 | 蜕 | ||
U+885B | 衛 | U+885E | 衞 | ||
U+886E | 衮 | U+889E | 袞 | ||
U+88C5 | 装 | U+88DD | 裝 | ||
U+8A2E | 訮 | U+8A7D | 詽 | ||
U+8AAA | 說 | U+8AAC | 説 | ||
U+8ACC | 諌 | U+8AEB | 諫 | ||
U+8B20 | 謠 | U+8B21 | 謡 | ||
U+8C5C | 豜 | U+8C63 | 豣 | ||
U+8D70 | 走 | U+8D71 | 赱 | ||
U+8EFF | 軿 | U+8F27 | 輧 | ||
U+8F1C | 輜 | U+8F3A | 輺 | ||
U+8F3C | 輼 | U+8F40 | 轀 | ||
U+8FBE | 达 | U+8FD6 | 迖 | ||
U+8FF8 | 迸 | U+902C | 逬 | ||
U+9059 | 遙 | U+9065 | 遥 | ||
U+90A2 | 邢 | U+90C9 | 郉 | ||
U+90CE | 郎 | U+90DE | 郞 | ||
U+90F7 | 郷 | U+9109 | 鄉 | U+9115 | 鄕 |
U+9196 | 醖 | U+919E | 醞 | ||
U+91A4 | 醤 | U+91AC | 醬 | ||
U+9203 | 鈃 | U+9292 | 銒 | ||
U+92B3 | 銳 | U+92ED | 鋭 | ||
U+9304 | 錄 | U+9332 | 録 | ||
U+932C | 錬 | U+934A | 鍊 | ||
U+93AD | 鎭 | U+93AE | 鎮 | ||
U+95B1 | 閱 | U+95B2 | 閲 | ||
U+9667 | 陧 | U+9689 | 隉 | ||
U+9751 | 靑 | U+9752 | 青 | ||
U+9759 | 静 | U+975C | 靜 | ||
U+976D | 靭 | U+9771 | 靱 | ||
U+9839 | 頹 | U+983D | 頽 | ||
U+984F | 顏 | U+9854 | 顔 | ||
U+985A | 顚 | U+985B | 顛 | ||
U+98EE | 飮 | U+98F2 | 飲 | ||
U+9905 | 餅 | U+9920 | 餠 | ||
U+99B1 | 馱 | U+99C4 | 駄 | ||
U+99E2 | 駢 | U+9A08 | 騈 | ||
U+9AA9 | 骩 | U+9AAB | 骫 | ||
U+9AD8 | 高 | U+9AD9 | 髙 | ||
U+9AEA | 髪 | U+9AEE | 髮 | ||
U+9B2C | 鬬 | U+9B2D | 鬭 | ||
U+9C1B | 鰛 | U+9C2E | 鰮 | ||
U+9CEF | 鳯 | U+9CF3 | 鳳 | ||
U+9D87 | 鶇 | U+9DAB | 鶫 | ||
U+9DC6 | 鷆 | U+9DCF | 鷏 | ||
U+9EAA | 麪 | U+9EAB | 麫 | ||
U+9EBC | 麼 | U+9EBD | 麽 | ||
U+9EC3 | 黃 | U+9EC4 | 黄 | ||
U+9ED1 | 黑 | U+9ED2 | 黒 |
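Each row of the table above pairs a code point with the character it is supposed to encode. A minimal sketch for checking that a row is self-consistent is shown here; `parse_row` and `row_is_consistent` are hypothetical helper names, and the row format is assumed to be exactly the `U+XXXX | character` layout used in this table.

```python
import re

# Matches one "U+XXXX | character" cell pair in a table row.
ROW_CELL = re.compile(r"U\+([0-9A-F]{4,6})\s*\|\s*(\S)")


def parse_row(row):
    """Return [(code_point, character), ...] for one table row."""
    return [(int(hex_cp, 16), ch) for hex_cp, ch in ROW_CELL.findall(row)]


def row_is_consistent(row):
    """True if every character in the row matches its listed code point."""
    return all(chr(cp) == ch for cp, ch in parse_row(row))


if __name__ == "__main__":
    # First row of the table above; the trailing "||" marks the empty third pair.
    print(row_is_consistent("U+4E1F | 丟 | U+4E22 | 丢 | ||"))
```

Because every entry in the table is a separately encoded character, the two or three characters in a row always parse to distinct code points.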
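After the table above was published, WG2 also examined other Han characters[1] and concluded that the following characters in the Basic Multilingual Plane could also be considered for inclusion in ISO 10646 Annex S3: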
Unicode | Character | Unicode | Character |
---|---|---|---|
U+5022 | 倢 | U+507C | 偼 |
U+52C0 | 勀 | U+52CA | 勊 |
U+5637 | 嘷 | U+5651 | 噑 |
U+5EFB | 廻 | U+5EFD | 廽 |
U+6323 | 挣 | U+6399 | 掙 |
U+66AD | 暭 | U+66CD | 曍 |
U+6808 | 栈 | U+685F | 桟 |
U+6D85 | 涅 | U+6E7C | 湼 |
U+6F40 | 潀 | U+6F68 | 潨 |
U+6FF2 | 濲 | U+7014 | 瀔 |
U+734B | 獋 | U+7354 | 獔 |
U+84D8 | 蓘 | U+8509 | 蔉 |
U+86D4 | 蛔 | U+8716 | 蜖 |
U+8B86 | 讆 | U+8B8F | 讏 |
U+8FF4 | 迴 | U+9025 | 逥 |
U+91F0 | 釰 | U+91FC | 釼 |
Notes
References
- UCS重复汉字一覧表 (archived at the Internet Archive) (in Japanese)
- Eiso Chan: Possible unification (archived at the Internet Archive)