IPA_in_Unicode IPA_in_Unicode

IPA in Unicode - Definition and Overview

The International Phonetic Alphabet can be represented in Unicode, with symbols not used in other alphabets assigned range U+0250–02AD. The following is a representation of the IPA chart encoded in Unicode.

There also exist systems for representing the information contained in IPA in ASCII, including SAMPA, Kirshenbaum and other ad hoc systems to work around the difficulty of displaying IPA on computers.

See also: Table of Unicode characters, 128 to 999, Unicode and HTML

Contents

Consonants (pulmonic)

Bilabial Labio-
dental
Dental Alveolar Post-
alveolar
Retroflex Palatal Velar Uvular Pharyngeal Glottal
Plosive p b     t d   ʈ ɖ c ɟ k g q ɢ     ʔ  
Nasal m ɱ   n   ɳ ɲ ŋ ɴ  
Trill ʙ     r         ʀ    
Tap or Flap       ɾ   ɽ          
Fricative ɸ β f v θ ð s z ʃ ʒ ʂ ʐ ç ʝ x ɣ χ ʁ ħ ʕ h ɦ
Lateral Fricative   ɬ ɮ            
Approximant   ʋ   ɹ   ɻ j ɰ      
Lateral Approximant     l   ɭ ʎ ʟ    
  • Where symbols appear in pairs, the one to the right represents a voiced consonant.
  • Shaded areas indicate articulations judged impossible.

Consonants (non-pulmonic)

Clicks Voiced implosives Ejectives
ʘ Bilabial ɓ Bilabial ʼ For example:
ǀ Dental ɗ Dental/alveolar Bilabial
ǃ Alveolar (retroflex) ʄ Palatal Dental/alveolar
ǂ Palatoalveolar (alveolar) ɠ Velar Velar
ǁ Alveolar lateral (lateral) ʛ Uvular Alveolar fricative

Vowels

Front Central Back
Close i • y   ɨ • ʉ   ɯ • u
    ɪ • ʏ   ʊ  
Close-mid e • ø   ɘ • ɵ   ɤ • o
      ə    
Open-mid ɛ • œ   ɜ • ɞ   ʌ • ɔ
  æ   ɐ    
Open a • ɶ       ɑ • ɒ


Where symbols appear in pairs, the one to the right represents a rounded vowel.

Other symbols

ʍ Voiceless labial-velar fricative
w Voiced labial-velar approximant
ɥ Voiced labial-palatal approximant
ʜ Voiceless epiglottal fricative
ʢ Voiced epiglottal fricative
ʡ Epiglottal plosive
ɕ ʑ Alveolo-palatal fricatives
ɺ Alveolar lateral flap
ɧ Simultaneous ʃ and x


Affricates and double articulations can be represented by two symbols joined by a tie bar if necessary, or represented by a ligature in the case of some common affricates:

Ligature Tie bar
ʣ d͡z
ʤ d͡ʒ
 – k͡p
ʦ t͡s
ʧ t͡ʃ


Due to a bug in the font Arial Unicode MS, these incorrectly formed character combinations may look better in your browser: dz͡  dʒ͡  kp͡  ts͡  tʃ ͡.

Suprasegmentals

ˈ Primary stress
ˌ Secondary stress
ː Long
ˑ Half-long
˘ Extra-short
. Syllable break
| Minor (foot) group
Major (intonation) group
Linking (absence of a break)

Tones and word accents

e̋ or ˥ Extra high
é or ˦ High
ē or ˧ Mid
è or ˨ Low
ȅ or ˩ Extra low
Rise
Fall
Downstep
Upstep
Global rise
Global fall

Note: Unicode does not support most IPA symbols for non-flat tones. To represent those, one possibility is to use numbers in the subscript, e.g. /e53/ for a high falling /e/. Another possibility is to use the "Box Drawings" in Unicode, e.g. /e┒/ for a high flat /e/.

Diacritics

Diacritics may be placed above a symbol with a descender, i.e. ŋ̊

n̥ d̥ Voiceless b̤ a̤ Breathy voiced t̪ d̪ Dental
s̬ t̬ Voiced b̰ a̰ Creaky voiced t̺ d̺ Apical
tʰ dʰ Aspirated t̼ d̼ Linguolabial t̻ d̻ Laminal
ɔ̹ More rounded tʷ dʷ Labialized Nasalized
ɔ̜ Less rounded tʲ dʲ Palatalized dⁿ Nasal release
Advanced tˠ dˠ Velarized Lateral release
Retracted tˁ dˁ Pharyngealized No audible release
Centralized Velarized or pharyngealized
Mid-centralized Raised (ɹ̝ = voiced alveolar fricative)
ɹ̩ Syllabic Lowered (β̞ = voiced bilabial approximant)
Non-syllabic Advanced Tongue Root
ə˞ Rhoticity Retracted Tongue Root

(Extended IPA for disordered speech.)

Labial spreading   Strong articulation   Denasal
Dentolabial Weak articulation   Nasal escape
  Interdental/bidental   Reiterated articulation   Velopharyngeal friction
  Alveolar Whistled articulation   Ingressive airflow
  Linguolabial Sliding articulation   Egressive airflow

See also

External links


Example Usage of Unicode

foxitefeed: Databases, Tables and SQL Server: urdu or arabic Unicode data http://bit.ly/3gAnuP
orangeobject: @notoroid なんと??¥p{Katakana} なんと簡単な、、でもPHPは該当しないみたいですねー. http://module.jp/blog/regex_Unicode_prop.html php は上のURLだと\xA5[\xA1-\xF6])+ みたいですね.
ka2n: u'Unicode指定するとかえって文字化けするのはなぜだろう'
Copyright 2009 WordIQ.com - Privacy Policy  :: Terms of Use  :: Contact Us  :: About Us
This article is licensed under the GNU Free Documentation License. It uses material from the this Wikipedia article.