World Library  
Flag as Inappropriate
Email this Article

Persian script

Article Id: WHEBN0003467003
Reproduction Date:

Title: Persian script  
Author: World Heritage Encyclopedia
Language: English
Subject: Persian language, Swahili language, Gujarati language, Azerbaijani language, Uttara Kannada, Saraiki dialect, Balti language, Karakalpak language, İske imlâ alphabet, Abdul Karim Khan
Publisher: World Heritage Encyclopedia

Persian script

For other scripts that have been used to write the Persian language, see Persian language#Orthography.

The Persian or Arabic alphabet (Persian: الفبای فارسی‎) is a writing system based on the Arabic script. Originally used exclusively for the Arabic language, the Arabic alphabet was adapted to the Persian language, adding four letters: پ ], چ ], ژ ], and گ ]. Many languages that use the Arabic script add other letters. Besides the Persian alphabet itself, the Arabic script has been applied to the Urdu alphabet, Sindhi alphabet, Saraiki alphabet, Kurdish Sorani alphabet, Lurish (Luri), Ottoman Turkish alphabet, Balochi alphabet, Punjabi Shahmukhi script, Tatar, Azeri, and several others.

In order to represent non-Arabic sounds, new letters were created by adding dots, lines, and other shapes to existing letters. For example, the retroflex sounds of Urdu are represented orthographically by adding a small ط above their non-retroflex counterparts: د [d̪] and ڈ [ɖ]. The voiceless retroflex fricative [ʂ] of Pashto is represented in writing by adding a dot above and below the س [s] letter, resulting in ښ. The Close back rounded vowel [u] of Kurdish is written by writing two [u], resulting in ﻭﻭ.

The Arabic script is exclusively written cursively. That is, the majority of letters in a word connect to each other. This is also implemented on computers. Whenever the Arabic script is typed, the computer connects the letters to each other. Unconnected letters are not widely accepted. In Arabic, as in Arabic, words are written from right to left while numbers are written from left to right.

A characteristic feature of this script, possibly tracing back to Ancient Egyptian hieroglyphs, is that vowels are underrepresented. For example, in Classical Arabic, of the six vowels, the three short ones are normally entirely omitted (although certain diacritics are added to indicate them in special circumstances, notably in the Qur'an), while the three long ones are represented ambiguously by certain consonants. Only Kashmiri, Uyghur, Kyrgyz (in China), Kazakh (in China), Kurdish and (formerly) Bosnian, of the many languages using adaptations of this script, regularly indicate all vowels.


Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, beginning (joined on the left), middle (joined on both sides), and end (joined on the right) of a word.

The letter names are mostly identical to the ones used in Arabic, except for the Persian pronunciation of the consonants. The only ambiguous name is he used for both and ه. For clarification, these are often called ḥe-ye jimi (literally "jim-like ḥe" after jim, the name for the letter ج that uses the same base form) and he-ye do-češm (literally "two-eyed he", after the contextual middle letterform ), respectively.

Name DIN 31635 IPA Contextual forms
End Middle Beginning Isolated
ʾalef ā / ʾ [ɒ], [ʔ] ـا ـا * آ / ا *
be b [b] ـب ـبـ ب
pe p [p] ـپ ـپـ پ
te t [t] ـت ـتـ
s̱e [s] ـث ـثـ
jim j [d͡ʒ] ـجـ
che č [t͡ʃ] ـچـ
ḥe(-ye jimi) [h] ـحـ
khe x [x] ـخـ
dāl d [d] ـد ـد* *
ẕāl [z] ـذ ـذ* *
re r [ɾ] ـر ـر* *
ze z [z] ـز ـز* *
že ž [ʒ] ـژ ـژ* ژ* ژ
sin s [s] ـس ـسـ
šin š [ʃ] ـش ـشـ
ṣād [s] ـص ـصـ
z̤ād [z] ـض ـضـ ﺿ
ṭā [t] ـط ـطـ
ẓā [z] ـظ ـظـ
ʿeyn ʿ [ʔ] ـع ـعـ
ġeyn ġ [ɣ] / [ɢ] ـغ ـغـ
fe f [f] ـف ـفـ
qāf q [ɢ] / [ɣ] / [q] (in some dialects) ـق ـقـ
kāf k [k] ـک ـکـ ک
gāf g [ɡ] ـگ ـگـ گ
lām l [l] ـل ـلـ
mim m [m] ـم ـمـ
nun n [n] ـن ـنـ
vāv v / ū / ow [v] / [uː] / [o] / [ow] / [oː] (in Dari) ـو ـو* و* و
he(-ye do-češm) h [h] ـه ـهـ هـ
ye y / ī / á [j] / [i] / [ɒː] / [eː] (in Dari) ـیـ


There are seven letters (وژ) in the Persian alphabet that do not connect to other letters like the rest of the letters in the alphabet. These seven letters do not have distinctive initial or medial forms but the isolated and the final forms are used instead because they do not allow for a connection to be made on the left hand side to the other letters in the word. For example, when the letter ا "alef" is at the beginning of a word such as اینجا "injā" (here), the initial/isolated form of "alef" is used. Or in the case of امروز "emruz" (today) the letter "re" is the final form and the letter و "vāv" is the initial/isolated form, although they are in the middle of the word; is the initial/isolated form, although it is at the end of the word.


Persian scripts has adopted a subset of Arabic diacritics which consists of fatḥah ([æ]), kasrah ([e]), ḍammah ([ou] or [o]), sukūn, tanwīn nasb ([æn]) and tashdid. Other Arabic diacritics may be seen in Arabic words in a Persian text by the virtue of being Arabic.

Other characters

The following are not actual letters but different orthographical shapes for letters, and in the case of the lām alef, a ligature. As to hamze, it has only a single graphic, since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a vāv, ye or alef, and in that case the seat behaves like an ordinary vāv, ye or alef respectively. Technically, hamze is not a letter but a diacritic.

Name Transliteration IPA Final Medial Initial Stand-alone
alef madde ā [ɒ]
he ye -eye or -eyeh [eje] ۀ
lām alef [lɒ]

Although at first glance they may seem similar, there are many differences in the way the different languages use the alphabets. For example, similar words are written differently in Persian and Arabic, as they are used differently.

Novel letters

The main Persian letters are ا, ب, پ, ت, ج, چ, خ, د, ر, ز, ژ, س, ش, ف, ک, گ, ل, م, ن, و, ه, ی and other letters that came into it from Arabic literature. The Persian alphabet adds four letters to the Arabic alphabet, [p], [ɡ], [t͡ʃ] (ch in chair), [ʒ] (s in measure):

Sound Shape Unicode name
[p] پ pe
[t͡ʃ] (ch) چ che
[ژ] (zh) ژ zhe
[ɡ] گ gaf

Changes from the Arabic writing system

The following is a list of differences between the Arabic writing system and the Persian writing system:

  1. A hamze (ء) is neither written above an alef (ا) to denote a zabar or piš nor below to denote a zir.
  2. The final kâf is typically written without a flourish, while in Arabic it would be .
  3. The Arabic letter tāʾ marbūṭa (ة), unless used in a direct Arabic quotation, is usually changed to a te (ت) or he ه because tāʾ marbūṭa is a grammatical construct in Arabic denoting femininity. Since Persian grammar lacks gender constructs, the tāʾ marbūṭa is not necessary and is only kept to maintain fidelity to the original Arabic spelling.
  4. Two dots are removed in the final ye (ی). Arabic differentiates the final yāʾ with the two dots and the alif maqsura (except in Egyptian Arabic), which is written like a final yāʾ without two dots. Because Persian drops the two dots in the final ye, the alif maqsura cannot be differentiated from the normal final ye. For example, the name Mûsâ (Moses) is written موسی. In the final letter in Musâ, Persian does not differentiate between ye or an alif maqsura.
  5. The letters pe (پ), che (چ), že (ژ), and gâf (گ) are added because Arabic lacks these phonemes, yet they occur in the Persian language.
  6. Arabic letter waw (و) is used as vâv for [v], because Arabic has no [v] and standard Iranian Persian has [w] only within the diphthong [ow].
  7. In the Arabic alphabet hāʾ () comes before wāw (و), however in the Persian alphabet, he () comes after vâv (و).

Word boundaries

Typically words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ') are written without a space. When writing on a computer, they are separated from the word using the zero-width non-joiner.

Languages using the Arabic script

Current Use

Former Use
A number of languages have used the Arabic script before, but have since changed.

Arguments and discussions on use of Arabic

In almost all countries which use Arabic script, there have been discussions between parties about replacing it, often raising the concept of romanization. For example:

  • Tajikistan has implemented a Cyrillic alphabet instead of Arabic
  • Turkish people have chosen a Latin-Based Turkish alphabet, in part because the eight vowels of Turkish were ambiguously represented by only three symbols
  • Azerbaijan and Uzbek implemented Cyrillic, but have since switched to Latin alphabets.
  • In Iran, methods of romanizations like Unipers have been invented.
  • Kurdish language has utilized a Kurdish Latin alphabet

Relation to Islamic culture

Arabic script in some Islamic countries is being promoted and defended as a sign of Islamic culture. People and governments in some Islamic countries have an interest in this script because of its relation to Islam and because it has been utilized to write the Quran. Therefore the concept of Arabic script and Romanization in these countries is not a politically or socially neutral subject.

Other Arabic alphabets

There are many Arabic-derived alphabets which were not influenced by the Arabic script, including Jawi (used for Malay), Sorabe (Malagasy), and many alphabets used in Northern Africa. These alphabets used other innovations for writing such common sounds as [p] and [ɡ], instead of the Arabic letters پ and گ, although the Jawi script does use the same symbol for [t͡ʃ] (چ‎).

See also

External links

  • Persian dictionary that also provide Randomization
  • Virtual Persian Keyboard
  • Persian Alphabet
  • Persian alphabet, numerals, and pronunciation
  • Persian numerals
  • eiktub: web-based Arabic transliteration pad, with support for Persian characters
  • Persian Character Maps
  • Tests to Practice Joining and Disjoining Persian Letters and Frequently Occurring Shapes
  • Alphabet Tests with Audio to learn Pronunciation
  • Daoulagad - mobile Persian OCR dictionary

Template:Arabic alphabets

This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.

Copyright © World Library Foundation. All rights reserved. eBooks from School eBook Library are sponsored by the World Library Foundation,
a 501c(4) Member's Support Non-Profit Organization, and is NOT affiliated with any governmental agency or department.