Title: Hindi
1Hindi Urdu Transliteration issues
- Rahmat Yousufzai and Amba Kulkarni
2 Urdu Alphabet
- ? ? ? ? ? ?
- ? ? ? ?
- ? ? ?
- ? ? ? ?
- ? ? ? ?
- ? ? ? ?
- ? ? ? ?
- ? ? ?
- ? ? ? ? ? ?
3(No Transcript)
4Characteristics of Urdu alphabet
- Most of the Urdu characters join with the
following character and make one ligature. - Example ????? ??????
- This Urdu word is combination of 5 characters
- 5 4 3 2 1
- ? ? ? ? ?
- Urdu characters typically have different shapes
in different positions beginning, middle, last.
5Characteristics contd ...
- Some characters do not join with the following
character and are written in full form even if
they come in a middle position. - ? ? ? ? ? ? ? ? ? ?
- Example
- ???? ?? ???? ????? ???? ???? ???? ????
???? ????? - Please note that there is no space in between.
-
6Hindi Alphabet
- ? ? ? ? ? ? ?
- ? ? ? ? ?? ??
- ? ? ? ? ?
- ? ? ? ? ?
- ? ? ? ? ?
- ? ? ? ? ?
- ? ? ? ? ?
- ? ? ? ? ? ? ? ?
7Consonants missing in Hindi
- ? ? ? ?? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
- These characters do not exist in Hindi. They are
borrowed from Arabic and are used for the words
borrowed from Arabic/Persian only.
8contd2
? ? (Close to ? in Urdu with a minute difference in pronunciation. In Hindi ? is used to express these characters including ? ? ? ? ? In Hindi ? ?? Is used for all these characters. However most of the times ? is written without dot.
9Contd3
? Same as ? in Urdu with a minute difference in pronunciation. In Hindi ? is used for both the characters. ? This character is represented by ?? ? This character is represented by ?? But normally the dot is not used by Hindi writers.
10Contd4
? This is an Arabic consonant which is transliterated into Hindi as one of the vowels. ? Alif with some difference in pronunciation. Example ??? ??? ???? ??? ???? ???? ??? ?? ??? ???? ??? ?? ??? ?? ???? ???
11Contd5
- If ? comes in between or as a last character
then most of the Urdu speakers normally pronounce
it as Alif but in poetry, special care is taken
to pronounce it correctly.
12Contd6
? and ? have similar pronunciation and in Hindi ? is used to represent both the characters. ? This Character is same like F in English. This sound is represented by ? or ??. But many a times Hindi writers do not use the dot.
13Contd7
- ? This sound also does not exist in Hindi. To
represent this character, ? or ?? is used.
However normally Hindi writers do not use the dot.
14Contd8
- ? This is a Persian character and does not exist
in Arabic. In Hindi this is represented by ? and
??. - Example Television ???? ???? ?????????
15Contd9
- ? This is Do-chashmi He and gives its sound only
when joined with certain characters. - An important point to be notedArabic has a
character ? ( do-chashmi he ). This character
retains its shape only when it comes as first
and middle position. In the last position it gets
changed as ? (gola he ). - Suggestion For Urdu we should use ? with
unicode u06BE.
16Contd8
- ? Noon without dot (Noon Gunna)
- When this character comes in between the word
then dot is marked. This creates ambiguity as it
can be read as ? - Example
- ???? ??????
17Urdu characters borrowed from Hindi
-
- ? ? ? ? ? ?, ?, ??
- These characters do not exist in Arabic or
Persian. These have been borrowed from Hindi.
18Certain Hindi characters representation in Urdu
- ?? ??? ? ?? ? ?? ? ?? ? ?? ?? ? ? ?? ? ??? ?? ?
?? - ?, ?, ?, ?, ?, ?, ??, ?, ?, ?,
?These are the Hindi characters and are
represented in Urdu by adding ? (Do-Chashmi He)
to the initial character.
19Contd2
- ?? ? ?? ? ?? ? ?? ? ??? ??? ??
-
- There are no specific characters in Hindi.
However the sound is represented as under. - ??? , ?? , ???, ??? , ??? , ??, ??, ??
-
20Contd3
- Example??????? ? ????? ? ?????? ? ???? ? ?????
???? -
- ???????, ??????, ???????, ?????, ???,
- ?????
- However in Urdu these words are also
written as ??????? ? ????? ? ?????? But ????
is written without change.
21Ambiguous Characters
- 1. aliph (? ) Ena (?)These two characters
are pronounced differently, but Urdu speakers do
not pay attention to the difference. - Example ???(?? common) ? ?? (?? mango)
22Contd2
- 2. sa se (?), sIna (?), svAda (?)se (?) and
svAda (?) - These are purely arabic and Persian
characters and are used in only Arabic or Persian
words. Where as sIna (?) is used in Hindi, Urdu,
Persian and Arabic.The above characters
including ? are written as ? in Hindi.
23Contd3
- 3. Ta Te (?), Toya (?)Toya (?) is purely Arabic
and Persian character and is used only in Persian
and Arabic words. - Both the characters are written as ? in
Hindi.
24Contd4
- 4. he badI he (?) gola he (?)badI he (?) is
Arabic/Persian character. - gola he (?) is common in Arabic, Persian,
Urdu and Hindi. - Hindi equivalent for both ?
25Contd5
- 5. ja jAla (?), je (?), jvAda (?), joya (?),
- PArasI je (?)Z sound is not available in
Hindi and almost in all Indian languages. Instead
j is used. - The sound of ja jAla (?), je (?), jvAxa (?),
joya (?) are almost the same and all are
Arabic/Persian characters. - je (?) is purely Persian character and is not
available in even Arabic. - For all the above characters ? or ?? is
used. Most of the times dot below is not written.
26Contd2
- It may be noted that if the next character
after ?? is ?, ?, ?, ?, ? then it is pronounced
as "??" otherwise it is pronounced as
"??".Example - 1., ??????, ?????, ?????????? ? ?????? ?
???? - 2. ????, ????, ????, ?????, ?????
- 3. ???, ???, ????, ????, ????
27Contd3 Also, when ?? comes as a last character of
the word then it gives the sound of
"??".Example???, ?????, ??????????
???? ??? Interestingly the
same rule of ?, ?, ?, ?, ? is applied in Urdu
also but mostly the character ? is used in
proper nouns and English words.Example ??????
????? ??????
??????, ???? ??????
28Contd4
- ?? This also is the combination of ? and
?Example It does not come as the first
character of the word however it comes in the
middle and last.Example??????????, ???????,
??????, ??? - ??? ????? ???? ??? ???? ?????
29Contd6
- ?,?,?These characters do not exist in Urdu.
Instead ? is used.Example??????, ???????
???? ?? ? ??????????(??? ??), ??????(????
??) ???? ? ????????, ??? ???? ? ???
30Contd7
- ?, ?These two characters have almost the same
sound with a minute difference.In Uru there is
only one character ? to express this
sound.Example - ?????,??????, ?????
- ????? ? ????? ? ???
- ????, ???????, ??????
- ???? ???? ????
31Diacritic marks in Urdu
- In Urdu, there are no Matras like Hindi.
- Urdu has some diacritic marks but uses them
only in elementary books.
32Contd2
- Zabara ? This is placed above the character
to indicate a consonant with ?. - Ex- ?? ?Zer ? This is placed below the
character and to indicate a vowel ? or ?. - Ex- ?? ??, ??
33Contd3
- Pesh ? This is placed above the character
and creates the sound of ? along with the sound
of the character on which it is applied. Ex-
?? ??Jazama ? Equivalent of Halant in
Devanagari - Ex- ?? ?? ???? ????
34Contd4
- Tashdeed ? This is used for reduplication as
in ???? _????? ????? ??????????? , ?????
Do zabar ? This is placed above the last
character Alif ( ? ) and gives the sound of n .
It may be noted that the character just before
Alif should be with Zabar.Example ????? ?????
(the character ? is with Zabar but normally it
is not written)
35Contd5
- Do zer ? This is placed above the last
character Alif ( ? ) and gives the sound of n .
It may be noted that the character just before
Alif should be with zer but normally it is not
written.Example ????? ??? ????? ????? ???
?????? (The character ? is with Zer )Ulta
pesh ? This is placed above the character and
gives the sound of oo (?)Example ????? ???? ?
????? ?????
36Contd6
- Khada zabar ? This adds the sound of Alif to the
character on which it is applied. Mostly it is
put on Choti ye and Badi ye (? ? ? ) and the
efect of it ie Aa sound is transfered to the
character earlier to Choti ye and Badi ye. It
comes in the middle also and the character on
which it is applied is added with the sound
Aa.Example ????? ? ?????? ? ????
??? ???????? ?????
37Contd7
-
- Other Diacritic marks are pesh and khada zer.
38Gender
- Rules for gender in most of the words which
have been derived from Hindi/Indian languages, do
not change between Urdu and Hindi. But for some
of the words which have been borrowed from Arabic
or persian, the gender changes.vyavastha
(feminine) ?????? (Masculine)aakarshan
(Masculine) ??? (feminine)prakash (Masculine)
????? (feminine)
39Compound Words
- In Urdu two words are joined together.
- ( Same as in English where Apostrophe is used
to join the words and Apostrophe gives the sense
of "of"). In Urdu, the words are joined by
"Izaafat". There are three types of Izaafat.
40Contd2
- 1. Zer ? is added after the first word and then
the other word is written. Example ???? ??
Most important thing is that there has to be
space after zer ? other wise the words may join
together and will be problematic to read
correctly.Example ???? ??If space is not
given then the word will appear like this. ??????
41Contd3
- 2. If the last character of the first word is
"he" or "choti ye", then Hamza is added after the
first word. Example ????? ??? ????? ???
42Contd4
- 3. If the last character of the first word is
alif or wav then "Hamza Badi ye" is added after
the first word.Example ???? ???? ??? ??
43Compound words without Izaafat
- Normally in Hindi some words are written
togetherExample isaka, usaka, Taajmahal etc but
in Urdu they are written separately. - ?? ??? ?? ??? ??? ???It may be noted that if
Tajmahal is written together then it will not be
readable. ??????
44Typographical errors
- 1. In Urdu when choti ye ? and ? come as a
last character of a word then it retains its
original shape. But if it comes in middle then it
is difficult to recognize because the appearance
is same in hand written text and Word processing
packages like Inpage. Unicode badi ye does not
join with the following character. example ????
????
45Contd2
- In all Urdu packages this will be written as
???? which creates ambiguity and transliteration
through machine becomes problematic.
46Contd3
- 2. There are certain characters in Urdu which
do not join with the following character. These
characters are ?????? ?? ?? ?? ?? ?? ? ? . Data
entry operators do not care much to give space
between the two words as it is difficult for them
to notice the joined position of the words. Due
to this machine takes the two words as one and
fails to process the word.
47Contd4
- 3. The diacritic marks are ignored in Urdu and
hence apparently there is no difference between - ?? ?? and ?? ??
48Ambiguity
- ??? ??? (noun)
- ??? ??? (verb)
- In Urdu both words are written in the same way
but meaning is different. The spelling in Hindi
is also different.