WX notation

WX notation is a transliteration scheme for representing Indian languages in ASCII. This scheme originated at IIT Kanpur for computational processing of Indian languages, and is widely used among the natural language processing (NLP) community in India. The notation (though unidentified) is used, for example, in a textbook on NLP from IIT Kanpur.^[1] The salient features of this transliteration scheme are: Every consonant and every vowel has a single mapping into Roman. Hence it is a prefix code, advantageous from computation point of view. Typically the small case letters are used for un-aspirated consonants and short vowels while the capital case letters are used for aspirated consonants and long vowels. While the retroflexed voiceless and voiced consonants are mapped to 't, T, d and D', the dentals are mapped to 'w, W, x and X'. Hence the name of the scheme "WX", referring to the idiosyncratic mapping. Ubuntu Linux provides a keyboard support for WX notation.

Vowels

अ	आ	इ	ई	उ	ऊ	ए	ऐ	ओ	औ
a	A	i	I	u	U	e	E	o	O

Sonorants

ऋ	ॠ	ऌ
q	Q	L

Anusvāra and visarga

अं	अः
M	H

Consonants

क्	ख्	ग्	घ्	ङ्	Velar
k	K	g	G	f
च्	छ्	ज्	झ्	ञ्	Palatal
c	C	j	J	F
ट्	ठ्	ड्	ढ्	ण्	Retroflex
t	T	d	D	N
त्	थ्	द्	ध्	न्	Dental
w	W	x	X	n
प्	फ्	ब्	भ्	म्	Labial
p	P	b	B	m
य्	र्	ल्	व्		Semi-vowel
y	r	l	v
श्	ष्	स्	ह्		Fricative
S	R	s	h

This scheme was further extended to represent all the Indian scripts derived from Brahmi.

References

↑ Akshar Bharati; Vineet Chaitanya; Rajeev Sangal (1996). "Appendix B". Natural Language Processing: A Paninian Perspective (PDF). Prentice-Hall of India. pp. 191–193. ISBN 9788120309210. Retrieved 16 February 2014.

External links

This article is issued from Wikipedia - version of the Thursday, June 04, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.