a note on numerals in tamil script sourashtran …...a note on numerals in tamil script sourashtran...

A note on numerals in Tamil script Sourashtran orthography, their use in orthographic

systems, and the order of signs in the akshara

Christopher Miller, February 2016

Numerals in the adaptation of Tamil script to writing Sourashtram

Sourashtram is a Indo-Aryan language spoken by a minority population in Tamil Nadu

whose historical traditions recount a migration from the Gujarati-Maharashtra region (Dave

1976; Randle 1944) — their name itself harks back to the Saurashtra or Kathiawar peninsula

of Gujarat. Their language is closely related to Gujarati and Marathi but its precise

relationships remain to be clarified. Traditionally, Sourashtram was written in at least two

different scripts closely related to each other, to the Marathi Moḍi script and to Mahajani and

other commercial script varieties of North India (Miller 2014). As a minority in Tamil Nadu

and before that in Telugu-speaking areas, bilingualism in the majority languages led to

Sourashtram being written in Telugu and Tamil scripts as well, and in fact the vowel and final

consonant signs of both Sourashtran scripts are largely adapted from the two South Indian

scripts.

Figure 1. The Obula wedding invitation

I recently received a wedding invitation (Figure 1) from Subramanian Obula, who actively

promotes Sourashtran language, culture and script with online blogs and videos (see links in

references section). The invitation is written for the most part in Sourashtram, but in two

different scripts in parallel. The first is the modernised version of the script first put into print

by the scholar Rāma Rāo around the turn of the 19th century (Rāma Rāo 1902, inter alia).

After each line in Rāma Rāo script, the same Sourashtram-language text is repeated in an

adaptation of Tamil script, for the benefit of readers literate in the latter but not the former.

Figure 2. Superscript numerals in Tamil script Sourashtran orthography

This is an interesting strategy for contrasting voiceless aspirated stops cf. (without marked

vowels) /kh/ (spelled க2), voiced unaspirated stops cf. /g/ (spelled க3) and voiced aspirated

stops cf. /gh/ (spelled க4 )with voiceless unaspirated stops cf. /k/, (spelled க), a four-way

contrast that Sourashtram shares with other Indo-Aryan languages. Since Tamil stops have

no phonemic contrasts in voicing or aspiration, Tamil script early abandoned Brahmic letters

for those other than the voiceless unaspirated series (k c ṭ t p), which was taken as basic,

each letter being the first of its series (varga) in the traditional varṇamālā arrangement. As a

result, writing the other three series of Sourashtram stops in Tamil script required some sort

of workaround strategy.

In itself, using numerals as part of the orthography of a language is unusual, so this seems

an appropriate place to review the few cases I know of where they are used to represent the

sounds of a language in one way or another. After this, I will come back to the specific way

the numerals are ordered relative to the basic vowel and consonant signs, as this is relevant

to the structure of the written akshara.

Use of Numerals in Scripts

It is not uncommon to use numerals in notation systems, and numerals are indeed used to

represent the tones of tonal languages, one of the better-known cases being the Wade-Giles

romanisation system for Mandarin, largely supplanted by Pinyin Zimu since the 1980s.

Where Pinyin represents tones with iconic diacritic accents (xiōng, xióng, xiǒng, xiòng),

Wade-Giles simply placed a numeral for each tone (in conventional citation order) at the end

of the written syllable (hstiung1, hsiung2, hsiung3, hsiung4).

Thai, as well as Lao (Lew 2014), adopted a similar strategy. In Thai, one of four tone

diacritics is placed above the base letter of an akshara to indicate (together with the series

the consonant belongs to as well as syllable structure) which tone is to be pronounced on its

vowel. The names of the diacritics are clearly borrowed from an Indo-Aryan language; the

form of the respective diacritics, though different from Thai numerals, is closely related to

numerals from 1 to 4 in most North Indian scripts but rotated leftward (compare the

Devanagari numerals in Figure 3); these facts taken together seem to indicate that they

were specifically borrowed from a similar source for this purpose.

Name: mai + ek to tri cattawa

Thai tone marks

Devanagari numerals १ २ ३ ४

Thai numerals

Figure 3. Thai tone marks compared with Devanagari numerals

Jawi Arabic script for Malay used the inline Arabic numeral ‹2› to represent word root

reduplication and this was transferred into the older Malaysian and Indonesian Latin script

orthographies. The modern common Malaysian-Indonesian orthography got rid of that

abbreviation, but it is still used fairly often in informal writing. It is also used the same way,

under the name pada pangrangkep, in Javanese script.

The general idea was also adopted in the Makasarese "Jangang-jangang" script, but put to

another use. In South Sulawesi (and by inheritance the Philippines), pairs of syllable with the

same onset and a vowel sign on each could be abbreviated by writing the onset letter once

and doubling the two vowel signs on that letter. Since that didn't work if one akshara was

read with /a/, Makasarese scribes at some point adopted the Arabic number as a simpler-

shaped stand-in for the second letter (except, generally for <n>, which was a simple arch). If

one of the two or both had a vowel other than /a/, then it was marked as usual with the

corresponding vowel sign. The Makasarese parajanjiyana, borrowed from Malay perjanjian

‘treaty, agreement’, is one example of this use of ; instead of repeating the ‹j› letter with its

‹-i› vowel dot above, the vowel dot is placed over the as a stand-in for the more complex

letter.

Arabic ‹2› Jangang-jangang ‹pa.ra.ja. i.ya.na› parajanjiyana

Figure 4. Arabic ‹2› and its use in Makasarese Jangang-jangang spelling as a dummy

character for syllable onset repetition

The way it is used in Jangang-jangang is especially interesting, since like the other Sumatra-

Sulawesi-Philippine scripts, it doesn’t have its own (Indic) numerals and simply spells out

numbers or borrows Arabic or European numerals. There have been various proposals to

add “indigenous” numerals to one or another of these scripts, none of which has ever gained

any ground — though a curious quinary system called angka bejagung i.e. ‘barley-like

numbers’ was used for calculation and record-keeping by some Rejang users of Surat Ulu in

South Sumatra (Jaspan 1967).

There are two other, somewhat different cases of numerals used in a script that rely on the

way Arabic letters are used in the old Aramaic order as numerals, the same way the letters

of Aramaic, Hebrew, Greek and other scripts were used to represent numbers before Arabic-

Indic numerals replaced the older systems for most purposes. However, these two scripts

reverse the situation by using the Arabic-Indic numerals to represent letters.

1. One was a cipher script, apparently developed in India, that used the Arabic

numerals from 1-9 to stand in for Arabic letters. Since Arabic has 28 letters but there

are only 9 numerals (other than 0), the form of the numerals (usually the length of the

stem) changes as they are used for different powers. In the original version of the

cipher, the first series uses 1-9 for the first 9 letters of the abjad order ‹ˀ›, ‹b›, ‹j›, ‹d›,

‹h›, ‹w›, ‹z›, ‹ḥ›, ‹ṭ›, but they are shortened vertically. The same numerals are used

with their normal vertical length for ‹y›, ‹k›, ‹l›, ‹m›, ‹n›, ‹s›, ‹ˁ›, ‹f›, ‹ṣ›), which

correspond to 10-90, and the numerals are then lengthened below the baseline for

‹q›, ‹r›, ‹š›, ‹t›, ‹θ›, ‹x›, ‹ð›, ‹ḍ›, ‹ẓ›, which correspond to 100-900; finally an extra tail

is added at the bottom of 1 for 1000 (i.e. ‹ğ›). A more complex version was adapted

via Jawi Arabic script correspondences to write Malay sounds and then a second

time, via correspondences between Jawi and Bugis-Makasarese script, to write riddle

poetry and secret correspondence in Bugis as Lontara’ bilang-bilang ‘number script’

(Figure 5) .

Figure 5. Explanation of Lontara’ bilang-bilang cipher script from Matthes (1883)

2. The other is Thaana in the Maldives: it uses the Arabic numerals 1-9 for the first five

letters, but their values don’t correspond at all to the corresponding abjad letter.

(However, 9, which looks like Arabic wāw, is used for that letter, like a very few other

non-numeral-based letters in the script that somewhat resemble their Arabic

counterpart.)

haa shaviyani noonu raa baa ḷaviyani kaafu alifu vaavu

Figure 6. Thaana letters and corresponding Arabic numerals (Indo-Persian variants)

These are interesting examples in themselves, but there are no other examples I know of

where numerals are used diacritically, though the semi-systematic graduated use of

one/two/three dots to distinguish letters with the same basic shape in Arabic script, and even

up to four in some of its derivatives (e.g. Sindhi), comes close. The Sourashtran adaptation

of Tamil script fits right in with the kinds of adaptations talked about in Miller (2014):

improving a known but ill-adapted script’s fit to the phonological needs of the language by

borrowing and integrating (parts of) an external system to “fill in the gaps”.

The whole question of the relation of the numeral component to the rest of a script is an

interesting one. It’s quite common for scripts with their own numerals to replace (or

augment) them for some or all purposes with a completely different numeral system (most

usually the Western Arabic-Indic numeral set). It strikes me, for example, how the

Sourashtran text in the wedding invitation uses its own Nagari-derived numerals whereas

the equivalent in Sourashtran Tamil script uses European numerals (Figure 7).

Figure 7. Nagari-derived Sourashtran numerals versus Tamil script use of Arabic-Indic

equivalents

I also find it interesting that modern Sourashtran script abandoned Rāma Rāo’s original use

of subscript conjuncts closely modelled on Telugu. The modern use of a Devanagari-like

halant/virāma below the vowelless letter seems directly modelled on the Tamil puḷḷi dot,

perhaps a concession to the Tamil spelling system most Sourashtrans in Tamil Nadu are

acquainted with. This can be seen in the first word of each of the second and third lines

illustrated in Figure 7: ‹“upamanØyu”› and ‹vilØlāpuramØ›, where I use Ø to stand

respectively for the Sourashtran halant and Tamil puḷḷi zero-vowel signs.

Position of the numeral diacritics and akshara structure

Apart from the fact that the diacritic numerals that distinguish the other three classes of

stops from voiceless non-aspirates are unusual in themselves, their placement in Tamil

script adapted to Sourashtram is worth noting. Rather than appear right next to the base

consonant letter itself, they are placed after any following vowel sign. If the sign appears

above, below or to the left of the letter, the numeral will appear immediately to the base

letter’s right, but in the case of ‹-ā› and ‹-o›, both of which use a detached sign to the right of

the base letter, the diacritic numeral is always placed to that sign’s right, rather than between

the base letter and the following sign. This can be seen in bovni in the first example in Figure

8 and in mhaṭo, beginning the second line.

This relates to another peculiarity of how Tamil script is adapted to spelling Sourashtram.

Apart from the normal Indo-Aryan complement of aspirated stops, voiced and voiceless,

Saurashtram has several aspirated sonorants that are spelled with a diacritic that

immediately follows the consonant letter in Rāma Rao script. This diacritic is found, for

example, in ‹mhu› and ‹rhi› in Rāma Rao (1902) — apparently the normal conjunct

form in that earlier version of the script — and may likely derive originally from a handwritten

variant of Nagari ‹h› similar to the found in some Gujarati Nagari annotations in Avestan

scriptures, by eliminating the adjunct vertical stroke on the left; the ‹h› letter in the quite

different Hāḷivi Sourashtran script appears to derive from a similar shape by starting with the

vertical stroke and following through to the serpentine body with a transitional loop (cf.

Figure 1 in Miller 2014). Figure 8 shows three examples of the modern reformed script’s

spelling of /mh/ and one of /rh/.

viveha rubh mhuhūrtu bovni patriko

mhaṭo mhoṭṭān rhī

Figure 8. Spelling of /rhV/ and /mhV/ in Sourashtran and Tamil Sourashtran scripts

For each Tamil script example in Figure 8, the aspiration is spelled with a visarga (:). Just

like the diacritic numerals, the visarga appears after a postconsonantal vowel sign, which

can be seen in mhoṭṭān on the second line of the figure. In both cases, the structure of the

akshara requires the vowel sign to be directly adjacent to its base consonant letter, with any

diacritics modifying the consonant moved to the end of the sequence of signs making up the

akshara. This is reminiscent of the behaviour of the repha sign for an akshara-initial /r/ in

Devanagari and Gujarati scripts: although it represents a preceding /r/ orthographically

annexed to the onset of the following akshara, it is not always written on the consonant letter

itself (unlike, for example, postconsonantal /r/ in a /pr/ sequence where the combination of

‹p› प with following ‹r› र yields the compound form ): instead, the repha preconsonantal ‹r-›

is written on the rightmost component of an akshara. Thus, when the vowel of the akshara

is long and written with ‹-ā› , ‹-o› , ‹-au› or ‹-ī› , it appears above that vowel sign and

not the letter representing the consonant that it precedes in speech. This can be seen, for

example, in the name Śarmā. (As a learner of Devanagari, I had to force myself not to

read this iconically as /ʃəmɑːr/ rather than /ʃərmɑː/ as the orthography demands!)

Both the Devanagari and Tamil-Sourashtran examples illustrate how the akshara as an

orthographic reflection of the syllable can lead to non-iconic orderings of elements, treating

the C+V sequence as the core. Yet other akshara-based orthographies, such as Lampung,

Surat Ulu and Batak in Sumatra (see Miller 2014), allow the vowel sign to be displaced from

its logical host onto a following consonant letter marked with a “vowel killer” as a syllable

coda. Nonetheless, even in this case, the displaced vowel sign is always immediately

associated with the coda consonant letter and always precedes the “vowel killer” sign. These

conventions can be traced to ultimately arbitrary changes occurring in previous stages of the

evolution of the relevant scripts, but whether any general synchronic ordering constraints

valid for all akshara orthographies exist remains an interesting question for further research.

====

We would like to thank Mr. Obula S. Subramanian who kindly gave us permission to share

his son’s wedding invitation on the Lingua Akshara. Here is a note from him on the

orthography (personal communication, 16.1.16):

Since this script is not taught in Schools, and since almost all Sourashtras know Tamil

language (as they study in schools Tamil and English), we are using Tamil script with

superscript numbers 2,3, and 4 for the consonants ka, ca, Ta, ta and pa since Tamil has

only 18 consonants. Sourashtra language has 35+4 consonant symbols.

We live in Tamil Nadu State of India and almost all Sourashtras are bilinguals knowing Tamil

language besides Sourashtram which they speak in their homes.

References

Dave, I. R. (1976). The Saurashtrians in South India. Their Language, Literature and

Culture. Rajkot: Saurashtra University.

Jaspan, M. (1967). Symbols at work. Aspects of kinetic and mnemonic representation in

Redjang ritual. Bijdragen tot de Taal-, Land- en Volkenkunde 123 (1967) No. 4, 476-516.

Lew, Sigrid (2014). A linguistic analysis of the Lao writing system and its suitability for

minority language orthographies. Writing Systems Research Vol. 6, No. 1, 25-40.

Matthes, B.F. (1883). Eenige Proeven van Boegineze en Makassaarsche Poëzie. The

Hague: Nijhoff (Google Books digital edition).

Miller, C. (2014). Devanagari’s descendants in North and South India, Indonesia and the

Philippines. Writing Systems Research Vol. 6, No. 1, 10-24.

Rāma Rāo, T. M. (1902). Saurāṣṭra-nīti-śambu [A Hundred Saurashtra Moral Maxims].

Madras.

Randle, H. N. (1944). The Saurashtrans of South India. Journal of the Royal Asiatic Society

of Great Britain and Ireland, 2, 151–164.

Internet links

“Upamanyu” blog. http://subramanian-obula.blogspot.com/

Subramanian Obula blog. https://plus.google.com/109960219105236913501

Subramanian Obula gallery. https://www.flickr.com/photos/29347649@N02/

Subramanian Obula YouTube channel. https://www.youtube.com/user/upamanyuoss

http://subramanian-obula.blogspot.com/

https://plus.google.com/109960219105236913501

https://www.flickr.com/photos/29347649@N02/

https://www.youtube.com/user/upamanyuoss

a note on numerals in tamil script sourashtran …...a note on numerals in tamil script sourashtran...

Documents