![]() |
Webpages of Tamil Electronic Library (C)K. Kalyanasundaram
Tamil Script Code for Information Interchange (TSCII) |
After nearly three years of discussions, Internet Tamil community has agreed upon an Encoding scheme for Tamil called "Tamil Script Code for Information Interchange (TSCII)". During Fall 1998, this standard was submitted to the Tamilnadu Special Advisory Committe for Tamil Computing for possible adoption as a glyph encoding standard for tamil. A 8-bit bilingual glyph encoding scheme forms the basis of TSCII. Details of the proposed TSCII standard are available at the TSCII website.
A major goal of the proposed Script Code is to unify the mode ofinformation interchange in the Internet via Email, WWW, pdf etc by all of us using the same font encoding scheme. This way tamils worldwide can readily access tamil pages without the need to download and install one tamil font for each website.
Several TSCII-conformant font faces are available free for download from the following websites: TSCII website or from my Tamil Electronic Library Website. Tamil fonts and Tools (Text Editors, Keyboard Editors, File Convertors,...) are available FREE for usage on all three of the commonly used computer platforms Windows, Macintosh and Unix OS. TSCII-conformant font faces are readily identified by the suffix TSC after the font name, e.g. MylaiTSC, Sri-TSC, InaimathiTSC, MaduramTSC, TneriTSC,.. Time is ripe for all of us to start using the proposed Tamil Standard TSCII in the World Wide Web. In this page, we would like to provide some guidelines for setting up Webpages in Tamil and also give pointers to sample tamil webpages based on TSCII.
Latin-1 scheme does not include any characters in rows 8 and 9 (slots 128-159). In order that
the grantha characters and tamil numerals (present in slots 128-159), careful attention needs
to be paid in the preparation of HTML files for 8-bit fonts. Based on many successful trials,
the Internet Working Group for TSCII strongly recommends that the tamil text be present
as a raw 8-bit text. For this, files can be generated in HTML format using simple text
Editors or ensuring that "user-defined" option is chosen before entering the tamil text in
the html file. Most of the web-editors introduce descriptions automatically
inscribing the encoding used at the time of the html file preparation. (see below).
Till that time, an immediate option is to invoke "x-user-defined" case for the char-set
in the META Header of the HTML file and have the end-user choose TSCII-conformant Tamil
font as the font to use for the "User-defined" encoding (using Browser Preferences Menu).
Implementation: For this to work properly the following must be done:
Author of the HTML file must place a META header part at the beginning of the font
file that should read as follows:
Advantage of Option (i): Each client can use his/her own TSCII font to view
the tamil page (and no need to download any font for any tamil website)
Advantage of Option (ii): Allows the author of HTML a finer control in the Web
presentation using font faces of personal preference/choice.
There is a growing trend in the internet to avoid using tags and go
for character-set option.
Recommendations of Internet Working Group for TSCII
Based on several successful trials on several of the commonly used web-browsers in
different computer OS, IWC recommends that the Tamil webpages carry the tamil text as
raw 8-bit text and with Meta-Headers that specify "x-user-defined" as the charset.Sample Web Pages in Tamil based on TSCII format
The following are sample collections of tamil webpages based
on TSCII encoding scheme. You can view them in tamil script using any TSCII-conformant
font face.
- - - - - - - - - - - - - - - - - - - - -