Home
ArabTEX a System for Typesetting Arabic User Manual Version 3.00
Contents
1. A gt CHAPTER 5 INPUT CODING CONVENTIONS 14 jim with three dots below ha with three dots above Table 5 2 Additional codings generally available It may also be written A likewise I and U will produce madda on ya and on waw as required in some older writing conventions e The coding lt gt for ayn is a single left quote beware of confusing it with hamzal e The invisible consonant lt gt may be inserted in order to break unwanted ligatures and to influence the hamza writing It will not show in the Arabic output or in the transliteration At the beginning of a word it will suppress a following short vowel otherwise it acts like a consonant e The sequence lt gt will insert a small space as does lt gt see Section 5 2 below The adjacent characters will not be connected e Sadda is indicated by doubling the appropriate letter coding e The definite article is separated from the following word by a hyphen It may be written in the assimilated form if it exists lt as salaamu gt or always as lt al gt in that case a subsequent sun letter must be doubled lt al ssalaamu gt to receive a Sadda and to prevent a suk n on the lam The transliteration in both cases is identical e Hyphens lt gt are used for tying words together or for indicating a con necting vowel in Arabic or an izafet connection in Persian They may be used freely and generally d
2. A Ambros Einf hrung in die moderne arabische Schriftsprache 1 Auflage 1969 Max Hueber Verlag M nchen ASMO 449 7 bit coded Arabic character set for information interchange Arabic Standards and Measurements Organization 1982 J D Becker Arabic Word Processing Comm ACM 30 7 600 610 1987 T Borg Arabisch fur Auslander Ein Lehrbuch f r modernes Hocharabisch 2 Auflage 1979 Verlag Borg GmbH Hamburg J A Boyle Grammar of Modern Persian Wiesbaden Otto Harrassowitz 1966 B Comrie ed The World s Major Languages Croom Helm London 1987 DIN 31 635 Umschrift des Arabischen Alphabets Deutsches Institut fur Normung e V 1982 J Lavagnino and D Wujastyk An Overview of EDMAC A plain TEX format for critical editions TUGboat 11 4 623 643 1990 L P Elwell Sutton Elementary Persian Grammar Cambridge University Press 1963 29 CHAPTER 10 REFERENCES 30 C Faulmann Das Buch der Schrift enthaltend die Schriften und Alphabete aller Zeiten und aller V lker des gesammten sic Erdkreises K K Hof und Staatsdruckerei Wien 1878 W D Fischer Grammatik des Klassischen Arabisch 2 Auflage 1987 Verlag Otto Harrassovvitz VViesbaden A Grohmann Arabische Pal ographie Teil I und II Osterreichische Akademie der Wissenschaften Philosophisch historische Klasse Denkschriften 94 1 Wien 1967 E Harder A Schimmel Arabische Sprachlehre 15 Auflage 1983 Julius Groos Verlag He
3. beu components of compounds can be separated by or sA heb _hAneH se le sahebhaneh ta ht e hwAb She cx taht e hab pasl land z 51381 Us pasandaz naw AmUz 565 Ei nauamauz bI _hwod 532 3 bihod Appendix H Alternate input encodings H 1 ASMO 449 ISO 9036 The file asmo449 sty contains a reading module for the ASMO 449 code identical to ISO 9036 It is installed by the IAIpX option asmo449 or by input asmo449 sty The module is activated by setcode asmo449 or setcode 1509036 all following Arabic text will be considered to be coded according to the ASMO 449 standard The ArabTEX notation may be reacti vated by Nsetcode arabtex ASMO 449 see Table H 1 is a 7 bit code differing from ASCII ISO 646 mainly by replacing the letters by the Arabic letter characters and diacritical marks the Arabic digits share their positions with the ASCII digits The positions of special and control characters in both codes are identical A minimal driver file for processing e g a file asmotext dat could be structured as follows documentstyle arabtex asmo449 article begin document setcode asmo449 begin arabtext input asmotext dat the preceding blank line is required if asmotext dat did not end with a blank line itself this is strange and embarrassing end arabtext end document 48 APPENDIX H ALTERNATE INPUT ENCODINGS DEJESENESESENPSES mper Lafe lo ee pe sex pe
4. new paragraph without indentation not inside Arabic quotations e emphasize Arabic item will put a bar over the Arabic item e emphasize group_of_Arabic items will put a bar over the indicated group of Arabic items e setnash setnashbf setnastalig font selection commands see Section 4 e size changing IXTp X commands like large etc only if IATpX is used e the following commands footnote observe that the syntax for Plain TEX and IATpX is different marginpar also with Plain TEX analogous to the ATEX usage e the TEX IATEX commands smallskip medskip bigskip input hfill u for a space space with their usual meaning e nospace will place the adjacent items in the output in contact without any intervening space e hspace width will introduce the indicated amount of spacing in the output e mbox tert puts the tert into a box that will not be split across a line break e Nspreadbox width tert spreads out the text to the indicated width This may be useful e g when typesetting poetry spreadbox width tert hfil1 will inhibit the spreading spreadbox width hfill tezt hfill will center the text inside the box spreadbox width hfill or Nspreadbox width just intro duces the indicated amount of horizontal space as will hspace width If two boxing commands follow each other without any intervening blank space in the input there will also be no resulting spac
5. use the capital letters lt A gt lt I gt lt U gt or lt aa gt lt iy gt lt uw gt e To get the defective writing of long vowels use lt _a gt lt _i gt lt _u gt e Alif magsura is lt _A gt or XY e The short vowels fatha kasra damma are coded a i u and need not normally be written except in the following cases 12 CHAPTER 5 INPUT CODING CONVENTIONS 13 ZEER a plep ep 0 E ONE ee RER gayn f f fa c ayn a paja peje elsje m BEIGE HM III alif maqsura marbuta Table 5 1 Standard codings for Arabic and Persian at the beginning of a word where they generate alf adjacent to hamza where they will influence its carrier when the transliteration is required in the fullvocalize mode e Tanwin is coded lt aN gt iN or lt uN gt A silent alif if required is supplied automatically it may also be explicitly written lt aNA gt Likewise a silent waw may be written lt NU gt as in lt amruNU gt e hamza is denoted by a single right quote After selecting a language by setarab etc the hamza carrier will be determined from the context according to the rules for writing Arabic words if that is not wanted quote the hamza see Section 5 2 below In the setverb mode the hamza carrier is determined by the following letter see Section 5 5 e madda on alif is generated by a right quote hamza before lt A gt lt
6. 39 u damma 12 38 U IX implementation 32 UA 17 38 uN 13 39 unavailable font 11 Urdu 22 23 user defined commands 5 9 uw 12 38 van Dalen Benno 53 verbatim 17 vowel marks 16 vowels long 12 16 38 45 short 12 38 45 W 19 WA 17 wasla 15 16 19 34 42 Y 12 38 ya dots 22 25 y i wahdat 22 23 46 z r 45 zwarakay 22 58
7. Js qabila qa bi 1a Js gabila Ties between words are indicated by a single hyphen bi baladin Ao bi baladin ta al 11 ahi WE ta Ll hi sa ya tI ke sa yarti li yafra ha Es li yafraha wa iswadda 3323 wa swadda ba da mA LAS basda ma t la mi Uk tala ma fI ma e fi ma al ma p ala ma A single hyphen at the beginning or end of a word will enforce the use of the Joining form of the first resp the last character 1f that form exists for special uses only S Cp 5 78 Cpe 8 TST wa 8 ST w S h oh h 4 h h 4 h h a h d d d 4 d 14 N l lA M l 1400 h A yg 1400 h Digit sequences are written in the natural order 1234567890 ywsorvaae 1234567890 3 The ligature otherwise produced automatically looks ugly and has been broken by APPENDIX F CODING EXAMPLES FOR ARABIC 44 Ligatures are generated automatically they can be suppressed by al islAmu EST al islamu al gAru Mel al jaru all gAru 21531 algaru 2 24 PP _tumma 2 tumma _tu mma e tumma zj RG ee A mu hammaduN A muhammadun mul halmmaduN Ass muhammadun Abbreviations and emphasis are indicated by emphasize emphasize sl m gle sim emphasize abll g E ol abj If necessary use grouping by curly braces emphasize alayhi as salAmu p ads alayhi s salamu Appendix G Coding examples for Persian The short vowels e o i are denoted by the lowercase letters a
8. kasra 12 15 16 INDEX Kurdish 21 la 43 language selection 10 IAIpX commands 7 li 43 ligature 16 34 44 breaking 14 16 44 lists 9 long vowels 12 16 Macintosh 32 madda 14 16 45 Maghribi 23 mathematical insertion 7 METAFONT 33 MLS2ARAB 53 Multi Lingual Scholar 53 N 15 16 19 naskh 11 32 33 nasta liq 22 32 nesting 7 9 NFSS 33 NFSS2 33 non Arabic insertion 7 NU 13 39 numbers 43 abjad 53 O 21 46 option abjad 53 arabtex 5 asmo449 18 atrans 20 etrans 20 15088596 18 nashbf 11 nastaliq 11 oldarabtex 35 twoblks 52 Ottoman 21 Pashto 22 23 57 PC implementation 32 Persian 21 Persian copula 22 pis 45 punctuation 6 quotation Arabic 6 non Arabic 7 Roman 7 quoting 13 15 16 Qur an al f 16 reading module 18 Roman insertion 7 adda 14 16 41 on tatweel 15 short vowels 12 silent alif 17 19 size changing 5 8 11 special codings 25 stretching 8 15 25 automatic 25 sukun 15 16 22 34 46 on lam 14 on tatwil 15 sun letter 14 T 40 tabbing environment 6 ta marbuta 40 tanwin 13 15 17 19 39 40 fatha 40 on tatwil 15 tasdid 14 16 tatwil 15 43 TEX commands 7 TEX hash size 5 33 text archaic 25 erroneous 25 TEX XET 33 transliteration 12 19 34 INDEX Encyclopedia of Islam 20 ZDMG 19 twoblks sty 52 U 12 19 38 _U 39 46 U 14 u 12 16
9. letter _hAneH G h nch c e lt eh naH 4 nah u u yal_aH 45 yalah yal A 4 yalah hAneHhA lasl amp hanehha hAneH hA al haneh ha Short edafe is written e or i 2 2 ketAb e U N wits ketab e u rAh e t_U 9 els rah e tu nAmeH i man s Ash n meh i man bInI e An mard 5 ci ga bini e an mard pA i In zan Q5 ox U p in zan bAzU i In zan 05 Cr 5956 baz in zan Long edafe is written _i 2 dAr _i man ga dar i man hU i t U E Ss h i tu 4 u Hemze as ya ye wehdet nesbet zetab is likewise written _i nAmeH _i mu nameh i sormeH _i An sormeh t oA gofteH i aa gofteh i Ye ye wehdet is written I or E ketAb I J S ketab i rAh I Als rah i n meH I cl as nameh 7 dAnA I 36 dana i pArU I 3356 paru i dAnA I keH ASSUIS dana i keh pArU I keH KS 576 p r i keh APPENDIX G CODING EXAMPLES FOR PERSIAN 47 The present tense forms of the verb budeen and the pronominal clitics are written as they are spoken rafteH am 21 3 rafteh am rafteH Im lt 35 rafteh im rafteH I n 3 rafteh i rafteH Id Xl 3 rafteh id rafteH ast 431 3 rafteh ast rafteH and 51 3 rafteh and mard Id ka mard id asb et n Sue asb etan An gA st iau anga st U st KON u st t U st A tu st ketAb I st Cals ketab i si nAmeH I st Zu aab nameh i st The preposition be can be written with or without a hyphen The be man be man be t_U ES be tu be An jk be an be In dek be in beU p
10. problems Apart from some extensions most changes were introduced in order to better conform to the transliteration stan dards and to have less compatibility problems with TEX and ATEX Further versions are expected to be upward compatible if no serious problems will turn up The main differences between versions 1 and 2 are e The font size has increased so the document layout may change The old font nash10 can no more be used as the character locations have been assigned differently e Some Arabic characters are now coded differently ayn is denoted by a left quote and lt c gt z t and lt n gt have been assigned new meanings in order to better conform to the standard transliteration e There are many more ligatures than before This normally need not con cern the user e vocalize will no more generate sukun and wasla except if explicitly indicated by quoting See fullvocalize e Arabic Environments are now always bracketed by the new control se quences beginfarabtext and end arabtext even if only the translit eration is wanted We strongly recommend converting any still existing version 1 input files to the new notation To assist in this migrating procedure the IATpX option 34 APPENDIX C RELEASE HISTORY 39 oldarabtex and or the command oldarabtex will switch to a mode where virtually all places where the old conventions are used will either produce a TEX error message or will be fla
11. sAruN De bassarun nawwara 5 nawwara sayyiduN Ku sayyidun sa AluN Wa sa alun sabiyyull Powe sabiyyun aduwwull SAE caduwwun Instead of iyy uww one can also write Iy Uw nor KA sablyuN go sabiyun adUwul SAS caduwun Assimilation the definite article may be always written al a following sun letter must be written twice like in the Arabic spelling The translitera tion and the use of sukun are adjusted accordingly al ddAru A ad daru al rra gulu Ka ar ragulu al ssanaTu tal as sanatu al nn ru Jl aqn naru al gAru zal al garu al bAbu odi al babu al llaylaTu al sal laylatu al llisAnu ad al lis nu V al 11 ahu 2al laku APPENDIX F CODING EXAMPLES FOR ARABIC 42 The article may also be written in the assimilated form with identical result ad dAru aa ad daru ar ra gulu gt ar ragulu as sanaTu tad as sanatu an n ru jl aqn naru In some special cases the literal spelling must be used alla dI P alladi alla dina 53 alladina allati Ei allati however al lla dAni gadi al ladani al llat ni Kul al latani al llavati ERU al lawati Wasla an auxiliary vowel at the beginning of a word 1s always written but in the middle of a sentence generally without hamza If a vowel precedes the word the auxiliary vowel will be omitted in the transliteration and the wasla sign will be used 1n the spelling wa ismuhu 26
12. the sequel 2 1 Arabic text elements Every Arabic paragraph and every Arabic quotation is a sequence of the following kinds of Arabic items separated by blank spaces or newlines e isolated punctuation marks interpreted as the corresponding Arabic punc tuation mark CHAPTER 2 INPUT TO ARABTEX 7 e numbers i e character sequences starting with a digit A number will be processed using the normal writing sequence from left to right even if it contains letters and or special characters however if the final character is a punctuation mark it will be split off and processed separately e Arabic quotes coded as two left quotes or two right quotes each they may also be written directly adjacent to a word e words i e character sequences starting with a letter or a special non digit character followed by a letter A final punctuation mark will be split off and processed separately The coded characters of a word will in the output be arranged from right to left e a sequence of words numbers and special characters enclosed in curly braces and This introduces a new level of TEX grouping otherwise the constituents are processed normally This feature may be nested Output from all items will be arranged from right to left lines will be broken as necessary Inside an Arabic Environment or in an Arabic quotation you may also have e ArabTEX commands with or without parameters These will b
13. this routine needs a font revision which is hard and tedious whenever this happens the command might well become part of Arab TEX proper 1 3 MLS2ARAB This is an UNIX SED script written by Prof Nicholas Heer University of Wash ington and released for free distribution Tt will almost convert an ASCII file of Arabic text produced by Multi Lingual Scholar to the ArabTEX input notation The conversion is not perfect so some manual corrections might be necessary For operating instructions see the file itself INDEX Index quoting 15 14 15 7 15 Nus 8 NN 8 abjad 53 arabfalse 19 arabtrue 19 begin arabtext 6 34 begin setcode 18 bigskip amp centerline 9 colsep 52 doassign 9 docommand 9 emphasize 8 end arabtext 6 34 end setcode 18 Nfootnote 8 fullvocalize 13 15 16 34 hfill 8 hspace 8 indent 8 input 8 18 input arabtex tex 5 input atrans sty 20 input etrans sty 20 ligsfalse 16 ligstrue 16 magnification 5 marginpar 8 mbox 8 medskip 8 newhanza 41 94 newtanwin 17 40 noindent 8 nospace 8 novocalize 15 17 oldarabtex 35 oldhanza 41 oldtanwin 17 40 par 6 8 pnash 5 pnashbf 5 quiet 26 setarab 6 10 13 21 setcode 18 setcodefarabtex 18 48 50 setcodefasmo449 48 setcodefiso8859 6 50 setcodefiso9036 48 setfarsi 10 21 setmaghribi 10 21 setnash 8 11 setna
14. to trace down errors or to demonstrate the operation of Arab TEX as in the appendix 8 6 Using ArabTeX with EDMAC Arab TEX will cooperate with EDMAC a Plain TEX macro package for criti cal editions written by John Lavagnino and Dominik Wujastyk If EDMAC is already present when ArabTEX is loaded the EDMAC commands will after suitable modifications be available inside an Arabic environment Their argu ments are considered Roman text but may contain Arabic quotations For further details see the EDMAC documentation Chapter 9 Acknowledgments The development of ArabTEX would not have been possible without the as sistance of many people and it is impossible to acknowledge every individual contribution Besides our local team i e Udo Merkel and Heribert Schlebbe helpful advice came among others from Chahriar Assad Benno van Dalen Ivan Derzhanski Wolfdietrich Fischer Ahmed El Hadi Yannis Haralambous Abdelsalam Heddaya Nicholas Heer Iqbal Khan Tom Koornwinder Eberhard Kr ger Asif Lakehsar Jan Lodder Richard Lorch Pierre MacKay Eberhard Mattes Fathy Neamat Allah Bernd Raichle Ulrich Rebstock Mohamed Saba Waheed Samy Annemarie Schimmel Nariman Shehab Dominik Wujastyk and Michio Yano We also have to thank all users who sent error reports comments and suggestions 28 Chapter 10 References B Alavi M Lorenz Lehrbuch der persischen Sprache 5 Auflage 1988 VEB Verlag Enzyklopadie Leipzig A
15. user defined commands but could possibly with other TEX extensions we do not know about With Plain TEX the Arabic font by default is only available at the normal 14 point size which ought to cooperate well with the cm fonts at 10 points A bold variant is also provided For other sizes the user has to change the Nmagnification or to define additional font identifiers himself To change the default inspect the file arabtex tex and redefine the pnash and or pnashbf command accordingly With KIEX the usual size changing commands will also operate on the Arabic font 1A TEX hash table size of 3000 to 3500 is recommended Chapter 2 Input to Arab TEX After activating Arab TEX select one of the Arabic writing styles e g setarab see Section 3 Your modified TEX IATEX system will recognize the following items e normal TEX IATEX text and commands e short Arabic guotations bracketed by lt and gt These must normally fit onto one line of output except if explicitly broken up by or NI com mands see below A quotation may also be started with lt except inside a WIX tabbing environment e longer Arabic texts which are bracketed by beginfarabtext and end arabtext even when using Plain TEX called Arabic Environ ments in the sequel An Arabic Environment consists of one or more para graphs separated by blank lines or par commands Arabic quotations and Arabic environments are called Arabic conterts in
16. 13 wa smuhu f a in sarafa O6 fa nsarafa This also works across word boundaries JA ibni gol amp ya ni h a dA ibnuh_u lt l a h da Bnuh q la u hru g ei J qala hruj An auxiliary vowel at the end of the preceding word may be separated by a hyphen gad i in sarafa al 15 qad i nsarafa ra aW u al bAba dl 131 razav u 1 baba min i ibnih i asi Qa min i bnihi This also works for the article preceding alif al wasl ox o al i ismu E al i smu al i i stirA u KAN cali Stirazu and even if the auxiliary vowel is omitted in the spelling ra guluN i ibnatuh u gamIlaTuN L el gt rajulun i dnatuhu gamilatun mu hammaduN i al qura sIyu gj x muhammadun i l qurasiyu In vowelized writing it may sometimes be advisable to introduce a kasida to prevent the vowel marks from bumping into each other APPENDIX F CODING EXAMPLES FOR ARABIC 43 The particles li and la must be combined with the article except before lam lil rra guli Je lir raguli lal mal gdu JE lal majdu however li llaylaTi dd l llaylati 1i 11 ahi D li ll hi The Name of God is written with a special ligature if it is recognized from the input seguence 11 ah al 11 ahu A zal lahu ta al 11 ahi VE ta 1 lahi Increased spacing Tatwil between adjoining characters may be produced by a double hyphen note the position of the vowel marks qabila Ne qabila ga bi la Js gabila q ab ila Js gabila g a b i la
17. 46 48 50 ISO 8859 6 18 50 ISO 9036 18 48 coding conventions 12 34 commands ArabTpx 7 8 boxing 8 illegal 9 internal 5 ATEX 7 INDEX overview 9 size changing 5 8 11 TEX 7 user defined 5 9 compounds 47 copyright 0 32 dagger al f 12 16 damma 12 15 16 inverted 16 22 39 Dari 21 date 15 default font 5 11 defective writing 12 16 39 definite article 14 19 41 Derzhanski Ivan 45 diacritics 16 diphthongs 45 display mode 7 dots on ya 22 25 E 21 E 22 e 22 23 46 e 22 EDMAC 27 emphasis 44 environment Arabic 6 18 arabtext 6 18 setcode 18 tabbing 6 Farsi 21 fatha 12 15 16 Fischer Wolfdietrich 38 font bold 11 default 5 11 installation 33 nash10 34 nashl4 32 34 nash14bf 33 naskh 11 32 33 56 nasta liq 22 32 selection 11 unavailable 11 grouping 7 44 H 21 23 46 h 15 hamza 13 15 22 40 45 46 carrier 17 40 old style 41 harakat 12 15 16 38 45 on tatwil 15 Heer Nicholas 53 hyphen 15 43 I 12 38 1 22 I 14 i 22 _1 12 16 39 i kasra 12 38 implementation Mac 32 PC 32 U IX 32 iN 13 39 input switching 18 insertion mathematical 7 non Arabic 7 Roman 7 installation 33 internal commands 5 inverted damma 16 22 invisible consonant 14 ISO 646 48 50 ISO 8859 6 50 ISO 9036 48 ty 12 38 izafet 15 22 23 46 kas da 15 25 43
18. Arab TEX a System for Typesetting Arabic User Manual Version 3 00 2 Klaus Lagally November 22 1993 Report Nr 1993 11 Universit t Stuttgart Fakult t Informatik Breitwiesenstra e 20 22 70565 Stuttgart Germany This Report supersedes Report Nr 1992 06 Overview Arab TEX is a package extending the capabilities of TEX IATEX to generate the Arabic writing from an ASCII transliteration for texts in several languages using the Arabic script It consists of a TEX macro package and an Arabic font in several sizes presently only available in the Naskhi style Arab TEX will run with Plain TEX and also with IAT X It is compatible with NESS NFSS2 and the EDMAC package other additions to TpX have not been tried Arab TEX is primarily intended for generating the Arabic writing but the stan dard scientific transliteration can also be easily produced For languages other than Arabic that are customarily written in the Arabic script some limited support is available ArabTeX defines its own input notation which is both machine and human readable and suited for electronic transmission and Email communication How ever texts in some of the Arabic standard encodings can also be processed ArabTEX is copyrighted but free use for scientific experimental and other strictly private noncommercial purposes is granted Offprints of publications using Arab TeX are welcome Using Arab TeX otherwise requires a license agree ment Th
19. CELLANEOUS FEATURES 26 kaf in the final position without a mark dal with a dot below BMU ba without a dot n n without a dot not available in Pashto mode BH alif maqsura ya without dots in all positions Table 8 1 Additional codings for special purposes If further variants are needed write to the author and indicate the required shape the assumed transliteration e a suggestion for the input coding e some information on the intended use We are willing to consider any suggestion Adding a new character might be easy or else it might be impossible Arab TEX is flexible but there are some technical limitations 8 4 Progress report As Arab TEX is slow it will produce some terminal output while running to in dicate it is still alive If that is not wanted e g on a very fast system or while running a batch job say quiet or tracingarab 0 outside an Arabic Envi ronment otherwise say doassign tracingarab 0 tracingarab 1 will only report Arabic paragraphs a value of 2 Arabic lines and insertions a value of 3 or more individual Arabic items CHAPTER 8 MISCELLANEOUS FEATURES 27 8 5 Verbatim copy of the input For test purposes the Arabic input may be reproduced verbatim after showtrue in addition to the normal output showfalse switches this fea ture off again Commands will not usually be shown The output will generally not look pleasant and this feature is only provided in order
20. an Farsi Dari also Ottoman Kur dish e All characters needed for writing Farsi are available by default The short vowels lt e gt and lt o gt are mapped to lt i gt and lt u gt the long vowels lt E gt and lt 0 gt to lt I gt and lt U gt without a vowel indicator lt H gt denotes final silent ha This ha receives no suk n even in fully vowelized mode 21 CHAPTER 7 SUPPORT FOR OTHER LANGUAGES BESIDES ARABIC22 e For fatha or kasra followed by a final silent ha you can also write lt a gt or lt e gt in place of lt aH gt and lt eH gt e The izafet connection may always be written lt i gt or lt e gt with hyphen then the correct spelling will be determined from the context Likewise the ya i wahdat can always be written lt I gt or lt E gt e The present tense forms of the copula are coded lt am gt lt I gt lt ast gt lt Im gt lt Id gt lt and gt In the output they are written as separate words after a little space e The final ya carries no dots Farsi uses the Nasta liq font if available otherwise Naskh For further details see Appendix G 7 2 Urdu e For Urdu additional codings are available see Table 7 1 Some of the given codings also occur in Pashto but with a different meaning see Section 7 3 e The short vowels lt e gt and lt o gt are mapped to lt i gt and lt u gt lt H gt lt a gt and lt e gt are used as in Persian e Even in fu
21. apter 3 Language selection The processing of input text to be written in the Arabic script is somewhat language dependent Thus before the first Arabic quotation or Arabic environ ment you have to indicate the desired processing mode by one of the commands setarab setfarsi seturdu setpashto setmaghribi or setverb no special processing see however Section 5 5 The processing mode may be changed at any time even inside an Arabic environment or an Arabic quotation After selecting a language the symbols lt and gt serve to bracket short insertions in the chosen language Whereas this is usually convenient observe that they can thus no more be used for other purposes except in mathematical mode where they retain their normal meaning as relational operators To temporarily return them to their normal mode of operation deselect the language by setnone Arabic insertions may also be started by lt For further details on supported languages see Section 7 1Note for advanced TEX users All language selecting commands except setnone set the character lt active If Arabic insertions are not needed or are always started with Ng the user may reuse the command lt for other purposes or deactivate it by catcode lt 12 to return it to its normal meaning 10 Chapter 4 Font selection For space economy only the Naskh font is available by default With IEX additional fonts can be loaded by the document style
22. bTEX Installing Arab TEX Release history Sample ArabTEX input Sample ArabTeX output Coding examples for Arabic Coding examples for Persian Alternate input encodings H 1 ASMO 449 ISO 9036 H 2 ASMO 449E ISO 8859 6 22 22 23 24 25 25 25 25 26 27 27 28 29 32 33 34 36 37 38 45 CONTENTS I Miscellaneous utilities L1 twoblkssty eee L2 abjadsty eee 1 3 MLS2ARAB a Index 52 52 53 53 54 List of Tables 9 1 9 2 9 3 7 1 7 2 8 1 H 1 H 2 Standard codings for Arabic and Persian 2 22 220 13 Additional codings generally available 2 2 aaa 14 Verbatim codings for the carrier of hamza 17 Additional codings for Urdu i aaa 23 Additional codings for Pashto aa r 24 Additional codings for special purposes o o oaoa 26 ASMO 449 code table oo nn nn 49 ISO 8859 6 code table 2 on nn 51 Chapter 1 Activating Arab TEX With Plain TEX load the Arab TEX macros by input arabtex tex With IATEX include the option arabtex in the document header In both cases some additional files will be loaded automatically Arab TEX defines several user commands as indicated below There is also a large number of hidden internal commands which could lead to storage hash table overflow in a small TEX implementation All internal commands contain an at sign in their names and thus should not interfere with any
23. dards Thus some alternate reading modules have been written as there CHAPTER 5 INPUT CODING CONVENTIONS 18 are more than 10 different codings in current use this is an open ended activity and a general code switching procedure has been provided An alternate reading module e g asmo449 sty for the ASMO 449 code is installed by adding its name asmo449 as a IATpX style option or by input asmo449 sty Afterwards a code_name in this case asmo449 is de fined Input coding is switched by the command setcode codename that changes the coding for Arabic tert globally or by the environment Nbegin setcoder code name end setcode which follows the normal TEX grouping rules Coding may be switched several times in the same document provided the appropriate reading modules are installed setcode arabtex reverts to the standard ArabTEX notation Please observe that only Arabic text is affected by setcode codename text outside of Arabic contexts and control sequence names are still assumed to be in 7 bit ASCII As existing text files presumably do not contain any control se quences or non Arabic text anyway we suggest using a small ASCII TEX IATEEX driver file setting all relevant options and containing any non Arabic text and calling the Arabic text files by Ninput file name from within an Arabic en vironment For details on available additional reading modules see Appendix H Chapter 6 Translitera
24. e between the boxes in the output CHAPTER 2 INPUT TO ARABTEX 9 e centerline tert will start a new line whose contents are centered not inside Arabic quotations e Nspreadline text will start a new line whose contents are spread out over the whole width of the page not inside Arabic quotations It is approximately equivalent to Nspreadbox hsize Htext e User defined commands whose expansion produces legal ArabTEX input may be called by docommand command and parameters The command is expanded exactly once and the result is processed by ArabTpX again Any side effects of the expansion will be local e Parameter assignments inside an Arabic context may be performed by doassign parameter value The effect is normally local except if the form doassign global parameter value is used e Any non recognized command will generate an error message and will be echoed verbatim in the output Even though ArabTEX tries hard to get into synchronization again additional spurious errors may occur e inside an Arabic Context no further WIX or ArabTpX environment may be nested with the possible future exception of list environments these are not yet implemented For a list of all available commands consult the Index to this report As a reminder a list of all commands that are valid inside Arabic text will appear in the log file 1This is no strong restriction as the expansion may contain docommand calls again Ch
25. e executed immediately e Some but not all TEX IATEX commands see below These will be exe cuted immediately e Short mathematical insertions bracketed by single signs They must fit on one output line and are processed as usual TEX Display mode within an Arabic environment is not provided if it is required the user has to leave the Arabic environment temporarily e short non Arabic Roman quotations containing text and possibly also TEX RIEX commands bracketed by lt and gt These must fit on one output line and introduce a new level of grouping so if they contain any TEX IATEX assignments the effects of these will be local by default This feature is not available within an Arabic quotation The alternate notation N lt is also not provided 2 2 Commands in an Arabic context A control sequence inside an Arabic contert must be separated from the preced ing text item by at least one blank space nevvline or another control sequence and may be of the following kinds CHAPTER 2 INPUT TO ARABTEX 8 e Arab TEX option changing commands These may also be used outside an Arabic Context and usually follow the TEX grouping rules e for a line break the last line will be padded on the left with spaces e for a line break the last line will be aligned If it comes out very badly spaced automatic stretching might help see Section 8 e indent or par or a blank line for a new paragraph noindent for a
26. e or i o or u bar bar beh amp beh bon ei bon The long vowels a i 1 u 4 5 are denoted by the capital letters A I or E U or 0 lef medde is automatically generated for vvord initial a Ab ol ab bad SU bad bId Ay bid bUd 34 bud Note that I yields a ya ye me ruf with zir whilst E yields a ya ye majhul without zir Similarly U yields a waw e me ruf with pis whilst 0 yields a waw e majhul without pis tIr WW tir tE g teg dUr 39 dur zOr 35 zor The diphthongs et and ou are written ay and aw pay Qi pay naw EU naw Intervocalic hemze is written o 2 pA Iz EU paz miyA I gi maya mIgU I Js magi 1 We gratefully acknowledge the voluntary help by Ivan Derzhanski who wrote this chapter and implemented the language specific processing As we exten sively modified his routines during system integration all responsibility for any remaining or new errors rests with us 45 APPENDIX G CODING EXAMPLES FOR PERSIAN 46 tawAnA I hls tawana zanA sU I 45505 zanasu Silent word final waw is generated by _U or 0 t_U q tu AU 33 du tO t dO s gt do Waw e ma dul is written w it is omitted in the transliteration and the pre ceding ze receives no jezm _huAb 2135 hab hwI s ugs his hwod 352 hod Ha ye hewwez e mezfi is generated by H or optionally by e a or A It does not receive a jazm even in fully vocalised mode and is not joined to a following
27. el the corresponding form of tanwin is generated instead Defective writing The coding lt _a gt will produce a Qur an alif accent also called dagger alif instead of an explicit alif character which would be coded lt A gt or aa Likewise lt _i gt will produce a small alif below the preceding consonant in place of lt I gt lt iy gt and lt _u gt will produce an inverted damma in place of lt U gt lt uw gt If a long vowel follows a consonant the corresponding short vowel is implied The long vowel itself carries no diacritical mark If no vowel is given after a consonant suk n will be generated except if a double quote precedes the next consonant The lam of the definite article receives no sukun if a double sun letter follows alif at the beginning of a word carries wasla instead of the vowel indicator if the preceding word ended with a vowel e vocalize As above but sukun and wasla will not be generated except if explicitly indicated by quoting e novocalize No diacritics will be generated except if explicitly asked for by quoting In all modes a double consonant will generate adda and lt A gt always generates madda on alif CHAPTER 5 INPUT CODING CONVENTIONS 17 After lt aN gt the silent alif character is generated if necessary The silent al f may also be explicitly indicated by lt aNA gt or coded literally as lt A gt in novocalize m
28. em dependent and we recommend securing the assistance of a local TeXpert You have to install the nash14 font with its pk and tfm files on the font search path of your TEX system and the sty files and arabtex tex on the source search path usually TEXINPUT of your system Possibly you will also have to rename the pk files according to local conventions and as a last resort you can try to recreate the fonts from the mf METAFONT sources Additional fonts whenever available are installed analogously Arab IEX has been found to cooperate well with TpX versions 3 xxx IATpX versions 2 09 of 1991 or later NFSS and NFSS2 not required and previewers that can handle fonts of more than 128 characters TEX XET or TEX XET are not required and their additional features are presently not exploited The TEX hash size should be at least 3000 to 3500 especially when using Arab TEX in conjunction with IAT X and if the transliteration module is used Use of a BIG TEX may be necessary when using the NFSS2 due to the latter s high demand on string storage Space and time requirements are not negligible and have increased during development however Arab TEX currently still runs albeit slowly even on a PC XT standard configuration 33 Appendix C Release history There was a Version 1 which is no more supported Version 2 was not fully compatible with Version 1 however moving to the new version usually caused little
29. ere is no warranty of any kind either expressed or implied The entire risk as to the quality and performance rests with the user Please send error reports suggestions and inquiries to the author Prof Klaus Lagally Institut f r Informatik Universitat Stuttgart Breitwiesenstra e 20 22 70565 Stuttgart GERMANY lagally informatik uni stuttgart de Copyright 1992 1993 Klaus Lagally Contents 1 Activating ArabTpx 2 Input to ArabTEX 2 1 Arabic text elements 2 2 Commands in an Arabic contert 3 Language selection 4 Font selection 5 Input coding conventions 5 1 Standard Arabic and Persian characters 5 2 Quoting e 5 3 Ligatures 5 4 Vowelization s 5 5 Verbatim input 5 6 Alternate input codings 6 Transliteration 6 1 ZDMG transliteration style 6 2 Encyclopedia of Islam style 7 Support for other languages besides Arabic 7 1 Persian Farsi Dari also Ottoman Kurdish 10 11 12 12 15 16 16 17 17 19 19 20 21 CONTENTS 9 72 Urdu 7 3 Pashto Afghanic 7 4 Maghnbi 7 5 Other languages Miscellaneous features 8 1 Automatic stretching 8 2 Dotson ya 8 3 Additional codings 8 4 Progress report 8 5 Verbatim copy of the input 8 6 Using Arab TEX with EDMAC lens Acknowledgments 10 References A Obtaining Ara
30. gged in the output The changes introduced since the release of Version 2 00 up to now Version 3 00 fall into one of two categories error corrections and upward compati ble extensions Details are not given here but are documented in the text file CHANGES that is part of the distribution package of Arab TEX Version 3 is upwards compatible with version 2 All supported features are documented in this manual Appendix D Sample Arab TEX input documentstyle 12pt arabtex article begin document setarab choose the language conventions vocalize diacritics for short vowels on transtrue additionally switch on the transliteration Yarabtrue print arabic text is on anyway spreadtrue spread out caption centerline lt gu hA wa himAruhu gt beginfarabtext at_A sadiquN il_A gu hA ya tlubu minhu himArahu li yarkabahu fl safraTiN qa sIraTiN wa qAla lahu sawfa u Iduhu ilayka fl al masA i wa adfa u laka u graTaN NN fa q la gu h anA AsifuN giddaN annI 1A asta ti u an u haggiga laka ra gbataka fa al himAru laysa hun al yawma wa gabla an yutimmu gu hA kalAmahu bada a al himAru yanhaqu fl i s tablihi NN fa qAla lahu sadIquhu innI asma u himAraka yA gu hA yanhaqu NN fa gAla lahu gu h garIbuN amruka yA sadIqI a tu saddiqu al himAra wa tuka_d_dibunI end arabtext end document 36 Appendi
31. idelberg vg adi dal acg Dl v ga Hasim Muhammad al Hattat Qawa id al Hatti al Arab Maktaba an Nahda Baghdad Dar al Qalam Beirut 1400 1980 ISO R 233 1961 International System for the Transliteration of Arabic Char acters International Standards Institution 1961 ISO 8859 6 Information processing 8 bit single byte coded graphic character sets Part 6 Latin Arabic alphabet International Organization for Standardization 1987 ISO 9036 Information processing Arabic 7 bit coded character set for infor mation interchange International Organization for Standardization 1987 D E Knuth The METAFONT book Addison Wesley Publishing Comp Reading Mass 1986 D E Knuth The TEXbook Sixth printing Addison Wesley Publishing Comp Reading Mass 1986 D E Knuth and P MacKay Mixing right to left texts with left to right texts TUGboat 8 1 14 25 1987 Ann K S Lambton Persian Grammar Cambridge University Press 1953 L Lamport KIpX A Document Preparation System Addison Wesley Publishing Comp Reading Mass 1986 M Lorenz Lehrbuch des Pashto Afghanisch 2 Auflage 1982 VEB Verlag Enzyklopadie Leipzig P A MacKay Typesetting Problem Scripts BYTE 11 2 201 216 1986 CHAPTER 10 REFERENCES 31 H Ritter ber einige Regeln die beim Drucken mit arabischen Typen zu beachten sind ZDMG 100 2 577 580 1951 Friedrich R ckert Grammatik Poetik und Rhetori
32. imentation by the user If setarab or setfarsi will not produce the desired result try setverb for verbatim mode The vowelization and the transliteration cannot generally be expected to be correct but might work by accident In case some character variants not yet provided are needed feel free to ask the author for help There is no simple way for the user to modify the script Chapter 8 Miscellaneous features 8 1 Automatic stretching For special purposes e g for headlines and for Arabic paragraphs containing long mathematical or non Arabic insertions the connection between adjacent Arabic letters may be made elastic if they form no ligature Thus a kas da is inserted whose length will be adjusted automatically to uniformly fill the output line This feature very easily leads to storage overflow during the processing and should only be used whenever necessary It is switched on with spreadtrue and switched off again with spreadfalse Inside an Arabic Environment it will also be switched off automatically at the end of every paragraph 8 2 Dots on ya Whether ya in the final position carries dots or not is controlled by the chosen language convention You can override this after selecting the language by yahdots and yahnodots 8 3 Additional codings To reproduce exotic erroneous or archaic texts exactly as they are written some additional codings are available see Table 8 1 25 CHAPTER 8 MIS
33. k der Perser Wiesbaden Otto Harrassowitz 1966 C Salemann V Shukovski Persische Grammatik 4 Auflage 1947 Verlag Otto Harrassowitz Leipzig A Schimmel Islamic Calligraphy E J Brill Leiden Netherlands 1970 H J Vermeer W Akhtar A Akhtar Urdu Lautlehre und Urdu Schrift 3 Auflage 1985 Julius Groos Verlag Heidelberg Appendix A Obtaining Arab TEX The ArabTEX system is available from the author s institution by anonymous FTP from ftp informatik uni stuttgart de 129 69 211 2 in the direc tory pub arabtex and from many other common servers e g the CTAN net work Aston Niord Stuttgart The files may be transferred individually or as a package arabtex zip for PC systems arabtex tar Z for U IX systems we recommend to get and inspect the README file first Successfull operation on the Apple Macintosh in conjunction with OzTEX has also been reported At the time of this writing version 3 00 is current The Nasta liq font is still under development Naskh will be substituted automatically Version 2 is down ward compatible the old version 1 is obsolete and should no more be used Arab TEX is copyrighted but free use for scientific experimental and other strictly private noncommercial purposes is granted Offprints of any publica tions using Arab TEX are welcome Using Arab TEX otherwise requires a license agreement 32 Appendix B Installing Arab TEX The installation procedure is strongly syst
34. la ika Tanwtn The plural suffixes un in an are written uN iN aN or aNA Silent alf in an may be indicated by A or omitted if necessary it is supplied from the context ra guluN E5 rajulun ra guliN Je rajulin ra gulaN ves rajulan madInaTaN A madinatan gamIlaTaN L gamilatan i dan 3 ridan samA al sc samdan There is a special case ribaNU gi riban amruNU a amrun amriNU 5 camrin however amraN lA amran APPENDIX F CODING EXAMPLES FOR ARABIC 40 Tanwin fatha is traditionally put on the last consonant even if a silent alf follows Some modern conventions and also Persian practice require to put it on the ahf in this case This behaviour may be switched on by newtanwin and off by Noldtanvin newtanwin mode is the default for Persian ra gulaN 5 rajulan i_daN 13 dan A silent alif mags ra after tanwin is written aNY or aN_A hudaNY gi hudan fataN A fatan compare al hudy gl al huda al fat_A ii al fat Ta marbuta is denoted by T kalimaTuN US kalimatun kalimaTiN amp E kalimatin kalimaTaN i kalimatan fatATuN 3 amp fatatun fatATiN SU fat tin fatATaN SUS fatatan Hamza is indicated by the appropriate carrier is determined by the context 2 La os o pe Rs amruN y amrun ibiluN Jd ibilun u htuN Cx uhtun ze u 2 E L ra suN VE ra sun ar asu V arasu sa ala Ju swala 1 3 q 24 b gt
35. lly vowelized mode an aspirated consonant before lt h gt receives no sukun since the two are technically a single letter e Urdu uses the Nasta lig font if available otherwise Naskh 7 3 Pashto Afghanic e For Pashto additional codings are available see Table 7 2 Some of the given codings also occur in Urdu but with a different meaning see Sec tion 7 2 e The short vowel lt e gt is indicated by a zwarakay lt o gt by an inverted damma Observe also the following codings w hamza on waw lt h gt hamza on ha if not generated by izafet CHAPTER 7 SUPPORT FOR OTHER LANGUAGES BESIDES ARABIC23 Ee alen the wavy ha letter s panem Ez else eem REN Table 7 1 Additional codings for Urdu e The codings lt H gt lt a gt and lt e gt are used as in Persian The rules for izafet and ya i wahdat apply e For writing some Pashto words in the Urdu style write the command seturdu and afterwards switch back to the Pashto conventions by Nsetpashto 7 4 Maghribi Nearly like Arabic but using a different writing convention fa is written with one dot below the letter gaf with one dot above the normal letter form of fa The three dots of va are put below the letter CHAPTER 7 SUPPORT FOR OTHER LANGUAGES BESIDES ARABIC24 ppm EEEN ORNE BENEN eem aaneen pepe Table 7 2 Additional codings for Pashto 7 5 Other languages This is up to exper
36. ls Li r oe e sss el ejos sss esse tate fete sss isu sus yeei fefete afee epee to eeh speed t f pete paj so Pse sss ops Is jeej BE jee Table H 1 ASMO 449 code table 49 APPENDIX H ALTERNATE INPUT ENCODINGS 50 As texts coded in ASMO 449 are always rendered verbatim the commands novocalize vocalize fullvocalize and the language selection com mands setarab etc make no sense and are temporarily disabled Texts in ASMO 449 are usually not fully vowelized Thus the transliteration cannot be expected to be correct This is especially true for Egyptian texts which commonly do not differentiate between ya and al f magsura H 2 ASMO 449E ISO 8859 6 The file iso88596 sty contains a reading module for the ISO 8859 6 code extended ASMO 449 ASMO 449E It is installed by the ISIpX op tion iso88596 or by input iso88596 sty The module is activated by setcode is08859 6 all following Arabic text will be considered to be coded according to the ISO 8859 6 standard The Arab TEX notation may be reacti vated by Nsetcode arabtex ISO 8859 6 see Table H 2 is an 8 bit code closely related both to 7 bit ASCII and to ASMO 449 whereas the lower 128 positions are identical to ASCII ISO 646 the upper 128 positions contain the Arabic characters of ASMO 449 in the analogous places plus a few additional graphic and control characters We exploit the close relationship of these codes by reusing
37. o not change the writing but will show up in the transliteration Additionally at the beginning and the end of an CHAPTER 5 INPUT CODING CONVENTIONS 15 otherwise isolated word they enforce the use of the connecting form of the adjacent letter if it exists like e g in the date lt 1400 h gt e A double hyphen lt gt between two otherwise joining letters will break any ligature and will insert a horizontal stroke tatwil kasida without appear ing in the transliteration It may be used repeatedly See also Section 8 automatic stretching For special applications it can also be coded lt B gt and lt B gt will behave like an ordinary consonant and may carry vowel indicators tanwin sukun and in the combination lt BB gt Sadda 5 2 Quoting In novocalize mode see Section 5 4 a double quote lt gt will modify the meaning of the following character as follows e if a short vowel follows the appropriate diacritical mark fatha kasra damma will be put on the preceding character If lt N gt follows the short vowel the appropriate form of tanwin will be generated instead At the beginning of a word alif is assumed as the first character e if the following character is a single right quote a hamza mark will be put on the preceding character even if in conflict with the hamza rules At the beginning of a word an isolated hamza will be generated e if the following character is the invisible cons
38. ode If a silent alif magsura is wanted instead write lt aN_A gt lt aNY gt lt _A gt or lt Y gt The tanwin fatha is normally put on the last consonant of the word even if a silent alsf follows If it is instead supposed to go onto the al f as in some modern Arabic conventions or in Persian this behaviour can be achieved by the option newtanwin The option oldtanwin will restore the classical behaviour A silent alif after waw is indicated by UA or WA with a capital lt W gt 5 5 Verbatim input eI pe np pen pm mm mnn isolated hamza madda on alif Table 5 3 Verbatim codings for the carrier of hamza After disabling language specific processing by setverb or Nsetnone Arab TEX will not use any context information to determine the carrier of hamza Instead the user has to supply this information himself by the next character typed after lt gt Generally this character will be used as the carrier for examples and some exceptions see Table 5 3 A short vowel indicator may follow To ease automatic conversion an initial alif may also be coded lt A gt 5 6 Alternate input codings The Arab TEX input notation has been very carefully designed for flexibility readability and ease of use for linguists confined to standard 7 bit ASCII equip ment for processing and transmitting data However it does not make much sense recoding existing machine readable text files coded according to other stan
39. onant lt gt the connection between the adjacent letters will be broken and a small space inserted This can also be denoted lt instead of At the beginning of a word alif with wasla will be generated e otherwise a sukun will be put on the preceding character The following character will be processed again The double quote will not show up in the transliteration In vocalize mode see Section 5 4 quoting will turn a short vowel off like wise in fullvocalize mode quoting will also turn a sukun off Put differently quoting will toggle the generation of short vowel indicators and sukun on and off CHAPTER 5 INPUT CODING CONVENTIONS 16 5 3 Ligatures There is no way to explicitly enforce ligatures as a large number of them are generated automatically The results will not always look satisfactory so we recommend inspecting the output after the first run Any unwanted ligature can be suppressed by interposing the invisible character lt gt between the two letters otherwise combined into a ligature After ligsfalse in the middle of a word fewer ligatures will be produced for some texts this looks better You can return to the normal strategy by ligstrue 5 4 Vowelization There are three modes of rendering short vowels e fullvocalize Every short vowel written will generate the corresponding diacritical mark fatha kasra damma except if quoted If lt N gt follows a short vow
40. options nashbf for bold face and or nastalig when available Users of Plain TEX are consid ered specialists and have to define and load suitable fonts at the required sizes themselves The following font selection commands are available e setnash default selects the Naskh font e setnashbf selects a bold face version of Naskh e setnastalig selects the Nasta liq font If a font is not available or has not been loaded the corresponding command will select the default font With LT px the size changing commands will also operate on the additional fonts 11 Chapter 5 Input coding conventions The ASCII input notation for Arabic text has been modelled closely after the transliteration standards ISO R 233 and DIN 31 635 As these standards do not guarantee unique re transliteration and are also not 7 bit ASCII compatible some modifications were necessary These follow the general rules e whenever the transliteration uses a single letter code that letter e whenever the transliteration uses a letter with a diacritical mark put the punctuation character most closely resembling the diacritical mark before the letter and not behind it as in some other coding proposals as otherwise the readability of the input would suffer e use capital letters for writing variants 5 1 Standard Arabic and Persian characters The standard codings for Arabic and Persian are given in Table 5 1 and Ta ble 5 2 e For long vowels
41. oted UA aW or at katabUA 1555 katabi yaktubUA NGC yaktubu ramaWA 1355 ramaw yalqaW Iss yalgaw 1Most of the examples are taken from Wolfdietrich Fischer Grammatik des Klassischen Arabisch 2 Auflage Verlag Otto Harrassowitz Wiesbaden 1987 38 APPENDIX F CODING EXAMPLES FOR ARABIC 39 The defective notation of 7 can be indicated by ca _i _u and leads to the appropriate spelling dAru h_u 5515 d ru h ri gli h_i de rijli hi however ramA hu 265 ram hu yarmI hi 9j yarmi hi dih i od dihi h a dih i os h dihi tih i lt tihi hAtih i ale h tihi rabb i 5 rabbi sAl i due sali hum u A humi 1 H x P 1 qiy_amaTuN a giyamatun il_ahuN Al lahun x 7 ta sam avATuN lg samawatun tal a tun ERG talatun LE l_akin 5J lakin h a d lis hada al 11 ahu AU zal lahu 2 o 4 al rra hm_anu e ar rahm nu _d_alika ali dalika To reproduce the historical writing correctly a silent long vowel or al f magstra after _a receives no sukun and is ignored in the transliteration wou x sal aUTuN sL2 sal tun hay_aUTuN gt hayatun wou vo zak_aUTuN 555 zakatun mi sk_aUTuN Te miskatun 4 pe ar rib aU gi ar riba tawr aITuN 4 35 tawratun ay o u _ _ ram_aYhu 404 ramahu sIm aYhum et simahum The short vowel u can be written as a long vowel by _U ULY Jol ul gt ULA i eal oul i LULU gel ul gt Uldka N gt ulaka UlA ika UN 1 u
42. shbf 8 11 setnastalig 8 11 setnone 10 21 setpashto 10 21 23 seturdu 10 21 23 setverb 10 21 24 showfalse 27 showtrue 27 smallskip 8 space 8 spreadbox 8 spreadfalse 25 spreadline 9 spreadtrue 25 tracingarab 26 INDEX transfalse 19 transtrue 19 twoblocks 52 vocalize 15 16 34 Nyahdots 25 Nyahnodots 25 gt 10 21 NS 1 14 16 B 15 BB 15 11 14 15 ayn 14 hamza 13 A 12 17 38 A 14 16 41 A 46 _A 12 17 38 A 14 a 22 23 46 _a 12 16 39 a fatha 12 38 aa 12 38 abbreviation 44 abjad sty 53 abjad numbers 53 Afghanic 22 ayn 14 al 14 19 alif 17 dagger 12 16 39 initial 17 magsura 12 17 38 40 silent 17 40 Qur an 16 39 silent 17 19 38 40 small 16 39 below 16 39 Allah spelling 43 aN 13 17 39 aN_A 17 40 aNA 13 17 39 55 aNY 40 Arabic context 6 7 Arabic environment 6 Arabic group 7 Arabic item 6 Arabic number 7 Arabic quotation 6 Arabic quotes 7 Arabic word 7 arabtex tex 5 Arab TEX commands 7 8 archaic text 25 ASCII 48 50 ASMO 449 18 48 50 aspirated consonant 22 assignment 9 assimilation 14 16 19 41 automatic stretching 25 aW 38 aw 45 aWA 38 ay 45 B 15 be 47 boxing commands 8 breaking connections 15 code 7 bit 48 8 bit 50 arabtex 18 ASCII 48 50 ASMO 449 18 48 50 ISO 6
43. t add the style option atrans with WIX and with Plain TEX say input atrans sty after loading Arab TEX 6 2 Encyclopedia of Islam style For special purposes the standard transliteration output may be modified by including the KIpX option etrans or by loading the file etrans sty when working with Plain TEX After this modification the transliteration will follow the style of the Encyclopedia of Islam Chapter 7 Support for other languages besides Arabic Arab TEX is primarily intended for typesetting texts in classical and modern Arabic but it also provides some support for several other languages that are customarily written in the Arabic alphabet In order to switch to the conventions for one of these languages say setfarsi seturdu setpashto setmaghribi setverb will switch off any language specific processing setarab can be used to switch back to the Arabic conven tions After selecting the language lt and gt serve as delimiters for quotations setnone will like set verb deselect any language and will also return lt and gt to their normal TEX meaning This part of ArabTEX relies heavily on contributions from the user community we want to especially mention Ivan Dershanski who completely reimplemented the routines for processing Persian As we extensively modified these contribu tions while integrating the system we are solely responsible for any remaining or newly introduced errors 7 1 Persi
44. the ASMO 449 read ing routines after suitable modification of the input This only works correctly if the input text does not contain genuine ASCII letters as we project the Ara bic characters onto their locations in ASMO 449 Some of the code switching messages in the log file are spurious do not worry The notes on vowelization and transliteration of ASMO 449 apply also The driver file indicated for ASMO 449 will be usable after the obvious mod ifications however your TEX installation must be capable of processing 8 bit data input This is nowadays usually the case otherwise you can try to locally find some utility program that will strip the highest order bit off the characters in your file and process the result via ASMO 449 APPENDIX H ALTERNATE INPUT ENCODINGS 91 EEEFEEEEEEEEEEE EET peje het Pret DZcHSDDDUNNNNUHEH BZEDIPBBBBEESISSEEPE port shife lele HT BEEPHBBDDBESISEDEBE ETTE TEETE TAST feje epi Pelet mjes erret EE fell Ell el EE EEE afstel lee pet BEBDDBDBDESESBBEBZ Table H 2 ISO 8859 6 code table Appendix I Miscellaneous utilities The following packages are not part of ArabTEX proper and are not supported in any way but are distributed along with Arab TEX as possibly a convenience to the users There is no warranty whatsoever I 1 twoblks sty This ATpX option will define a command twoblocks 1 2 which will place the two parameters 1 and 2 usually two paragraphs into two boxes side b
45. tion 6 1 ZDMG transliteration style In addition to the arabic writing the standard scientific transliteration may also be obtained from a fully vowelized input text This mode is activated by transtrue and may be switched off again by transfalse If only the translit eration is wanted you can deactivate the arabic writing by arabfalse it can be reactivated by arabtrue If both modes are active their output will be interleaved line by line The transliteration mode assumes that the input text is in the Arabic or Persian language and has been coded according to the rules given above For words from other languages the transliteration might be in error For Arabic text the follovving special cases are handled e after the definite article a double consonant will be assimilated e an initial vowel will be replaced by an apostrophe whenever the preceding word ended with a vowel in this case a wasla appears in the Arabic writing If that is not wanted start with hamza e a silent alif or alif maqsura after lt N gt tanwin and lt U gt is omitted in the transliteration The same happens after waw if it is written as a cap ital lt W gt e To correctly reproduce some historical writings a silent long vowel after lt _a gt is omitted in the transliteration For examples see the Appendix For economy of space the transliteration module is not loaded by default If 19 CHAPTER 6 TRANSLITERATION 20 you want to use i
46. x E Sample ArabTEX output lg gt juha wa him ruhu ata sadigun ala Guha yatlubu minhu himarahu li yarkabahu fi safrat n qasiratin wa qala lahu 6s Sad iue 9 A Ste ho ll E ao JI sawfa uciduhu ilayka ft masai wa radfasu laka u ratan bel a js sal 3 en T yo fa qala guha E JE sana gt as fun giddan anni la gt astati lt u can gt uhaggiga laka ragbataka fa lhimaru laysa huna l yawma pap gl E E ORARE TY Fike Gil wa gabla zan yutimmu juha kalamahu badaa Lhimara yanhaqu fr stablihi ER G E I Oe G a A d s fa qala lahu sadiquhu Ae X qu innt asma u himaraka ya juha yanhaqu n Gr woe fa qala lahu guha G 4 Jus garibun amruka ya sadiqi a tusaddiqu l himara wa tukaddibuni t As jg kal i E E Jol S 37 Appendix F Coding examples for Arabic The short vowels fatha kasra damma are denoted as in the translitera tion by the small letters a i u manata ace manata dahaba 2 85 dahaba sariba j ariba C 7 4 P E E o qabila L qabila a zuma ek cazuma alu je salu bal E bal o ni ma nisma yaktub IS yaktub The long vowels a z are denoted by capitals A I Uorby aa iy uw LA M a gAtala NJE g tala nUzi a b ndzisa LUMI e lumi siri sy siri lawmI PH lawmi sayrI gja sayri Alif maqsura is coded as _A or Y ramY 5 rama dikrY 553 dikr al A d ala bal A v bala Silent alif The plural suffixes u aw of the verb are den
47. y side separated by space of length colsep If necessary the resulting boxes will be split across a page boundary This feature is useful if two versions of a text are to be compared They may be in different languages and one of them might be in Arabic if enclosed in begin arabtext Vend arabtext This sentence has been written twice in the English language and UL a Ll ajos AIS in the Arabic language f Jl E Ls PN NT Otherwise this command does not depend on Arab TEX in any way and indeed originated in a completely different context Beware that the two blocks should each not contain much more than one not too long paragraph of text otherwise TEX s main storage might overflow There must be no verbatim text inside the parameters of twoblocks nor any Ncatcode changes and all TEX groups and if fi sequences must be properly nested 52 APPENDIX I MISCELLANEOUS UTILITIES 53 1 2 abjad sty This file loaded as a IATEX option will define a command abjad 1 usable inside and outside of an Arabic contert It profited greatly from suggestions by Dr Benno van Dalen Utrecht University The command abjad 1 will convert its argument which has to be a legal representation of a number between 1 and 1999 to the Arabic abjad notation used in some mediaeval manuscripts The result of the conversion will not look perfect and the legal abjad number 0 can presently not be generated Improving
48. zot gt b gt qara a 158 qarata bu suN es bwsun ab usuN px abusum ag E A ae ra ufa 235 rarufa ru asA u 143 rwas u bi ruN yo berun gt E z as ilaTuN ASI cas ilatun ka iba kariba qa imuN q imun ri AsaTuN ab rpasatun su ila de swila samA uN ZU sama un x 03 po barlI uN Es barrun sU uN g swun bad uN su badun A ay A N Zo say ull P ayun say iN lt sapin say all Ks Sayan ve sA ala lu s ala mas alaTuN Jea masalatun ve Bee saw aTuN alga sauratun _ha tI aTuN tos hateatun APPENDIX F CODING EXAMPLES FOR ARABIC 41 Old Hamza convention In an older writing style that is used e g in some Qur an editions the hamza is sometimes put below its carrier or on the connecting line This style may be switched on by oldhamza and off again by newhamza as ilaTuN del asilatun ka iba lt koiba q imul ls qaimun su ila j suia say aN e Sayan ha tI aTuN Els hatratun Madda in the context a is generated automatically AkiluN si akilun qur AnuN tss qur nun ra Ahu al ra hu To reproduce the historic writing correctly it can also be explicitly written in other contexts a sdiq A uh u SANOA asdiqa uhu ya g I u K yajru s U ila dh sibila Sadda A double consonant must be written twice even if it is coded by more than one character nazzala 03 nazzala ba s
Download Pdf Manuals
Related Search
Related Contents
American Standard F109 User's Manual Sony HDE1B User's Manual Braun 375 Electric Shaver User Manual CentreCOM x610シリーズ 取扱説明書 HP LaserJet M1522 MFP Series User Guide - Hewlett FICHA DATOS TÉCNICOS GRASA GICE Copyright © All rights reserved.
Failed to retrieve file