Home

DRAFT ArabTEX Typesetting Arabic and Hebrew1 User Manual

image

Contents

1. tin rItI amp riti mErE 2 mere la rkE lt larke The diphthongs ae and ao are encoded ae and ao kaesA LS kaes paoda 1547 paoda Note U yields a waw e ma ruf with pes while O yields a waw e maghul without pes and ao is indicated by a zabar Compare pUr pur pOtA Es pota paodA 155 paoda Intervocalic hamza is written CAE AS me ma I d mar 10 1 395 kor Aspiration is produced by coding a h after the consonant to be aspirated Aspiration in Urdu is produced by adding do asma he after the consonant khEt khet ghar A ghar mujhE mughe dharm po dharm Nasalization is indicated by nun e gunnah coded as n Note that the nugta for nun is not written when it is used to represent nasalization mae n yt maen a hi nsA ahinsa t Please contact Anshuman Pandey apandey u washington edu with questions or com ments regarding this section 46 CHAPTER 6 SUPPORT FOR OTHER LANGUAGES gaf kaf khaf a the 1 gaf te haf a the 4 l m phe te 5 lham gim _ mim ghim 5 mhim ce che bart he he 0 waw he khe 353 d ER choti he dal dhal dal dhal 1 n dal ui s hamza d oh te marbuta T rhe a fe rhe nun nh n do asma he choti ye bart ye n n e gunnah a Table 6 1 The Urdu Alphabet 6 3 URDU 47 The hanging he or Ha ye hawwaz e mahfr is generated by H It does not re ceive a j
2. e some information on the intended use 64 8 2 DOTS ON YA 65 We are willing to consider any suggestion Adding a new character might be easy or else it might be outright impossible ArabTEX is rather flexible but there are also some technical limitations 8 2 Dots on ya Whether ya in the final position carries dots or not is controlled by the chosen language convention You can override this after selecting the language by yahdots and yahnodots 8 3 Vowel positioning In vowelized Arabic text the short vowel marks are by default positioned close to the basic character glyphs If this is not wanted they may be raised to approximately uniform height by the command Naccentshigh You may revert to the default strategy by Naccentslow 8 4 Abjad numerals The command abjad 1 will convert its argument which has to be a legal representation of a number between 1 and 1999 to the Arabic abjad notation used in some mediaeval manuscripts The result of the conversion will not look perfect and the legal abjad number O can presently not be generated The command abjad 1 can be used inside and outside of an Arabic context This routine profited greatly from suggestions by Dr Benno van Dalen Utrecht University 8 5 Automatic stretching For special purposes e g for headlines and for Arabic paragraphs containing long mathematical or non Arabic insertions the connection between adjacent Arabic letters m
3. e Displayed equations bracketed by and NJ or by e short left to right Roman quotations containing text and possibly also TEX IZTEX commands as argument to LR read left to right or bracketed by and deprecated These introduce a new level of grouping so if they contain any TEX IATEX assignments the effects of these will be local by default When using and gt this feature is not available within an Arabic quotation The alternate notation N is not provided 3Quotations closed by gt may not contain nested insertions 2 2 COMMANDS WITHIN AN ARABIC CONTEXT 13 2 2 Commands within an Arabic context ArabT X will process all input text contained within an Arabic context even all TEX IATEX commands These must thus be known to ArabTFX since their meaning or their implementation might be different there from the usual in terpretation Thus ArabTEX has to provide a private implementation for any admissible command and will normally try to reuse parts of the existing mecha nisms As a consequence any incompatible changes to Plain TEX and or ATEX might require corresponding changes to ArabT X too A control sequence inside an Arabic context may be of the following kinds Note that commands which are intrinsically vertical like e g vskip make no sense within Arabic quotations and may lead to unpredictable effects there e ArabT X option changing commands These may also be used outsid
4. ketAb e U di SS ketab e rAh e t_U ol rah e tu nAmeH i man ya al nameh i man bInI e An mard 3 ol Css bini e an mard pA i In zan oj cp GE pai in zan bAZU i In zan 85 5 6936 bazu i in zan Long edafe is written _i dAr _i man j d r i man hU i tU h i tu Hamze as ya ye wehdat nesbat xetab is likewise written _i nAmeH _i LU nameh 1 sormeH _i ta sormeh t gofteH _i Cif goftch i Ye ye wehdet is written I or E ketAb I gis ketab 1 rAh I al rah 1 nAmeH I Y aol nameh 3 dAnA I 3Uls dana pArU I 393 p r i dAnA I keH 4XSUI5 dand i keh pArU I keH Ki 336 p r i keh The present tense forms of the verb buden and the pronominal clitics are written as they are spoken rafteH am N RT rafteh am rafteH Im v lt rafteh m rafteH I gl lt rafteh i rafteH Id I lt rafteh id rafteH ast El cd rafteh ast rafteH and Xil lt rafteh and 44 CHAPTER 6 SUPPORT FOR OTHER LANGUAGES mard Id do t mard id asb etAn Sl asb etan An gA st Este anga st U st ia st t U st Baer tu st ketAb I st Ets ketab i st nAmeH I st ol nameh i st The preposition be can be written with or without a hyphen be man e be man be t_U E be tu be An Sb be an be In L be in beU sh be The components of compounds can be separated by or SA hebN hAneH ul Cols sahebhaneh ta ht e hwAb c taht e hab pas andAz Asi pasandaz naw AmUz So
5. uuu 4 2 4 3 4 1 1 4 1 2 4 1 3 4 1 4 4 1 5 Standard Arabic and Persian characters Vowelization x sf ss uti a pp TE al Quoting 4 ae sawa s s a a an ee Lipatures ss secas x mox a quas s Coding examples for Arabic Verbatim input o e s a ETA XX ada Alternate input encodings uuu 4 3 1 4 3 2 4 3 3 4 3 4 ASMO 449 ISO 9036 ASMO 449E ISO 8859 6 MacOS Arabic CP 1256 Arabic Windows Encoding UNICODE Arabic 22 2 2 66 6 6 6 6 6 64 6 6 11 12 13 15 15 16 16 2 CONTENTS 4 3 3 ISIRE 3342 i ns sta ehe 34 4 3 6 Buckwalter Transliteration 38 5 Transliteration 39 5 1 ZDMG transliteration style on 39 5 2 Other transliteration styles o re e 40 5 3 Capitalization 0 40 6 Support for other languages 41 6 1 Persian Farsi Dari suns 41 6 1 1 Coding examples for Persian 42 6 2 Maghribia Aux b Oe Ede qd oos ss 44 0 3 AU TOU re RR NT UL j o E 44 6 3 1 Coding examples for Urd uuu 45 6 4 Pashto Afghanic ees 48 6 5 DIA Sa e g Ss e en RR e de doa E bh 49 6 6 Kashmiri guess dede we baw X Soe dana nes 50 6 7 Uighurie u v E gt BA eg nee ESTUR S Sho ud 50 6 8 Old Malay o M cmm e mee PEU eom mm 52 6 9 Other extensions of the Perso Arabic script 52 7 Hebrew mode 5
6. hskip 13 hspace 13 indent 12 input 12 ligsfalse 22 ligstrue 22 lineskiplimit 65 Mag 11 marginpar 12 mbox 12 medskip 12 newarabfont 16 newhamza 24 newpage 12 newtanwin 21 23 noindent 12 nospace 12 novocalize 20 21 53 oj 61 Xoldarabfont 16 oldhamza 24 oldtanwin 21 23 pagebreak 12 pagenumbering abj 67 Xpar 10 12 quiet 65 rq 11 setarab 10 15 18 40 52 setarabfont 16 setcode 28 setcodefarabtex 28 30 33 53 59 setcode asmo449 28 setcode bhs 58 82 setcode cp1255 54 setcode cp1256 33 setcode hed 54 setcode hwin 54 setcodefisiri 33 setcode iso8859 6 30 setcode iso8859 8 54 setcode iso9036 28 setcode newcode 54 setcode pccode 54 setcode standard 28 53 setcode utf8 33 59 setcode witbhs 59 setfarsi 15 40 sethebfont 61 sethebrew 52 setmaghribi 15 40 setnash 12 15 setnashbf 12 15 setnastaliq 12 setnone 15 40 setpashto 15 40 47 settransfont 38 settrans english 39 settrans farsi 39 settrans gesenius 59 settrans iranica 39 settrans kashmiri 39 settrans lazard 39 settrans standard 39 59 settrans turk 39 settrans urdu 39 settrans zaw 59 settrans zdmg 39 seturdu 15 40 47 setverb 15 40 51 shlmold 61 shlmscr 61 shlmstk 61 showfalse 65 showtrue 65 smallskip 12 space 12 spreadbox 13 spreadfalse 64 spreadl
7. In this encoding vowel points dagesh and meteg cannot be used as they cannot be represented in the input Abbreviations may be expressed by a single or double apostrophe right quote The final and the medial forms of characters are equivalent ArabTpX will choose the appropriate shape automatically 7 5 BHS encodings a MT b 001 c Gen x1 nam PND PINT NN Day DM DION NOR DEN 11 3 pa 955v nenn ody nme men aby quf indi mn baa 2103 ON ody NYY IRIT nin sS ody ONA NR min n Sind Dion np 5 YA po INT pa DSN PPI M DON DONA p MN DP Spam Som n 5 2 may YN Dan 7 DRY mo pa 5339 m man Pha DRY WPI PN NPY 5 som DPI ovo WR mon p nono DEI Ws BUM NS a cy ph pom 0pm Dog QM num mana CUN DDR con 9N 251929 DON NM BU NP mppo pan ney i pb nb py vu vp NYI PONT NYA DON PD SDD My PANT NYL D 299mm PUNO Jonny AYN 25979 Dion NM MID opu wwe TY py mob vw n Mp T DHON NY gt M cry ph spac cm s mio nho vm mn pm pa man Dpwh vp DR 507 Figure 7 1 BHS Hebrew example The package bhs sty provides support for the encoding that is used in the machine readable version of BHS Biblia Hebraica Stuttgartensia After loading the package you can switch to this encoding by the command setcode bhs The patach furtivum is not coded explicitly in BHS ArabTEX tries to guess it For an example see Figure 7 1 BHS line numbers and comments are only partially supported The line
8. The Metaphysics and Cosmology of Process according to Sayh Ahmad al Ahsar Critical Edition Translation and Analysis of Observations in Wisdom State University of New York at Buffalo PhD Dissertation 1998 Arnoud Vrolijk Bringing a laugh to a scowling face A study and critical edition of the Nuzhat al Nuf s wa Mudik al Ab s by Ali bn Sud n al Basbugawi Cairo 810 1407 Damascus 868 1464 Research School CNWS School of Asian African and Amerindian Stud ies Leiden The Netherlands 1998 E S Kennedy P Kunitzsch and R P Lorch The Melon shaped Astrolabe in Arabic Astronomy Texts edited with translation and commentary Boethius Texte und Abhandlungen zur Geschichte der Mathematik und der Naturwissenschaften Band 43 Franz Steiner Verlag Stuttgart 1999 Charif Bahbouh Martina Bahbouhova 365 1 arabske prislovi a mu droslovi Dar Ibn Rushd Praha 2001 Wolfdietrich Fischer A Grammar of Classical Arabic Third Revised Edition Translated from the German by Jonathan Rodgers Yale University Press New Haven amp London 2001 Tabit ibn Qurrah On the Sector Figure and Related Texts Edited with Translation and Commentary by Richard Lorch Institute for the History of Arabic Islamic Science at the Johann Wolfgang Goethe University Frankfurt am Main 2001 Tilman Hannemann Recht und Religion in der Gro en Kabylei 18 19 Jahrhundert Zu rechtskulturellen Wandlungsprozessen im tribalen Gewohnheitsrec
9. sometimes after some adjustments to ArabTEX when detecting the presence of another system and sometimes by compatible adjustments to the other system However there are some problem areas e The resource requirements of Arab TEX and usually also of the other pack ages are very high and might reach the limits of a small TEX system Fortunately nowadays very large TEX implementations are available e The running time is not negligible however computers are still becoming faster and typesetting this very document takes only about 20 seconds on a Pentium 233 PC running emTEX e Tracking down errors in a combination of several large macro packages might be difficult and time consuming e There might be conflicts between the names of internal commands of sev eral packages The resulting effects can be very obscure there seems to be no easy solution e ArabTEX assumes that the special and punctuation characters have their original category codes both when it is loaded and when Arabic processing begins If some macro package changes these codes Arabic processing will usually be broken This does not apply to Babel nor to german sty these packages are specially handled 67 68 CHAPTER 9 COMPATIBILITY ISSUES e Conversely ArabTpX changes the category code of lt which might break other packages Loading ArabTpX as the last module usually helps and enables Arab TX to detect the presence of other packages 9 1 Arabic document cla
10. 13 assimilation 19 20 25 38 automatic stretching 64 aW 22 83 aw 41 aWA 22 ay 41 B 20 Babel 68 baselines uniform 65 be 43 bgdkpt 53 boxing commands 13 Braams Johannes 68 breaking connections 21 cantillation 54 capital letter 39 category codes 66 67 CJK 68 code 7 bit 28 8 bit 30 33 arabtex 28 ASCII 28 30 33 ASMO 449 28 30 Buckwalter 37 ISIRI 3342 28 33 ISO 646 28 30 ISO 8859 6 28 30 ISO 8859 8 54 ISO 9036 28 MS Windows 33 UNICODE 33 59 UTF 8 33 59 coding conventions 17 76 commands ArabTpx 11 12 boxing 13 illegal 13 IATEX 11 12 overview 13 size changing 12 16 TEX 11 12 user defined 13 compatibility 66 Babel 68 CJK 68 84 EDMAC 67 PicTEX 68 compounds 43 connecting form 20 copyright 0 CP 1255 54 CP 1256 33 CTAN 16 dagesh 53 58 forte 54 lene 53 orthophonicum 54 dagger alif 17 20 damma 18 20 21 inverted 20 22 Dari 40 date 20 default font 15 defective writing 17 20 22 definite article 19 25 38 Derzhanski Ivan 41 69 diacritics 20 diphthongs 41 44 display mode 11 do asmi he 44 document classes 67 dots on ya 41 64 E 40 E 41 e 41 42 47 e 41 EDMAC 67 eH 41 El Hadi Ahmed 69 emphasis 27 Encyclopedia Iranica 39 Encyclopedia of Islam 39 ending 20 environment Arabic 10 arabtext 10 IXTEX 67 picture 67 RLtext 10 INDEX Roman 10 tabbing 10 tabular
11. 69 69 69 70 74 74 75 76 List of Figures 1 1 1 2 7 1 Cl Sample Arab TEX input suce SSAA RSS 9 Sample Arab TEX output e 10 BHS Hebrew example e 59 Example of Arabic poetry source unknown 80 List of Tables 4 1 4 2 4 3 4 4 4 5 4 6 4 7 4 8 4 9 4 10 6 1 6 2 6 3 6 4 6 5 7 1 7 2 7 3 7 4 7 5 7 6 Standard encodings for Arabic and Persian consonants 19 Additional encodings generally available 20 Verbatim encodings for the carrier of hamza 29 ASMO 449 code table o 30 ISO 8859 6 and MacOS Arabic code table 32 Windows CP1256 code table o 33 UNICODE Arabic Part 1 o o 35 UNICODE Arabic Part 2 Seca SSA 36 ISIRI 3342 code table a 37 Buckwalter transliteration o 38 The Urdu Alphabet lens 46 Additional codings for Pashto nn 48 The Sindhi Alphabet nenn 49 The Kashmiri Alphabet sen 50 ArabTEX encoding of Uighurie o o 51 ISO 8859 8 and Windows CP 1255 code table 56 MAC Hebrew code table nn nennen 57 Hebrew TEX CP 1255 and ISO 8859 8 code table 58 UNICODE Hebrew ees 61 Judeo Arabic encoding 6 6 a 63 Yiddish YIVO transcription 22 63 LIST OF TABLES 8 1 Additional codings
12. IATEX system will recognize the following items e standard TEX IATEX text and commands e Arabic quotations as arguments to the command RL read right to left within a Roman paragraph A quotation may also be bracketed by lt and gt deprecated or by N lt and gt except inside a IXTEX tabbing environment An Arabic quotation may not contain more than a single paragraph of right to left text and forms a new TEX group so any assign ments will be local by default e longer Arabic text segments called Arabic environments which are brack eted by the commands begin RLtext and end RLtext or also begin arabtext and end arabtext with the same meaning even when using Plain TpX An Arabic environment consists of one or more paragraphs separated by blank lines or par commands It forms a group so assignments will be local by default Arabic quotations and Arabic environments will be called Arabic contexts in the sequel 1The former restriction that quotations must fit on the current line no more applies 2Quotations closed by gt may not contain nested insertions also observe that lt must be matched by gt not by N gt 11 12 CHAPTER 2 INPUT TO ARABTEX 2 1 Arabic text elements Every Arabic paragraph and every Arabic quotation is a sequence of the following kinds of Arabic items separated by white space or single newlines e isolated punctuation marks interpreted as the corresponding Arabic
13. Postscript T1 format Their use is highly recommended when using a Postscript interpreter or converting the output to PDF format readability is dramatically improved In Hebrew mode see Section 7 you can use the standard fonts available on CTAN after installing them locally As defaults the fonts hclassic and hcaption are provided with ArabTgX these fonts contain vowel points Also for the other fonts vowel points are provided Chapter 4 Input encoding conventions 4 1 ASCII Transliteration encoding The ASCII input notation for Arabic text has been modelled closely after the transliteration standards ISO R 233 and DIN 31635 These standards do not guarantee unique re transliteration and are also not 7 bit ASCII compatible therefore some modifications were necessary These follow the general rules e whenever the transliteration uses a single letter code that letter e whenever the transliteration uses a letter with a diacritical mark put the punctuation character most closely resembling the diacritical mark before the letter and not behind it as in some other coding proposals as otherwise the readability of the input would suffer and the encoding could become ambiguous e use capital letters for writing variants 4 1 1 Standard Arabic and Persian characters The standard encodings for Arabic and Persian consonants are given in Table 4 1 and Table 4 2 e For long vowels we use the capital letters lt A gt lt I
14. TEX will cooperate with EDMAC a Plain TEX macro package for criti cal editions written by John Lavagnino and Dominik Wujastyk If EDMAC is already present when ArabTEX is loaded the EDMAC commands will after suitable local modifications be available inside an Arabic environment Their arguments are considered Roman text but may contain Arabic quotations For further details see the EDMAC documentation EDMAC has been extended to work with IXTEX too and ArabTEX still coop erates most of the time However the three macro packages involved are very complicated and interact in very subtle ways so the user may sometimes get a surprise In this case please contact the author 9 3 USING ARABTEX WITH BABEL 69 9 3 Using ArabTpX with Babel The Babel package by Johannes Braams provides support for multi lingual texts in a large number of mostly European languages ArabTpX does not use the language switching facilities provided but is otherwise compatible If ArabTpX is used in a Babel document Roman insertions within an Arabic context are interpreted according to the presently active Babel language mode Conversely a Roman paragraph in a Babel document may contain Arabic insertions 9 4 Using ArabTfX with PicTEX With some caution ArabTpX can be used together with PicTpX However PicTpX uses the angle brackets lt and gt for labeling diagrams and this requires switching off their special meaning within ArabTEX by t
15. alif 20 Raichle Bernd 69 raphe 54 raw sty 79 reading module 28 Rebstock Ulrich 69 Rezus Adrian 69 Roman 6 generic term 6 script 6 Roman environment 10 Roman insertion 11 Roochnik Paul 69 Saba Mohamed 69 Samy Waheed 69 Schimmel Annemarie 69 Schlebbe Heribert 69 script Arabic 6 Hebrew 6 Roman 6 Sadda 19 21 24 25 on tatwil 20 Shehab Nariman 69 short vowels 18 silent alif 21 38 silent h 39 size changing 12 16 space small 19 unbreakable 19 special codings 63 stretching 12 20 64 automatic 64 sukun 20 21 42 43 46 76 on lam 19 on tatwil 20 sun letter 19 T 24 tabbing environment 10 INDEX ta marbuta 24 tanwin 18 20 21 23 38 46 fatha 23 on tatwil 20 tasdid 19 21 46 disabling 46 Urdu verbs 46 tatwil 20 26 TEX commands 11 12 TEX hash size 74 text archaic 63 erroneous 63 TEX XpT 52 74 transcription 39 transliteration 17 38 76 Encyclopedia Iranica 39 Encyclopedia of Islam 39 Farsi 39 Kashmiri 39 Lazard 39 Library of Congress 39 standard 39 turkish 39 Urdu 39 ZDMG 38 39 twoblks sty 78 U 17 22 38 U 19 _U 23 41 _u 17 20 22 u damma 18 22 U IX implementation 73 UA 21 22 uheb fd 61 uN 18 23 UNICODE 33 59 Arabic 35 36 Hebrew 60 uniform baselines 65 Urdu 43 47 user defined commands 13 UTF 8 33 59 uw 17 22 87 van Dalen Benno 64 69 verbatim
16. between adjoining characters may be produced by a double hyphen qabila Ji gabila qa bi 1a gabila q ab ila qabila q a b i la qabila qa bi 1a qabila This feature should be used with discretion automatic spreading usually leads to a better result Ties between words are indicated by a single hyphen bi baladiN alo bi baladin ta al 11_ahi we ta l lahi sa ya tI aes sa yati li yafra ha E li yafraha 28 CHAPTER 4 INPUT ENCODING CONVENTIONS wa iswadda EID wa swadda ba da mA Gis bada m tAla mA Ub tala ma fI ma e fi ma alA ma p lt ala ma A single hyphen at the beginning or end of a word will enforce the use of the joining form of the first resp the last character if that form exists for special uses only S 8 TST 8 S w S h o h h 4 h h 4 h h 8 h d d d d 1A Y l 1A M l 1400 h a Y 1400 h Digit sequences are written in the natural order 1234567890 YY amp 01YAA 1234567890 Hyphen and comma as a decimal separator do not terminate the number 123 456 789 YY 201 YAQ 123 456 759 Ligatures are generated automatically they can be suppressed by al islAmu AL al islamu al gAru zer al garu al l gAru Jal aljaru _tumma T tumma _tulmma e tumma 25 2 vi mu hammaduN AZ muhammadun mul halmmaduN Asas muhammadun Abbreviations and emphasis are indicated by emphasize or
17. breaks of the source are ignored To respect them set bhsmode 2 or bhsmode 1 adjusting textwidth might be necessary 60 CHAPTER 7 HEBREW MODE setcode witbhs switches to a variant of the BHS encoding that was de veloped by the Werkgroep Informatica of the Vrije Universiteit Amsterdam Activate it by usepackage witbhs We have slightly extended the latter en coding for special purposes lt gt after a consonant forces the use of its medial shape unconditionally even at the end of a word lt gt produces a small space that may carry vowel points and lt gt shows as a box that may carry vowel points To use this feature the IATEX package latexsym is required 7 6 UNICODE Hebrew The file utf8 sty contains a reading module for the Arabic and the Hebrew segment of UNICODE in UTF 8 encoding It is installed by the IXTEX command usepackage utf8 or by input utf8 sty UTF 8 UNICODE Transmission Format see table 7 4 is a multi byte encoding which for Arabic and Hebrew uses two bytes per character whereas ASCII characters use a single byte Far eastern languages are encoded in three bytes per character This is in contrast to UNICODE itself which always uses two bytes per character The module is activated by setcode utf8 all following Arabic and Hebrew text will be considered to be coded according to the UTF 8 encoding standard To use the correct font select the appropriate language The ArabT X notation may be reac
18. die Leipzig e A A Ambros Einf hrung in die moderne arabische Schriftsprache 1 Auflage 1969 Max Hueber Verlag M nchen e ASMO 449 7 bit coded Arabic character set for information interchange Arabic Standards and Measurements Organization 1982 e J D Becker Arabic Word Processing Comm ACM 30 7 600 610 1987 e T Borg Arabisch f r Ausl nder Ein Lehrbuch f r modernes Hochara bisch 2 Auflage 1979 Verlag Borg GmbH Hamburg 70 71 J A Boyle Grammar of Modern Persian Wiesbaden Otto Harrassowitz 1966 B Comrie ed The World s Major Languages Croom Helm London 1987 DIN 31 635 Umschrift des Arabischen Alphabets Deutsches Institut f r Normung e V 1982 L P Elwell Sutton Elementary Persian Grammar Cambridge University Press 1963 C Faulmann Das Buch der Schrift enthaltend die Schriften und Alpha bete aller Zeiten und aller V lker des gesammten sic Erdkreises K K Hof und Staatsdruckerei Wien 1878 W D Fischer Grammatik des Klassischen Arabisch 2 Auflage 1987 Verlag Otto Harrassowitz Wiesbaden A Grohmann Arabische Pal ographie Teil I und II sterreichische Akademie der Wissenschaften Philosophisch historische Klasse Denkschriften 94 1 Wien 1967 E Harder A Schimmel Arabische Sprachlehre 15 Auflage 1983 Julius Groos Verlag Heidelberg BA ile Hasim Muhammad al Hattat Qawa id al Hatti al
19. file changes2 txt Version 3 is upwards compatible with version 2 However many new features were introduced gradually among them support for additional input encodings and a multitude of languages that use the Perso Arabic script We gratefully acknowledge the cooperation of several users who contributed information doc umentation and even helped with the coding On some users request a Hebrew mode was added as well as support for nearly all the Hebrew TEX fonts that are available on the CTAN server network 78 APPENDIX B RELEASE HISTORY ArabT X version 4 00 Version 4 is an upwards compatible extension of version 3 and many modules have been rewritten All presently supported features are documented in this manual It proved impractical to indicate every extension explicitly the basic user interface is still the same In a few instances we had to abolish certain old features that to our knowledge were rarely used or not at all because of ambiguities or conflicts with the exten sions These places are indicated in the manual and are flagged by an asterisk in the margin The most important incompatible changes are e Tilde is no more used as a prefix in the transliteration encoding because of conflicts with TpX s use of tilde for a stable space Caret is now used instead in all cases e The double bar lt gt for indicating a small unbreakable space has been replaced by lt gt Users
20. indicated for ASMO 449 will be usable after the obvious mod ifications however your TEX installation must be capable of processing 8 bit data input This is nowadays usually the case otherwise you could try to locally find some utility program that will strip the highest order bit off the characters in your file and process the result via ASMO 449 32 CHAPTER 4 INPUT ENCODING CONVENTIONS er Peer penas MU Creare ipod Bela r bs ee A ee es pei Ieee ele Gc CE CERE ER EE Rr an E ee BEE ip ONES e oa sae sss SEE Bee Se 5 pals RR E AE Table 4 5 ISO 8859 6 and MacOS Arabic code table 4 3 ALTERNATE INPUT ENCODINGS 33 laa es ales e ala Aaa pese a eee Nos 5 a presse pe RS ER en ERC beer E on ho Eco ERP UNUS EN E pece eee Peres Pee F ee TT ete rr SS ERREUR E J becca PERE eo PS Be bee Table 4 6 Windows CP1256 code table 34 CHAPTER 4 INPUT ENCODING CONVENTIONS 4 3 3 CP 1256 Arabic Windows Encoding The file cp1256 sty contains a reading module for the Arabic part of the code page CP1256 used within MS Arabic Windows It is installed by the IXTEX command usepackage cp1256 or by input cp1256 sty The module is ac tivated by setcode arabwin or setcode cp1256 all following Arabic text will be considered to be coded according to the MS Arabic Windows standard The ArabTeX notation may be reactivated by setcode arabtex The code page 1256 used in MS Arab
21. or setverb no special processing see however Section 4 2 The pro cessing mode may be changed at any time even inside an Arabic environment or an Arabic quotation Arabic insertions are generally included in RL or bracketed by lt and gt By selecting a language the symbols lt and gt are also activated as shorthands to bracket short insertions in the chosen language Whereas this is usually con venient it also has some drawbacks the angle brackets can thus no more be used for other purposes except in mathematical mode where they retain their normal meaning as relational operators To return them to their normal mode of operation you can deselect them by setnone For further details on supported languages see Section 6 3 3 Font selection For producing the extended Arabic script ArabTpX uses a special strategy to build up character shapes from a collection of fragments which normally do not correspond to individual character glyphs Therefore none of the available free or commercial Arabic fonts can be used we provide our own pseudo fonts Presently the following pseudo fonts are available e xnsh14 is the default e xnsh14bf is a bold face version of xnash14 e nash14 is an older version of xnsh14 containing less glyphs and only provided for compatibility with existing documents e nash14bf is a bold face version of nash14 The font xnsh14 the default use setnashbf or bf to switch t
22. punc tuation mark e numbers i e character sequences starting with a digit and possibly continued by digits commas hyphens or slashes A number will be processed using the normal writing sequence from left to right however a final punctuation mark will be split off and processed separately e Arabic quotes coded as two left quotes or two right quotes each or as Maq and rq They should be written directly adjacent to a word e words i e character sequences starting with a letter or a special non digit character followed by a letter A final punctuation mark will be split off and processed separately The coded characters of a word will in the output be arranged from right to left e a sequence of Arabic text elements words numbers and special charac ters enclosed in curly braces and This introduces a new level of TEX grouping otherwise the constituents are processed normally This feature may be nested Output from all items will be arranged from right to left lines will be broken as necessary Inside an Arabic Environment or in an Arabic quotation you may also have e ArabTEX commands with or without parameters These will be executed immediately e Some but not all TEX IATEX commands see below These will be exe cuted immediately e Short mathematical insertions bracketed by NC and X or by two single signs They must fit on one output line and are processed as usual
23. raphe lt v gt for I lt f gt for a Vowels are encoded as follows short vowels Tong vowels defective half vowels patach A qames chateph patach segol E sere sere chateph yod segol i chireq I chireq i shewa yod qames i cholem cholem chateph chatuph waw qames u gibbus U 10 shureq u no vowel mark The matres lectionis can also be written explicitly e g lt _ey gt or E for D iy or lt I gt for I lt _ow gt or lt 0 gt for e vocalize default activates vowel points and special punctuation novocalize switches them off again e patach furtivum is written lt a gt before its carrier lt rU a_h gt gives MM e dagesh lene with lt b g d k p t gt is expressed by prefixing a dot lt b gt 3 lt g gt 1 lt d gt 1 lt k gt J lt p gt A lt t gt N the same device is used for mappiq with lt h gt lt h gt n1 e dagesh forte and likewise dagesh orthophonicum are expressed by dou bling the consonant thus two equal consonants in sequence even in 7 3 ISO 8859 8 AND HEBREW MS WINDOWS 55 novocalize mode must be separated by some short vowel indicator or lt u gt if the standard encoding is used e meteg is indicated by after the vowel e maggef is lt gt en dash a single hy
24. s n zisa 1UmI a luma sIrI siri lawml S lawmi sayrl Ge Sayri Alif maqs ra is coded as A or Y ram A 45 rama dikr A cx dikra al A d cala bal A E bala Silent alif The suffixes aw of the verb are denoted UA aW or aWA katabUA katabi yaktubUA ce yaktub ramaWA 1555 ramaw yalqaW Isak yalgaw The defective notation of z u can be indicated by _a _i _u and leads to the appropriate spelling dAru h_u oj d ru h ri gli h_i de rigli hi however ramA hu ola rama hu yarmI hi 4 5 yarmi hi dih i dihi h a dih i hadihi tih i tihi hAtih i lt hatiha rabb i 5 rabbi sAl i JLS sal hum u humi 2Most of the examples are taken from Wolfdietrich Fischer Grammatik des Klassischen Arabisch 2 Auflage Verlag Otto Harrassowitz Wiesbaden 1987 24 CHAPTER 4 INPUT ENCODING CONVENTIONS qiy_amaTuN BE giyamatun il_ahuN y xilahun sam awAtuN Slee samawatun _tal_a_tuN Es talatun l_akin ES lakin h_a_dA l hada al 11_ahu AT al lahu al rra hm_anu jes ar rahmanu _d_alika dalika To reproduce the historical writing correctly a silent long vowel or alif maqsura after _a receives no suk n and is ignored in the transliteration 2a sal aUTuN salatun hay aUTuN hayatun to zak_aUTuN 3955 zakatun mi sk_aUTuN miskatun ar rib aU El ar riba tawr_aITuN amp y tawratun 21 5 oy 1 at ram aYh
25. share their positions with the ASCII digits Text coded in ASMO449 or ISO 8859 6 can be converted trivially and unambigu ously into the Buckwalter transliteration and except for the rarely used dagger alif and wasla on alif which are missing from ASMO449 and ISO 8859 6 the reverse mapping is equally straightforward Buckwalter transliteration can be trivially converted to and from UNICODE which contains the extra characters The Buckwalter System is a true or strict orthographical transliteration as op posed to the majority of transcription romanizations of Arabic that are in tended to convey phonological or morphophonological information it has the advantages of being complete typable and displayable on common ASCII ter minals and printers Because the Roman character equivalents are chosen to be reasonably mnemonic Arabists can learn to read and write Buckwalter texts with little difficulty The Buckwalter convention is activated by setcode buck Since it preempts the curly braces and as well as the angle braces lt and gt for encoding pur poses short Arabic insertions must be denoted specially as arguments of the command TB with round parentheses On the other hand Arabic environ ments work as expected Table 4 10 Buckwalter transliteration Chapter 5 Transliteration In addition to the arabic writing the standard scientific transliteration may also be obtained from a fully vowelized input te
26. who still need the old features should contact the author there might be a workaround Appendix C Miscellaneous utilities The following packages are not part of ArabT X proper and are not supported in any way but are distributed along with ArabTEX as possibly a convenience to the users There is no warranty whatsoever C 1 twoblks sty This IXTEX option will define a command twoblocks 1 2 which will place the two parameters 1 and 2 usually two paragraphs into two boxes side by side separated by space of length colsep If necessary the resulting boxes will be split across a page boundary This feature is useful if two versions of a text are to be contrasted They may be in different languages and one of them might be in Arabic if enclosed in begin arabtext Vend arabtext This sentence has been written eraat as T CURIE EE Te twice in the English language and UL ET 2 ada e in the Arabic language aa T i Li is NT Otherwise this command does not depend on ArabTEX in any way and indeed originated in a completely different context Beware that the two blocks should each not contain much more than one not too long paragraph of text otherwise TEX s main storage might overflow There must be no verbatim text inside the parameters of twoblocks nor any catcode changes and all TEX groups and Mif fi sequences must be properly nested 79 80 APPENDIX C MISCELLANEOUS UTILITIES C 2 verse
27. 27 Verheij Arian 69 verses sty 79 visual formatting 67 vowel marks 20 vowel points 16 vowels Hebrew 53 invisible 54 long 17 20 22 41 44 positioning 64 short 18 22 41 44 silent 38 W 38 WA 21 wasla 20 21 25 38 76 waw e atf 46 waw e ma dula 46 waw e maghul 44 waw e ma ruf 44 Wujastyk Dominik 67 69 Y 17 22 ya dots 41 64 ya i wahdat 41 42 47 Yano Michio 69 ya ye maghul 44 ya ye ma ruf 44 zer 44 zr 41 zwarakay 47
28. 3 7 1 Language switching eA 53 7 2 Standard Hebrew encoding sss 54 7 3 ISO 8859 8 and Hebrew MS Windows 55 7 4 HebrewTpX oldcode and newcode 55 7 5 BHS encodings e 24 RR pu e ee 59 7 6 UNICODE Hebrew 6 6 6 6 6 lees 60 7 7 Hebrew transcription systems 6 6 les 60 1 8 Hebrew fonts sti eee sa e Jalan ah 60 7 9 Jud o Arabie si da a eee nah an na 62 CIONI AIA it eorr rn er ee sa 63 CONTENTS 8 Miscellaneous features 8 1 Additional codings eee 8 2 Dotson Ya y ariano b ER A eo 8 3 Vowelpositiohing uu eo E Sedem v wx E E 84 Abjad numerals ee 8 5 Automatic stretching a 8 6 Uniform baselines 6 een 8 7 Verbatim copy of the input sss 8 8 Propress report i202 22k x xor pk E ducc y E EORR Q Qs 8 9 Module Reporting leen 9 Compatibility issues 9 1 Arabic document classes 0 0 000008 ae 9 2 Using ArabTpX with EDMAC 0 9 3 Using ArabTpX with Babel 6 6 66 6 6 66 6 666066060606 9 4 Using ArabTpX with PiCTEX LL 9 5 Using ArabTgX with CJK ee 10 Acknowledgments A Obtaining and installing Arab TEX Al Obtaining ArabTEX sess susie essa A 2 Installing ArabTEX LL B Release history C Miscellaneous utilities CL twoblksisty Laser la APES ESE AA EZ vetses Sty 2 bu ae i PR i a 9 TAWES ra ne A FRESA 64 64 65 65 65 65 66 66 66 66 67 68 68
29. 67 extra characters 51 64 Farsi 40 fatha 18 20 21 Fischer Wolfdietrich 22 69 font additional 16 Arabic 12 15 nash10 76 nash14 15 73 74 76 nash14bf 15 74 Naskh 15 73 74 Nasta liq 15 73 xnsh14 15 74 xnsh14bf 15 74 bold 15 commercial 15 default 15 Hebrew 16 61 DeadSea 61 hcaption 61 74 hclassic 61 74 Jerusalem 61 OldJaffa 61 Shalom 61 standard 61 TelAviv 61 installation 74 nasta lig 41 43 selection 12 spelling 61 standard 16 transliteration 38 grid 65 Grobgeld Dov 52 grouping 11 27 H 40 42 46 47 h silent 39 h 20 hamza 18 21 24 41 42 44 INDEX carrier 24 27 old style 24 harakat 18 20 22 41 on tatwil 20 Haralambous Yannis 69 ha ye hawwaz e mahfi 46 hcaption 16 hclassic 16 Hebrew consonants 53 Hebrew fonts 61 Hebrew mode 52 Hebrew script 6 Hebrew vowels 53 Hebrew TEX 52 hebtex tex 52 HED 52 Heddaya Abdelsalam 69 Heer Nicholas 69 Hoekwater Taco 69 hyphen 20 26 27 I 17 22 1 41 L 19 i 41 _1 17 20 22 i kasra 18 22 implementation Mac 73 PC 73 U IX 73 iN 18 23 input switching 28 insertion mathematical 11 non Arabic 11 Roman 11 installation 74 inverted damma 20 47 invisible consonant 19 ISIRI 3342 33 ISO 646 28 30 33 ISO 8859 6 30 ISO 8859 8 54 ISO 9036 28 item 85 Arabic 11 iy 17 22 izafet 20 39 41 42 47 Urdu 46 Jabri Yussuf 69 ja
30. Arabt Maktaba an Nahda Baghdad Dar al Qalam Beirut 1400 1980 ISO R 233 1961 International System for the Transliteration of Arabic Characters International Standards Institution 1961 ISO 8859 6 Information processing 8 bit single byte coded graphic character sets Part 6 Latin Arabic alphabet International Organization for Standardization 1987 ISO 9036 Information processing Arabic 7 bit coded character set for information interchange International Organization for Standardization 1987 D E Knuth The METAFONTbook Addison Wesley Publishing Comp Reading Mass 1986 D E Knuth The TEXbook Sixth printing Addison Wesley Publishing Comp Reading Mass 1986 D E Knuth and P MacKay Mixing right to left texts with left to right texts TUGboat 8 1 14 25 1987 72 CHAPTER 10 ACKNOWLEDGMENTS e Ann K S Lambton Persian Grammar Cambridge University Press 1953 e L Lamport ATEX A Document Preparation System Addison Wesley Publishing Comp Reading Mass 1986 e J Lavagnino and D Wujastyk An Overview of EDMAC A plain TEX format for critical editions TUGboat 11 4 623 643 1990 e M Lorenz Lehrbuch des Pashto Afghanisch 2 Auflage 1982 VEB Verlag Enzyklop die Leipzig e P A MacKay Typesetting Problem Scripts BYTE 11 2 201 216 1986 e H Ritter ber einige Regeln die beim Drucken mit arabischen Typen zu beachten sind ZDMG 100 2 577 580 1951 Frie
31. DRAFT Arab TEX Typesetting Arabic and Hebrew User Manual Version 4 00 Klaus Lagally March 11 2004 Report Nr 2004 03 Universit t Stuttgart Fakult t Informatik 2This Report supersedes Reports Nr 1992 06 1993 11 and 1998 09 Abstract ArabTEX is a package extending the capabilities of TEX IATEX to generate the Perso Arabic writing from an ASCII transliteration for texts in several languages using the Arabic script It consists of a TEX macro package and an Arabic font in several sizes presently only available in the Naskhi style ArabTEX will run with Plain TEX and also with IXTEX2e It is compatible with Babel CJK LyX the EDMAC package and PicTEX with some restrictions other additions to TEX may work but have not been tried ArabTEX is primarily intended for generating the Arabic writing but the stan dard scientific transliteration can also be easily produced For languages other than Arabic that are customarily written in extensions of the Perso Arabic script some limited support is available ArabTEX defines its own input notation which is both machine and human readable and suited for electronic transmission and E Mail communication However texts in many of the Arabic standard encodings can also be processed ArabTgX also provides support for fully vowelized Hebrew both in its private ASCII input notation and in several other popular encodings Arab TEX is copyrighted by the author but free If you us
32. EX It was not fully compatible with Version 1 however moving to the new version usually caused little problems Apart from some extensions most changes were introduced in order to better conform to the transliteration standards and to have less com patibility problems with TEX and BTFX The main differences between versions 1 and 2 were e The font size was increased so the document layout changed The old font nash10 was abolished and replaced by nash14 the character locations have been assigned differently e Some Arabic characters were now coded differently ayn is denoted by a left quote and lt c gt z t and lt n gt have been assigned new mean ings in order to better conform to the standard transliteration e Many more ligatures than before were supplied This normally did not concern the user e vocalize no more generated sukun and wasla except if explicitly indi cated by quoting See fullvocalize e Arabic Environments are now always bracketed by the new control se quences begin arabtext and end arabtext even if only the translit eration is wanted ArabTrX version 3 00 The changes introduced in Version 3 00 fall into one of two categories error corrections and upward compatible extensions Details are not given here but are documented in the text file changes txt that is part of the distribution package of ArabTpX The earlier change history up to Version 3 00 is described in the text
33. ISIRI 3342 standard The ArabTeX notation may be reactivated by setcode arabtex The ISIRI 3342 code see Table 4 9 is an 8 bit code closely related to 7 bit ASCII whereas the lower 128 positions are identical to ASCII ISO 646 some of the upper 128 positions contain the Arabic Persian characters plus additional graphic and control characters The notes on vowelization and transliteration of ASMO 449 apply also 4 3 ALTERNATE INPUT ENCODINGS 35 Table 4 7 UNICODE Arabic Part 1 36 CHAPTER 4 INPUT ENCODING CONVENTIONS J oe o oo we oan we os DA80 DA90 DAA0 DABO DB80 DB90 DBA0 DBBO Table 4 8 UNICODE Arabic Part 2 4 3 ALTERNATE INPUT ENCODINGS a EA d EA E EE EE ERE ERR S ERES E pera 110 e porto tf Fedi ES ES UN IE o ER ER E EAE pesa eea e ee E RC pei d EIE AUR RR ES ele pasa eZ et seo ee cele sl PeR ae ETS pese ERU ES ER ES Fosse ses RUA E EXE ES ES AAA ES EXEC ES ERE SE Pe Table 4 9 ISIRI 3342 code table 37 38 CHAPTER 4 INPUT ENCODING CONVENTIONS 4 3 6 Buckwalter Transliteration The file buck sty contains a reading module for the transliteration devised by the lexicographer Tim Buckwalter It is installed by the IXTEX command usepackage buck or by input buck sty The Buckwalter System see Table 4 10 is a 7 bit encoding reusing the Roman letters and the special characters for encoding Arabic letters and diacritical marks the Arabic digits
34. Ro fe initial shape 5 external encoding medial shape 6 sorting position final shape 7 name isolated shape Table 6 5 Arab TEX encoding of Uighuric GQ Or G A G 7 E d 51 52 CHAPTER 6 SUPPORT FOR OTHER LANGUAGES 6 8 Old Malay The preliminary ArabTEX language mode setmalay is provided for processing Old Malay texts in the extended Arabic writing Old Malay Jawi input texts are encoded in a modification of the standard ArabTpX encoding see below Additional encodings This language mode is strictly experimental and expected to contain many er rors it will be adapted to the users requirements Please report your experience and suggestions for changes and improvements to the author 6 9 Other extensions of the Perso Arabic script This is up to experimentation by the user If setarab or setfarsi will not produce the desired result try setverb for verbatim mode The vowelization and the transliteration cannot generally be expected to be correct but might work by accident In case some character variants not yet provided are needed feel free to ask the author for help There is no simple way for the user to modify the script directly Chapter 7 Hebrew mode On the request of some users starting with Version 3 02 ArabTFX has been ex tended by some modules adding support for Hebrew Whereas the initial applica tions only called for short Hebrew quotations within Roman texts possi
35. a sequence of short Arabic paragraphs separated in the input by blank lines This example is not typical insofar as we produce both the scientific transcrip tion of the text and the Arabic writing from the same input at the same time interleaving them This is rarely required but in our example it allows us to demonstrate that the input notation is very closely related to the transcription Details about this correspondence are covered in Chapter 4 What indeed is typ ical is the fact that apart from the centered headline we supplied no formatting information whatever TEX will take care of that 1If you happen to be curious and cannot read Arabic the text contains a traditional story about a somewhat silly person named Guha trying not to lend his donkey to a friend and failing documentclass 12pt article usepackage arabtex begin document setarab choose the language specific conventions vocalize switch diacritics for short vowels on transtrue additionally switch on the transliteration arabtrue print arabic text is on by default anyway centerline RL gu hA wa himAruhu begin RLtext at_A sadIquN il_A gu hA ya tlubu minhu himArahu li yarkabahu fI safraTiN qa sIraTiN wa gAla lahu sawfa u Iduhu ilayka fI al masA i wa adfa u laka u graTaN fa gAla gu hA anA AsifuN giddaN anni 1A asta tI u an u haggiga laka ra gbataka fa al himAru laysa hunA al
36. a gi nawamiz bI _hwod 535 3 bihod Digit sequences are written in their natural order 1234567890 YYfosyAa 1234567890 6 2 Maghribi This works nearly like Arabic but using a different writing convention fa is written with one dot below the letter gaf with one dot above the normal letter form of fa The three dots of va are put below the letter Switch to this mode by setmaghribi 6 3 Urdu The Urdu mode is activated by seturdu e For Urdu additional codings are available see Table 6 1 Some of the given codings also occur in Pashto but with a different meaning see Section 6 4 e Even in fully vowelized mode an aspirated consonant before lt h gt receives no sukun since the two are technically a single letter e Urdu uses the Nasta liq font when available otherwise Naskh 6 3 URDU 45 6 3 1 Coding examples for Urdu The short vowels i and are encoded by the lowercase letters a i and u and are marked respectively by zabar zer and pes par j par dam e dam fir fir din y din sukh XL sukh dukh 5 dukh The long vowels a e and o are encoded by the capital letters A I U E and O Note alif madda is automatically generated for word initial a Ap cal ap Am am tin Y tin la rkI f larkt dUr 35 dur dEr der ba rE 2 bare mOr mor Note I yields a ya ye ma ruf with zer while E yields a ya ye maghul without zer tIn
37. ailable can be activated by shlmold shlmscr and shlmstk Their vowel points presently do not work since they must be handled differently from the ArabTEX strategy e In case a font appears to be locally available but is not found check and if required correct the exact spelling of the font name within the file uheb fd We have seen various incompatible variants on CTAN and on the InterNet e Activating other Hebrew fonts by the command sethebfont font might work Note We recommend to set lineskiplimit 20pt whenever Hebrew and Ro man script are used within the same paragraph this will lead to uniform line spacing The value of baselineskip may have to be adjusted 7 9 Judeo Arabic Some mediaeval manuscripts contain Arabic texts written using an extended Hebrew notation There appear to be several conventions one of them is given in Table 7 5 Judeo Arabic mode is supported by default Hebrew output can be globally ac tivated by the language selection command setjudarab switch back to Arabic by setarab Input text is assumed to be in one of the supported input notations for Arabic This mode is still experimental neither Hebrew vowel points nor Arabic diacrit ics are presently provided Prospective users of this mode are invited to contact the author for discussing some open issues 1Unfortunately we cannot read Hebrew so the relevant literature is not accessible to us 7 10 YIDDISH 63 Table 7 5 Ju
38. ally without hamza If a vowel precedes the word the auxiliary vowel will be omitted in the transcription and the wasla sign will be used in the spelling 2A E wa ismuhu wa smuhu f a in sarafa malls fa nsarafa 3In vowelized writing it may sometimes be advisable to introduce a kasida to prevent the vowel marks from bumping into each other 4 1 ASCII TRANSLITERATION ENCODING 27 This also works across word boundaries yA ibnI gil ya bri h a dA ibnuh_u Sl l s hada nuha qAla u hru g e Je gala hrug An auxiliary vowel at the end of the preceding word may be separated by a hyphen gad i in sarafa Sal di gad i nsarafa ra aW u al bAba sd ub raaw u l baba min i ibnih i ja min i bnihi This also works for the article preceding alif al wasl al i ismu PSI al i smu al i i stirA u aT al i stira u and even if the auxiliary vowel is omitted in the spelling ra guluN i ibnatuh u gamIlaTuN de xul ragulun i bnatuhi gamilatun mu hammaduN i al qura sIyu s al e muhammadun i l qurastyu The particles li and la must be combined with the article except before lam lil rra guli Jew lir raguli 1a1 ma gdu eval lal magdu however li llaylaTi icu li llaylati 11 11 ahi dj li llahi The Name of God is written with a special ligature if it is recognized from the input sequence 11 ah al 11 ahu Ul gt al lahu ta al 11 ahi WE ta I lahi Increased spacing Tatwil
39. amza is determined by the following input character see Section 4 2 20 CHAPTER 4 INPUT ENCODING CONVENTIONS dim with three dots below ha with three dots above CIENCIA Table 4 2 Additional encodings generally available madda on alif is generated by a right quote hamza before lt A gt lt A gt It may also be written A likewise I and U will produce madda on ya and on w w as required in some older writing conventions The coding lt gt for ayn is a single left quote beware of confusing it with hamza The invisible consonant lt gt may be inserted in order to break unwanted ligatures and to influence the hamza writing It will not show in the Arabic output or in the transliteration At the beginning of a word it will suppress a following short vowel otherwise it acts like a consonant The sequence lt gt will insert a small space as does lt gt see Section 4 1 3 below The adjacent characters will not be connected Sadda is indicated by doubling the appropriate letter coding Therefore two equal consonants in sequence have to be separated by a short vowel indicator or lt gt even in novocalize mode The definite article is separated from the following word by a hyphen It may be written in the assimilated form if it exists lt as salAmu gt or always as lt al gt in that case a subsequent sun letter must be doubled lt al ssalAmu gt t
40. an texts the Izafet connection is handled specially and a final silent h will be omitted in the transliteration 5 2 Other transliteration styles Since there is no general agreement on transcriptions a number of variants have been provided e settrans standard or settrans zdmg is the default ZDMG mode e settrans english will switch to the style of the Encyclopedia of Islam which is close to the conventions of the Library of Congress e settrans iranica will produce the style used in the Encyclopedia Iran ica e settrans farsi produces a variant of the style used in the Encyclope dia Iranica e settrans lazard switches to the conventions of Gilbert Lazard La langue des plus anciens documents de la prose persane e settrans urdu switches to the conventions for Urdu used in the ALA LC tables e settrans kashmiri switches to the conventions for Kashmiri used in the ALA LC tables this mode is also chosen automatically by setkashmiri e settrans turk will produce a style similar to modern Turkish it only makes sense for Ottoman texts e Transcription conventions for Hebrew are given in section 7 7 The transliteration mode may be switched globally at any time If the input text is not fully vowelized the transcription cannot be expected to be correct 5 3 Capitalization If transcription output is used as part of a Roman text it may be desirable to have some words start with a capital letter Th
41. ay be made elastic if they form no ligature Thus a kasida is inserted whose length will be adjusted automatically to uniformly fill the output line This feature increases the already high storage demands of ArabTgX and should therefore be used sparingly It can be switched on with spreadtrue and switched off again with spreadfalse Inside an Arabic Environment it will also be switched off automatically at the end of every paragraph 66 CHAPTER 8 MISCELLANEOUS FEATURES 8 6 Uniform baselines The Arabic and Hebrew fonts are optically compatible with the standard Ro man fonts but have larger ascenders and descenders this will lead to unequal distances between the baselines of consecutive lines especially if Roman and non Roman text are mixed within the same Roman paragraph Typesetting on a grid will improve line spacing We recommend to set lineskiplimit 20pt whenever Roman script and Arabic and or Hebrew are used within the same Roman paragraph The value of baselineskip may have to be adjusted with IATEX use Nbaselinestretch Also within an Arabic environment typesetting on a grid may lead to a better result 8 7 Verbatim copy of the input For testing purposes the Arabic input may be reproduced verbatim after showtrue in addition to the normal output Nshowfalse switches this fea ture off again Commands will not usually be shown The output will generally not look pleasant and this feature is only provided in or
42. azm even in fully vocalised mode and is not joined to a following letter rAjaH sl ragah kiH S kih naf S nah waH 03 wah nAmaH X namah yaH 4 yah Tanwin is coded by aN tagrIbaN WE tagriban faoraN bs faoran Tasdid is produced by coding the consonant twice mi t tI di mitti unnIs gal unnis Note Double consonants in Urdu verbs are written without tasdid Tasdid is disabled by inserting between the double consonants ban nA s banna gin nA US ginna jAnlnA il ganna Idafat is written e or i sEr e panjAb ser e pangab Ab e hayAt oks al ab e hayat Waw e ma dula or the waw which is passed over is written w it is omitted in the transliteration and the preceding he receives no jazm hwAb hab hwAja cls haga _hwud 93 gt hud Waw e atf or the waw of conjunction is coded as 0 gul 0 bulbul KE gul bulbul sarw 0 sanObar 5 94 59 4 sarw 0 sanobar tar 0 tAzaH tar 0 tazah Alif magsura is encoded as _A or Y fatw_A ss fatwa da wY 4 355 dawa Digit sequences are written in the natural order 0123456789 YrfosyAa 0123456789 48 CHAPTER 6 SUPPORT FOR OTHER LANGUAGES 6 4 Pashto Afghanic Se SSIES BAITS a reta nt DEE ECN Te o CEC DEE CISCO Table 6 2 Additional codings for Pashto Switch to this mode by setpashto For writing some Pashto words in the Urdu style write the command setur
43. bly con taining Arabic insertions too adding Hebrew environments proved compara tively easy We also added most commands provided by the Hebrew TEX package an alternative TEX extension developed in Israel that requires TEX XET The Hebrew date quite probably will not work correctly To process Hebrew input with ArabTpX proceed as follows e for use with Plain TEX say input hebtex a small loader module will load both Arab TFX and the Hebrew extension e with IXTEX2e say usepackage hebtex The extension provides a language mode sethebrew and several common en codings of texts in Hebrew that may be switched by the Nsetcode command One nameless encoding is compatible with Dov Grobgeld s editor HED so files prepared for HebrewTEX are supposed to be compatible In addition the standard ArabTpX encoding has been adapted to cater for Hebrew too 7 1 Language switching sethebrew switches to Hebrew mode setarab back to Arabic Remember to switch the encoding and the vowelization mode too 53 54 CHAPTER 7 HEBREW MODE 7 2 Standard Hebrew encoding setcode standard or setcode arabtex will switch to the standard ArabTEX Hebrew encoding defining the consonants as follows samekh qof s h in Note without punctuation the characters sin shin and s h in look identical otherwise sin Y has a dot to the left shin W has a dot to the right s h in W is the form without a dot Soft consonants are marked by
44. ced errors 6 1 Persian Farsi Dari The Persian mode is activated by setfarsi e All characters needed for writing Farsi are available by default The short vowels lt e gt and lt o gt are mapped to lt i gt and lt u gt the long vowels lt E gt and lt 0 gt to lt I gt and lt U gt without a vowel indicator lt H gt denotes final silent ha This ha receives no sukun even in fully vowelized mode 41 42 CHAPTER 6 SUPPORT FOR OTHER LANGUAGES For fatha or kasra followed by a final silent ha you can also write lt a gt or lt e gt in place of lt aH gt and lt eH gt deprecated The zafet connection may always be written lt i gt or lt e gt with hyphen then ArabTpX tries to determine the correct spelling from the context Likewise the ya i wahdat can always be written lt I gt or lt E gt The present tense forms of the copula are coded lt am gt lt I gt lt ast gt lt Im gt lt Id gt lt and gt In the output they are written as separate words after a little space The final ya carries no dots Farsi uses the Nasta liq font when available otherwise Naskh 6 1 1 Coding examples for Persian The short vowels e a e o 4 are denoted by the lowercase letters a e or i o or u bar bar beh amp beh bon CP bon The long vowels a a i 1 e u u O are denoted by the capital letters A I or E U or O amp lef madde is automatically generated fo
45. cial publishing agency As prospective users we mainly considered Orientalists which we believed very short on funding this proved to be a drastic understatement There was no Arabic word processing available at that time and using TEX as a platform looked like the only re maining possibility except perhaps implementing some complete system from scratch which probably would necessitate building multiple versions for various computer platforms and operating systems which we did neither dare nor could afford Basing the design on TEX required the minimal user interface to become ex tremely lean to facilitate the use by non programmers In fact only three com mands and the input notation conventions have to be learned once the user is familiar with TEX or IATEX to be able to use ArabTpX for a standard Arabic document Additional features can be looked up as required Using the TEX typesetting engine as a machine independent platform suggested implementing the internal algorithms in TpX s powerful internal macro lan guage Unfortunately it is not easy to use and errors can be very hard to find and eliminate ArabTrX version 1 00 ArabT pX version 1 00 was a prototype to check the basic feasibility of our approach and to get some operating experience Many of its features were only available in a very primitive form It is no more supported 76 77 ArabTrX version 2 00 ArabTEX Version 2 was the first stable version of ArabT
46. code table 4 3 ALTERNATE INPUT ENCODINGS 31 Texts in ASMO 449 are usually not fully vowelized thus the transliteration cannot be expected to be correct This is especially true for Egyptian texts which commonly do not differentiate between ya and alif magsura A minimal driver file for processing existing ASMO 449 text e g in a file asmotext dat could look as follows documentclass article usepackage arabtex usepackage asmo449 begin document setcode asmo449 begin RLtext input asmotext dat end RLtext end document 4 3 2 ASMO 449E ISO 8859 6 MacOS Arabic The file iso88596 sty contains a reading module for the ISO 8859 6 code extended ASMO 449 ASMO 449E It is installed by the IXTEX command usepackage iso88596 or by input iso88596 sty The module is activated by setcode iso8859 6 all following Arabic text will be considered to be coded according to the ISO 8859 6 standard The ArabTpX notation may be reactivated by setcode arabtex ISO 8859 6 see Table 4 5 is an 8 bit code closely related both to 7 bit ASCII and to ASMO 449 whereas the lower 128 positions are identical to ASCII ISO 646 the upper 128 positions contain the Arabic characters of ASMO 449 in the analogous places This code plus additional Persian Urdu graphic and control characters is supported by MacOS in Arabic mode The notes on vowelization and transliteration of ASMO 449 apply also The driver file
47. deo Arabic encoding 7 10 Yiddish Yiddish originally a dialect of Mediaeval German containing many Hebrew loan words is commonly written phonetically using an extension of the Hebrew character set whenever available otherwise using Roman ASCII letters in the YIVO standard which we were yet unable to locate For writing Yiddish using Hebrew letters load the package yiddish sty and globally activate the Yiddish mode by setcode yiddish The character as signment is given in Table 7 6 Since all required vowels are provided there will be no Hebrew vowel points nor dagesh meteg and raphe except for denoting soft consonants Some words of Hebrew origin may not be rendered correctly this way in these cases switch temporarily to Hebrew mode by setcode standard and switch back later by setcode yiddish and use the standard ArabTEX encoding Table 7 6 Yiddish YIVO transcription Chapter 8 Miscellaneous features kaf in the final position without a mark GE dal with a dot below pa nun without a dot alif maqsura ya without dots in all positions Table 8 1 Additional codings for special purposes 8 1 Additional codings To reproduce exotic erroneous or archaic texts exactly as they are written some additional codings are available see Table 8 1 If further variants are needed write to the author and indicate e the required shape e the assumed transliteration e a suggestion for the input coding
48. der to trace down errors or to demonstrate the operation of ArabTEX as in the examples above 8 8 Progress report Since ArabTEX is still rather slow due to evolving technology it is getting faster every year it will produce some terminal output while running to indicate it is still alive If that is not wanted e g on a very fast computer system or while running a batch job say Nquiet or tracingarab 0 outside an Ara bic Environment otherwise say doassign tracingarab 0 The setting tracingarab 1 will only report Arabic paragraphs a value of 2 Arabic lines and insertions a value of 3 or more individual Arabic items 8 9 Module Reporting A complete list of the modules loaded in a particular run will be put into the TEX log file before the run statistics if TEX is used This is believed to be useful when tracing down errors This list is also available to the user even with Plain TEX as the contents of the control sequence arabtexconfig Chapter 9 Compatibility issues ArabTEX relies only on part of the powerful features of the TEX typesetting engine neither mathematical mode nor the alignment mechanism are used and few of the features provided by the Plain TEX package and none of IATEX are required but may be necessary in other parts of a multi lingual document Of course TEX s macro processor is very heavily used It turned out that ArabTEX could be made to cooperate with a number of other macro packages
49. drich R ckert Grammatik Poetik und Rhetorik der Perser Wiesbaden Otto Harrassowitz 1966 e C Salemann V Shukovski Persische Grammatik 4 Auflage 1947 Verlag Otto Harrassowitz Leipzig e A Schimmel Islamic Calligraphy E J Brill Leiden Netherlands 1970 e H J Vermeer W Akhtar A Akhtar Urdu Lautlehre und Urdu Schrift 3 Auflage 1985 Julius Groos Verlag Heidelberg Publications The following list contains some examples of publications heavily relying on Arab TeX for typesetting The list is neither complete nor do we have any judgment on the content e Benno van Dalen Ancient and Medieval Astronomical Tables mathemat ical structure and parameter values Proefschrift Universiteit Utrecht 1993 e Abi Masar The Abbreviation of the Introduction to Astrology together with the Medieval Latin Translation of Adelard of Bath Edited and translated by Ch Burnett K Yamamoto and M Yano E J Brill Leiden New York K ln 1994 e U Rebstock Der Mwamalat Traktat des Ibn al Haytam Zeitschrift f r Geschichte der Arabisch Islamischen Wissenschaften 10 61 121 1995 96 73 Kusyar ibn Labban s Introduction to Astrology Edited and Translated by Michio Yano Institute for the Study of Lan guages and Cultures of Asia and Africa Tokyo 1997 Y Takashina Al Madhal gt ila TEX wa ArabT X Introduction to TEX and ArabTFX in Japanese Osaka University of Foreign Studies 1997 Idris Samawi Hamid
50. du and afterwards switch back e For Pashto additional codings are available see Table 6 2 Some of the given codings also occur in Urdu but with a different meaning e The codings lt H gt lt a gt and lt e gt are used as in Persian The rules for izafet and ya i wahdat apply e The short vowel lt e gt is indicated by a zwarakay o by an inverted damma 6 5 SINDHI 49 Table 6 3 The Sindhi Alphabet 6 5 Sindhi To activate the Sindhi mode select the language by setsindhi Sindhi input texts are encoded in a modification of the standard ArabTfX encoding The alphabet is given in Table 6 3 e Use hyphens to resolve ambiguities with aspired consonants e There are two special codings lt mIN gt or AMIN lt IN gt or MIN e The user might want to break some ligatures by inserting a vertical bar to get the correct writing or just for a better appearance of the script 50 CHAPTER 6 SUPPORT FOR OTHER LANGUAGES gt L x 3 3 c y S S d Table 6 4 The Kashmiri Alphabet 6 6 Kashmiri Select Kashmiri by Nsetkashmiri The input codes are given in Table 6 4 The transcription follows the ALA LC romanization conventions 6 7 Uighuric Switch to this mode by Nsetuighur Uighuric input texts are encoded in a modification of the standard ArabTpX en coding see column 5 of Table 6 5 Please observe that in Uighuric all characters are coded verbatim ip adi m 6 7 UIGHURIC
51. e an Arabic Context and usually follow TEX s grouping rules Changes of language Arabic and Hebrew font input encoding and transliteration conventions are global e NN for a line break the current line will be padded out on the left with spaces e or break for a line break the current line will be spread out If it comes out very badly spaced automatic stretching might help see Section 8 The current paragraph will not be reformatted e indent or par or a blank line for a new paragraph noindent for a new paragraph without indentation e Nemphasizeigroup of Arabic items or emph items will put a bar over the indicated group of Arabic items e setnash setnashbf setnastaliq and other font selection com mands see Section 3 3 e size changing ATEX commands like large etc only if ATEX is used e the following commands footnote observe that the syntax for Plain TEX and IATEX is different marginpar even with Plain TEX analogous to the IXTEX usage e the TEX IATEX commands smallskip medskip bigskip input hfill hfil vfill vfil Ny for a space space N small space newpage clearpage pagebreak with their usual meaning e mbox text puts the text argument into a box that will not be split e nospace unskip will place the adjacent items in the output in direct contact without any intervening space except in case of a line break 14 CHAPTER 2 INPUT TO ARABTEX e hspac
52. e the system for sci entific work please give appropriate credit to the software and the author e g in the colophon of a monograph We also appreciate a complimentary copy of any scientific work produced with ArabTEX ArabIEX may be redistributed and or modified under the terms of the LPPL BMTpX Project Public License distributed from CTAN archives in the directory ftp ftp dante de tex archive macros latex base lppl txt either version 1 of the License or at your option any later version There is no warranty of any kind either expressed or implied The entire risk as to the quality and performance rests with the user Please send error reports suggestions and inquiries to the author Prof Klaus Lagally Universit t Stuttgart Institut f r Formale Methoden der Informatik Universit tsstrafe 38 70569 Stuttgart GERMANY lagally informatik uni stuttgart de Copyright 1992 2004 Klaus Lagally Note This manual describes version 4 00 of ArabTpX The current version 3 11 is supposed not to change except for error corrections Contents 1 Introduction to ArabTrX 2 Input to ArabTrX 2 1 Arabic text elements 2 2 Commands within an Arabic context o 3 Running ArabTpx 3 1 Activating ArabTEX 6666 6 66 6 66 e 3 2 Language selection lee 3 3 E nt selection 22 2820 aer GORE E E Ud 4 Input encoding conventions 4 1 ASCII Transliteration encoding
53. e width will introduce the indicated amount of spacing in the output The same is true for vspace hskip and vskip e spreadbox width text spreads out the text to the indicated width This may be useful e g when typesetting poetry spreadbox width text hfill will inhibit the spreading spreadbox width hfill text hfill will center the text spreadbox width hfill Y or Nspreadboxt width just introduces the indicated amount of horizontal space as will hspace width If two boxing commands follow each other without any intervening blank space in the input there will also be no resulting space between the boxes in the output e centerline text will start a new line whose contents are centered e spreadline text will start a new line whose contents are spread out over the whole width of the page It is approximately equivalent to spreadbox hsize text User defined commands even if they require parameters may be called directly within an Arabic context if they have been previously announced to the Arab TEX processor by allowarab command_name After substituting the parameters they are expanded exactly once and the expansion text will be processed by Arab TEX again so it must be legal ArabTpX input text Even if not announced previously user defined commands may also be called by docommand command_name and parameters The command is expanded exactly once and the expansion text after suitable substituti
54. emph emphasize sl m slm emphasize ab g abg emph alayhi as salAmu ade lt alayhi s salamu 4 2 Verbatim input After disabling language specific processing by setverb ArabTgX will not use any context information to determine the carrier of hamza Instead the user has to supply this information herself by the next character typed after lt gt Generally this character will be used as the carrier for examples and some exceptions see Table 4 3 A short vowel indicator may follow To ease automatic conversion an initial alif may also be coded lt A gt 4 3 ALTERNATE INPUT ENCODINGS 29 DIET DELETE DE isolated hamza Ed madda on alif Table 4 3 Verbatim encodings for the carrier of hamza 4 3 Alternate input encodings The ArabTgX input notation has been very carefully designed for flexibility readability and ease of use for linguists confined to standard 7 bit ASCII equip ment for processing and transmitting data However it does not make much sense re coding existing machine readable text files that have been encoded ac cording to other standards Thus some alternate reading modules have been written and a general code switching procedure has been provided An alternate reading module e g asmo449 sty for the ASMO 449 code is installed by usepackage asmo449 or by input asmo449 sty Afterwards a code_name in this case asmo449 is defined Inpu
55. for special purposes 64 Chapter 1 Introduction to Arab TEX Note This manual describes Version 4 of ArabTgX which in general is upwards compatible to earlier versions Incompatible features and extensions are flagged by an asterisk n the margin ArabTFX is a package extending the capabilities of TEX IATEX to generate an extended Perso Arabic and or Hebrew script in addition to the customary left to right scripts called Roman in the sequel Besides Arabic and Hebrew there are provisions for many other languages that use some extensions of the Arabic script we shall use the term Arabic generically to denote any supported right to left script without any cultural or political connotation An ArabTEX document is usually multi lingual and may contain right to left insertions within left to right paragraphs and vice versa There are various possibilities to encode right to left insertions in addition to several standard encodings there are also 7 bit ASCII encodings modelled after various translit eration standards there is more than one convention so the intended language must be specified ArabTEX like TEX and KIEX is not a substitute for a word processor and does not follow the WYSIWYG paradigm What You See Is What You Get where the author has complete control on and therefore is also completely responsible for every detail of the visual representation of her text Instead it is an off line system mirroring
56. gt lt U gt or also lt aa gt lt iy gt lt uw gt with the same meaning e To get the defective writing of long vowels use lt _a gt lt i gt lt _u gt e alif magsura is lt _A gt or Y 18 4 1 ASCII TRANSLITERATION ENCODING 19 Le fof wpe fo e Pelee t ol et ta loto ta Z gim n gt e Pa belahan a falah Palapa eef e EHED Sci SEE ECA ee ISS ADE ORDES rep Ges SONE FEE BE IE sonora ya alif ta magsura marbuta Table 4 1 Standard encodings for Arabic and Persian consonants e The short vowels fatha kasra damma are coded lt a gt lt i gt lt u gt and need not normally be written except in the following cases at the beginning of a word where they generate alif adjacent to hamza where they will influence its carrier when the transliteration is required in the vocalize and fullvocalize modes e tanwin is coded lt aN gt lt iN gt or lt uN gt A silent alif if required is supplied automatically it may also be explicitly written lt aNA gt Likewise a silent waw may be written NU as in lt amruNU gt e hamza is denoted by a single right quote lt gt After selecting the language by setarab the carrier of hamza will be determined from the context ac cording to the rules for writing Arabic words if that is not wanted quote the hamza see Section 4 1 3 below In the setverb mode the carrier of h
57. he command setnone Therefore short Arabic insertions must be included as arguments of RL or bracketed with lt and gt 9 5 Using ArabTpX with CJK The CJK package by Werner Lemberg supporting typesetting of texts in Chi nese Japanese and Korean to our surprise proved to be compatible with ArabTEX after a very small adjustment Due to the high resource require ments of both packages a Very Big TEX may be required for processing texts of substantial size Chapter 10 Acknowledgments The development of ArabTEX would not have been possible without the as sistance of many people and it is impossible to acknowledge every individual contribution Besides our local team i e Udo Merkel and Heribert Schlebbe helpful advice came among others from Chahriar Assad Benno van Dalen Ivan Derzhanski Wolfdietrich Fischer Ahmed El Hadi Yannis Haralambous Abdelsalam Heddaya Nicholas Heer Taco Hoekwater Yussuf Jabri Iqbal Khan Tom Koornwinder Eberhard Kriiger Asif Lakehsar Jan Lodder Richard Lorch Pierre MacKay Eberhard Mattes Fathy Neamat Allah Anshuman Pandey Bernd Raichle Ulrich Rebstock Adrian Rezus Paul Roochnik Mohamed Saba Waheed Samy Annemarie Schimmel Nariman Shehab Arian Verheij Dominik Wujastyk and Michio Yano We also have to thank all users who sent error reports comments and suggestions References e B Alavi M Lorenz Lehrbuch der persischen Sprache 5 Auflage 1988 VEB Verlag Enzyklop
58. ht Dissertation Universit t Bremen 2002 Appendix A Obtaining and installing Arab TEX A 1 Obtaining ArabTpX The ArabTEX system is available from the author s institution by anonymous FTP from ftp informatik uni stuttgart de 129 69 211 2 in the direc tory pub arabtex and from many other common servers e g the CTAN net work e ftp dante de tex archive language arabtex e ftp tex ac uk tex archive language arabtex e ctan tug org tex archive language arabtex The files may be transferred individually or as a package arabtex zip for PC systems arabtex tar Z for U IX systems we recommend to get and inspect the file arabtex htm or readme txt first Successfull operation on the Apple Macintosh in conjunction with OzTpX has also been reported At the time of this writing version 4 00 is current The Nasta liq font is still under development Naskh will be substituted automatically ArabTEX is being maintained and extended if required on a regular schedule The current status can be found in the file arabtex htmor by sending an EMail message with arbitrary content to arabtex informatik uni stuttgart de 74 A 2 INSTALLING ARABTEX 75 A 2 Installing ArabTEX The installation procedure is strongly system dependent and we rec ommend securing the assistance of a local TpXpert You have to in stall the fonts provided nash14 nash14bf xnsh14 xnsh14bf hclassic hcaption with their pk and tfm fi
59. ic Windows see Table 4 6 is an 8 bit code closely related to 7 bit ASCII whereas the lower 128 positions are identical to ASCII ISO 646 some of the upper 128 positions contain the Arabic characters plus additional Persian Urdu graphic and control characters The notes on vowelization and transliteration of ASMO 449 apply also 4 3 4 UNICODE Arabic The file utf8 sty contains a reading module for the Arabic and the Hebrew segment of UNICODE in UTF 8 encoding It is installed by the ATEX command usepackage utf8 or by input utf8 sty UTF 8 UNICODE Transmission Format see tables 4 7 4 8 and 7 4 is a multi byte encoding which for Arabic and Hebrew uses two bytes per character whereas ASCII characters use a single byte Far eastern languages are encoded in three bytes per character This is in contrast to UNICODE itself which always uses two bytes per character The module is activated by setcode utf8 all following Arabic and Hebrew text will be considered to be coded according to the UTF 8 encoding standard To use the correct font select the appropriate language The ArabT X notation may be reactivated by setcode arabtex 4 3 5 ISIRI 3342 The file isiri sty contains a reading module for the ISIRI 3342 Persian Stan dard Code It is installed by the TEX command usepackage isiri or by input isiri sty The module is activated by setcode isiri all following Arabic text will be considered to be coded according to the
60. ine 13 INDEX spreadtrue 64 ta 61 tabular environment 67 tracingarab 65 transfalse 38 transtrue 38 twoblocks 78 usepackage hebtex 52 vfil 12 vfill 12 vocalize 20 21 53 76 vskip 13 vspace 13 yahdots 64 yahnodots 64 lt 10 11 15 40 67 gt 10 11 15 40 NI 12 hanging he 46 19 21 22 54 IB 20 IBB 20 11 19 21 ayn 19 hamza 18 A 17 22 27 A 19 21 24 A 42 CA 19 _A 17 21 22 a 41 42 47 _a 17 20 22 a fatha 18 22 aa 17 22 abbreviation 27 abjadnumbers 67 abjad sty 64 abjad numbers 64 accents 54 ae 44 Afghanic 47 aH 41 ayn 19 INDEX al 19 38 alif 27 dagger 17 20 22 initial 27 magsura 17 21 23 46 silent 21 23 Qur an 20 22 silent 21 23 38 small 20 22 below 20 22 Allah spelling 26 aN 18 21 23 aN_A 21 23 aNA 18 21 23 aNY 23 ao 44 arabart cls 13 67 arabbook cls 13 67 Arabic 6 generic term 6 Arabic IZTEX classes 67 Arabic context 10 12 Arabic environment 10 Arabic fonts 12 15 Arabic group 11 Arabic item 11 Arabic MS Windows 33 Arabic MS DOS 28 Arabic number 11 Arabic quotation 10 Arabic quotes 11 Arabic script 6 Arabic word 11 arabrep cls 13 67 ArabTEX commands 11 12 archaic text 63 ASCII 28 30 33 ASMO 449 28 30 aspirated consonant 43 aspiration 44 Assad Chahriar 69 assignment 13 global
61. ing form of tanwin is generated instead Defective writing The coding lt _a gt will produce a Qur an alif ac cent also called dagger alif instead of an explicit alif character which would be coded lt A gt or lt aa gt Likewise lt _i gt will produce a small alif below the preceding consonant in place of lt I gt iy gt u will produce an inverted damma in place of lt U gt uw If a long vowel follows a consonant the corresponding short vowel is implied The long vowel itself carries no diacritical mark If no vowel is given after a consonant sukun will be generated except if a double quote precedes the next consonant The lam of the definite article receives no suk n if a doubled sun letter follows alif at the beginning of a word carries wasla instead of the vowel indicator if the preceding word ended with a vowel e vocalize As above but sukun and wasla will not be generated except if explicitly indicated by quoting see section 4 1 3 e novocalize No diacritics will be generated except if explicitly asked for by quoting see section 4 1 3 22 CHAPTER 4 INPUT ENCODING CONVENTIONS In all modes a doubled consonant will generate 50000 and lt A gt always gener ates madda on alif After lt aN gt the silent alif character is generated automatically if required The silent alif may also be explicitly indicated by lt aNA gt or coded literally as l
62. is can be achieved by prefixing the command cap to the word in question If the first letter is hamza or ayn the next letter will be capitalized This feature may also be used after the article or a prefix and even in other arbitrary positions cap will only influence the following letter The Arabic writing is not affected Chapter 6 Support for other languages using Perso Arabic script ArabTEX is primarily intended for typesetting texts in classical and modern Arabic but it also provides some support for several other languages that are customarily written using the Arabic alphabet or some extension of it In order to switch to the conventions for one of these languages say Nsetfarsi seturdu setpashto setmaghribi etc Nsetverb will switch off any lan guage specific processing setarab can be used to switch back to the Arabic conventions After selecting the language lt and gt are active as delimiters for quotations setnone will return lt and gt to their normal TEX meaning Quotations still can be bracketed by lt and gt or by using RL This part of Arab TEX relies heavily on contributions from the user community we want to especially mention Ivan Derzhanski who completely reimplemented the routines for processing Persian and Anshuman Pandey for the Urdu part As we extensively modified these contributions again while integrating the system we are solely responsible for any remaining or newly introdu
63. les on the font search path of your TEX system and the sty files arabtex tex and hebtex tex on the source search path usually TEXINPUT of your sys tem Possibly you will also have to rename the pk files according to local conventions and as a last resort you can try to recreate the fonts from the mf METAFONT sources Additional fonts whenever available are installed analogously ArabTEX has been found to cooperate well with TEX versions 3 xxx IATEX2e IATEX versions 2 09 of 1991 or later MIT X NFSS and NFSS2 not required and previewers that can handle fonts of more than 128 characters TEX XET or TEX XET are not required and their additional features are presently not exploited The TEX hash size should be at least 3000 to 3500 especially when using ArabTEX in conjunction with IATEX and if the transliteration module is used Use of a BIG TEX may be necessary when using the NFSS2 due to the latter s high demand on string storage Space and time requirements are not negligible and have increased during development however ArabTEX currently still runs albeit slowly even on a PC XT standard configuration Appendix B Release history The development of the Arab TEX system began around 1991 as a private project of the author for his personal use However it turned out soon that the package if at all feasible could be of use for others also who see the need of printing Arabic text without involving a spe
64. mined by the context 2 oF amruN y amrun ibiluN hl gt ibilun u htuN Cel gt uhtun oy N i gt vl 2 ra suN Jo rasun ar asu arasu sa ala JG saala gt 2 qara a bs qara a bu suN bwsun ab usuN E ab usun 24 AIT ra ufa raufa ru asA u La 3 rwas u bi ruN X berun s d gt 5 as ilaTuN asilatun ka iba SS kariba qA imuN gt q imun Es ri AsaTuN rrasatun su ila JUL swila samA uN Fl sam un barl uN iS bardun sU uN 42 siPun bad uN 2X badun say uN d Sayun say iN zs Sayin say al s Sayan P ni sA ala Jil s ala mas alaTuN JUL masalatun z y2 saw aTuN clic sawatun _ha tI aTuN des hatratun Old Hamza convention In an older writing style that is used e g in some Qur an editions the hamza is sometimes put below its carrier or on the connecting line This style may be switched on by oldhamza and off again by newhamza ye as ilaTuN ras ilatun Ka iba kariba qA imuN ES qaimun 2 3 P UL su ila swila say aN s Sayan ha tl aTuN amp L gt hatratun Madda in the context is generated automatically m 2 m AkiluN akilun qur AnuN o quranun ra Ahu ol raahu To reproduce the historic writing correctly it can also be explicitly indi cated by A I U in other contexts a
65. mmand usepackage arabtex will do all the loading If Hebrew mode see Section 7 is required use input hebtex tex resp usepackage hebtex instead The required modules of ArabTgX will be loaded automatically With Plain TEX only add input apatch sty at the beginning of your doc ument proper after reading any other auxiliary files to get the most recent corrections ArabTEX will load many internal files automatically and defines a large numbers of internal commands These all contain an at sign 0 within their names and thus should not interfere with user defined commands Collisions with other macro packages are possible however and may lead to surprises and interesting effects ArabTEX tries to diagnose the presence of some other packages with which it could run into conflicts and sometimes locally modifies itself accordingly For this to be possible in case of doubt the Arab TEX package should be loaded last Users still running ATEX 2 09 horror should either add the option arabtex to the documentstyle command or rather upgrade to IXTEX2e 15 16 CHAPTER 3 RUNNING ARABTEX 3 2 Language selection The processing of input text in ASCII transliteration encoding is somewhat lan guage dependent Thus before the first Arabic quotation or Arabic environment you have to indicate the desired processing mode by one of the language selec tion commands setarab setfarsi seturdu setpashto setmaghribi etc
66. ng of a word alif with wasla will be generated e otherwise a sukun will be put on the preceding character The following character will be processed again The double quote will not show up in the transliteration In vocalize mode see Section 4 1 2 quoting will turn a short vowel off likewise in fullvocalize mode quoting will also turn a sukun off Put in other words quoting will toggle the generation of short vowel indicators and suk n on and off 4 1 ASCII TRANSLITERATION ENCODING 23 4 1 4 Ligatures There is no way to explicitly enforce ligatures as a large number of them are generated automatically The results will not always look satisfactory so we recommend inspecting the output after the first run Any unwanted ligature can be suppressed by interposing the invisible consonant lt gt between the two letters otherwise combined into a ligature After ligsfalse in the middle of a word fewer ligatures will be produced for some texts this looks better You can return to the normal strategy by ligstrue 4 1 5 Coding examples for Arabic Short vowels fatha kasra damma are denoted as in the transliteration by the small letters a i u mana a mana a _dahaba cdi dahaba sariba SA sariba gabila qabila a zuma ls lt azuma alu Je lt alu bal J bal ni ma ex nima yaktub ES yaktub Long vowels a z are denoted by capitals A I U or by aa iy uw qAtala Sb g tala nUzi a
67. o bold face setnash or rm will switch back 1We would prefer to use a single switching command like language setlanguage or selectlanguage but these names have already been preempted by TEX3 and the Babel package 2Note for advanced TEX users All language selecting commands except setnone set the character lt to be active If Arabic insertions are not needed or are always started with lt or RL the user may reuse the command lt for other purposes or deactivate it by setnone or catcode lt 12 to return it to its normal meaning 3 3 FONT SELECTION 17 With Plain TEX the fonts are available by default at 14 point size only which cooperates well with the cm fonts at 10 points Additional sizes are defined within the file arabtex tex they can be activated whenever needed by the command setarabfont font With IATEX the font size changing commands will also operate on the Arabic fonts We strongly recommend migrating to xnsh14 and xnsh14bf the command oldarabfont will switch to the old fonts if necessary and newarabfont will restore the default The fonts nash14 and nash14bf are of inferior quality and will be phased out gradually All fonts indicated presently are in the Naskhi style we had started to write a Nastaliq font for Persian and Urdu but ran into grave implementation problems yet unsolved Due to a donation by Taco Hoekwater the fonts xnsh14 and xnsh14 are also available in
68. o receive a Sadda and to prevent a sukun on the lam The transliteration in both cases is identical Hyphens lt gt are used for tying words together and for separating prefixes and the article in these cases they start a new word Hyphens can also be used to indicate inflectional endings a connecting vowel in Arabic or an I The former use of A I and U has been discontinued in ArabTEX version 4 4 1 ASCII TRANSLITERATION ENCODING 21 izafet connection in Persian Hyphens will show up in the transliteration Additionally at the beginning and or the end of an otherwise isolated word they enforce the use of the connecting form of the adjacent letter if it exists like e g in the date lt 1400 h gt e A double hyphen lt gt between two otherwise joining letters will break any ligature and will insert a horizontal stroke tatwil kasida without appear ing in the transliteration It may be used repeatedly See also Section 8 5 automatic stretching For special applications it can also be coded lt B gt and lt B gt will behave like any ordinary consonant and may carry vowel indicators tanwin sukun and in the combination lt BB gt Sadda 4 1 2 Vowelization There are three modes of rendering short vowels e fullvocalize Every short vowel written will generate the corresponding diacritical mark fatha kasra damma except if quoted If lt N gt follows a short vowel the correspond
69. ode ISO 8859 8 In fact these are three different encodings catered for by a single reading module see the code assignments in Table 7 3 Observe that the He brew characters in ISO 8859 8 are supported both by this reading module and by the Hebrew MS Windows encoding mentioned in Section 7 3 The code switch ing commands setcode hed setcode newcode setcode pccode and setcode iso8859 8 all activate the default reading module 56 CHAPTER 7 HEBREW MODE LA AAA EXE e eed EET Bl baba OSE REE Ps T D ei Spee ele eet deh e ERR PP Fee deine Ae A PERSA We E eee Table 7 1 ISO 8859 8 and Windows CP 1255 code table 7 4 HEBREWTEX OLDCODE AND NEWCODE 97 Ciruelas aa Eps Csa FE s P ES ERR e pla ost da Pata ate e a Aa ADORA AO A EPE ee a Eee cpp SS Pret j n ES E REBERE Presea Ey E BE TOE BEDO EE BE PARA TE ee Fp eee Table 7 2 MAC Hebrew code table 58 CHAPTER 7 HEBREW MODE 1 e ee as PRECE SO petra TE pe EX ZEE ES ES I d EO c RN RR E pese ep ADOS EEE Re Aaa AA paa Sea SA Aaa ima apesar j eee TAO BR Pp mee el s Tis Peo ETE be Table 7 3 HebrewTEX CP 1255 and ISO 8859 8 code table 7 5 BHS ENCODINGS 59
70. on of parameters will be processed by ArabTRX again Parameter assignments inside an Arabic context may be performed by doassign parameter value The effect is normally local except if the form doassign global parameter value is used Any non recognized command will generate an error message and will be echoed verbatim in the output Even though ArabTEX tries hard to get into synchro nization again additional spurious errors may occur Inside an Arabic Context normally no further IXTEX or ArabTEX environment may be nested this restriction does not apply to the experimental IATEX doc ument classes arabrep cls arabart cls arabbook cls see Section 9 1 which are provided for right to left documents For a list of all available commands consult the Index to this report As a reminder the command arabstat will cause a list of all commands that are presently valid inside Arabic text to appear in the TEX log file 4This is no strong restriction as the expansion may contain docommand calls again Chapter 3 Running Arab TfX 3 1 Activating ArabTpX ArabTEX can be used both with Plain TEX and with WTpX but is activated differently in both cases With Plain TEX a small loader program is activated by the command input arabtex at the beginning of an input text It will define a default font prepare a minimal environment simulating the very few IATEX like features needed and load the ArabTgX macro files With KIEX2 the co
71. phen will be ignored e Prefixes may be separated by a single hyphen which appears in the tran scription without changing the Hebrew writing e For those rare cases where a consonant is missing input bar quote or lt N gt to produce a small space this may also carry vowels Accents and cantillation marks are not supported e For special applications as in the table above the input encoding M will produce a box O which may carry vowels the TFX package latexsym is required to show the box e Abbreviations are marked by Nabbr 7 3 ISO 8859 8 and Hebrew MS Windows ISO 8859 8 is an 8 bit encoding extending 7 bit ASCII with Hebrew letters Within Hebrew MS Windows the code page CP 1255 provides a superset of ISO 8859 8 containing a full complement of Hebrew vowels and a host of mis cellaneous special characters ArabTeX provides within the package cp1255 sty a reading module for a subset of CP1255 containing all Hebrew characters from ISO 8859 8 and all Hebrew vowels of CP1255 but omitting the extra special characters see Table 7 1 This encoding is activated by the switching command setcode cp1255 or setcode hwin 7 4 HebrewTpX oldcode and newcode There is a default reading module for the Hebrew characters in code positions 96 122 HebrewTEX pccode in code positions 128 154 as generated by the editor HED and also in code positions 224 250 Hebrew TEX newc
72. r word initial a Ab el ab bad SU bad bId de bid bUd 353 bud Note that I yields a ya ye me ruf with zir whilst E yields a ya ye mejhul without zir Similarly U yields a waw e me ruf with pis whilst O yields a waw e mejhul without pis tir W tir tE g amp teg dUr 393 dur zOr 207 The diphthongs i and u are written ay and aw pay 3 pay naw naw Intervocalic hemze is written pA IZ XU paz miyA I dl miyaz mIgU I fa MIJET tawAnA I lls tawanax zanA sU I dt Zanasur Silent word final waw is generated by _U or 0 t_U y tu d_U 93 du tO to dO do 1We gratefully acknowledge the voluntary help by Ivan Derzhanski who wrote this chapter and implemented the language specific processing As we extensively modified his routines during system integration all responsibility for any remaining or new errors rests with us 6 1 PERSIAN FARSI DARI 43 Waw e me dul is written w it is omitted in the transliteration and the pre ceding xe receives no jezm hwAb bl s hab hwI s his hwod 335 hod Ha ye hewwez e mezxfi is generated by H or optionally by e a or A the latter forms are deprecated It does not receive a jezm even in fully vocalised mode and is not joined to a following letter _hAneH se haneh c e a gt eh naH nah yal_aH als yalah yal A als yalah hAneHhA ul h nehh _hAneH hA l ail haneh ha Short edafe is written e or i
73. s sty This is a small utility for typesetting classical Arabic poetry in two parallel blocks such that every line contains two half verses For its use see the file itself SH CE Logi Us sy pH Ju al dad NIE u AI fo GES ll gie de pia E dy sal suy JR dls er gi SIR Lee All Juli G EU de Sx TE ii CAS CB ii gb ui GU UE us e Figure C 1 Example of Arabic poetry source unknown C 3 raw sty This is a small utility to ease the processing of input files that have been pro duced by some OCR reading program It will deactivate most of TEX s special characters This package depends strongly on the special application if you need it or a variant of it enquire with the author INDEX Index quoting 21 19 21 11 20 54 Xu 12 19 21 lt 10 11 15 40 LR 11 NRL 10 15 40 NN 12 abjad 64 abj ctr 67 accentshigh 64 accentslow 64 allowarab 13 arabfalse 38 arabstat 13 arabtexconfig 65 arabtrue 38 baselineskip 65 baselinestretch 65 begin RLtext 10 begin arabtext 10 76 bigskip 12 cap 39 centerline 13 clearpage 12 colsep 78 doassign 13 docommand 13 ds 61 emphasize 12 end RLtext 10 end arabtext 10 76 footnote 12 fullvocalize 18 20 21 76 hc 61 81 hfil 12 hfill 12 13 hp 61
74. sdig A uh u rol rasdig uhu ya g I u sg yagru sU ila du sipila 26 CHAPTER 4 INPUT ENCODING CONVENTIONS Sadda A double consonant must be written twice even if it is coded by more than one character nazzala J5 nazzala ba s sAruN NUS bassarun nawwara 55 nawwara z sayyiduN A sayyidun sa AluN Jile sa gt alun sabiyyuN v sabiyyun aduwwuN HS aduwwun Instead of iyy uww one can also write Iy Uw sablyuN gt sabiyun adUwuN HE lt aduwun x te Assimilation the definite article may be always written al a following sun letter must be written twice like in the Arabic spelling The translitera tion and the use of sukun are adjusted accordingly al ddAru ERU gt ad daru al rra gulu ja ar radulu al ssanaTu Ed as sanatu al nnAru so an naru al gAru adi al garu al bAbu odi al babu al llaylaTu sh al laylatu al 1lisAnu Sui al lisanu al 11 ahu 3l al lahu The article may also be written in the assimilated form with identical result ad dAru ERI gt ad daru ar ra gulu jesi ar ragulu as sanaTu za as sanatu an nAru soi an naru In some special cases the literal spelling must be used alla dI P alladi alla dIna RO ralladina allatI A allati however 0 al lla dAni o al ladani al llatAni ge al latani al llawAtI E ral lawati Wasla an auxiliary vowel at the beginning of a word is always written but in the middle of a sentence gener
75. sses The experimental I4TEX2z classes arabart arabbook arabrep extend the standard classes article book and report in several respects The overall document layout has been arabized page numbers are in Indic numer als and columns run from right to left The format of running heads depends on the context of the corresponding sectioning commands Within Arabic environments which are bracketed by begin RLtext and end RLtext most LXTEX commands and environments are allowed includ ing all sectioning commands tabular tabbing even tableofcontents and use an Arabic looking format All arguments that denote text to be typeset are interpreted according to the currently activated Arabic encoding Other arguments keep their ATEX standard meaning including the preamble of tabular whose columns are processed from left to right visual format ting Generally only the basic functionality is available optional arguments in brackets are not yet supported The commands pagenumbering abj and abj ctr generate abjad nu merals for page numbers and or arbitrary ATEX counters The document will start out in Roman mode but may even be made into a single Arabic environment Outside of Arabic environments the IATEX commands revert to their standard meaning The picture environment and mathematical displays presently only work in Roman mode but may contain Arabic insertions 9 2 Using ArabTpX with EDMAC Arab
76. t A gt in novocalize mode If a silent alif magsura is wanted instead write lt aN_A gt lt aNY gt lt _A gt or lt Y gt The tanwin fatha is normally positioned on the last consonant of the word even if a silent alif follows If it is instead supposed to go onto the alif as required by some modern Arabic writing conventions or in Persian this behaviour can be achieved by the option newtanwin The option oldtanwin will restore the classical behaviour A silent alif after waw is indicated by UA or WA with a capital W 4 1 3 Quoting In novocalize mode see Section 4 1 2 a double quote lt gt will modify the meaning of the following character as follows e if a short vowel follows the appropriate diacritical mark fatha kasra damma will be put on the preceding character If lt N gt follows the short vowel the appropriate form of tanwin will be generated instead At the beginning of a word alif is assumed as the first character e if the following character is a single right quote a hamza mark will be put on the preceding character even if in conflict with the hamza rules At the beginning of a word lt gt will generate an isolated hamza e if the following character is the invisible consonant lt gt the connection between the adjacent letters will be broken and a small space inserted This can also be denoted lt gt instead of lt gt At the beginni
77. t encoding is switched by the command setcode code_name that changes the coding for Arabic text globally Encoding may be switched several times in the same document pro vided the appropriate reading modules are installed setcode arabtex or setcode standard reverts to the standard ArabTpX notation Texts coded in an alternate encoding are always rendered verbatim there fore novocalize vocalize fullvocalize and the language selection com mands like setarab etc generally make no sense and are temporarily disabled 4 3 1 ASMO 449 ISO 9036 ASMO 449 see Table 4 4 is a 7 bit code differing from ASCII ISO 646 mainly by replacing the Roman letters by the Arabic letter characters and diacritical marks the Arabic digits share their positions with the ASCII digits The posi tions of special and control characters in both codes are identical ASMO 449 is supported by Arabic MS DOS The file asmo449 sty contains a reading module for the ASMO 449 code identi cal to ISO 9036 It is installed by the TEX command usepackage asmo449 or by input asmo449 sty The module is activated by setcode asmo449 or setcode is09036 all following Arabic text will be considered to be coded according to the ASMO 449 standard 30 CHAPTER 4 INPUT ENCODING CONVENTIONS J ES ES ES E Pc le Ae EE a EEA Fe e E sese sand Gi bus s Ta ac Ei ce ara Seopa ee ie eh ERAM ee ae RE EEE ES RES ES E ee Table 4 4 ASMO 449
78. ten from right to left and thus have to be rendered accordingly and a way of setting parameters influencing the rendering of Arabic text Additionally we have to support a representation of the Arabic text that can be handled using the available standard equipment Internally of course there is much more to it TEX may be very good at render ing left to right texts but Arabic runs from right to left so we have to instruct TEX how to do it the other way and if we want to mix Arabic and traditional left to right text within the same paragraph things can get extremely compli cated This shows indeed ArabTpX is very large and comparatively slow For tunately computer technology is advancing at a high pace so that the penalty of using very involved algorithms will decrease just by simple waiting and we thus never were too much concerned about efficiency even if we tried to avoid the worst blunders In order to see how ArabTEX works let us consider a very simple example Figure 1 1 contains the complete TEX input text Figure 1 2 the corresponding output In this example using ATEX we use a standard article format we activate the ArabTEX package and we set a few options our text is supposed to be in Arabic we want to see all vowel marks this is not always done in Arabic printing we additionally want to get the scientific transcription and of course the Arabic writing The document proper consists of a centered headline and
79. the classical publication process where an author concentrating on the content and the structure of her paper only will deliver a manuscript to the publisher who will take care of a reasonable visual rendering of the text The publisher involved in our case is TEX the famous typesetting program by D E Knuth and the amount of technical typesetting knowledge embodied in the algorithms of TEX is such that in general the author will tacitly accept the resulting output if she had stated her preferences in a sufficiently precise way 8 CHAPTER 1 INTRODUCTION TO ARABTEX otherwise the formatting task has to be repeated under changed specifications as also in the classical case TEX is easily extensible by providing its own macro programming language enabling the user to implement packages that contain algorithms embodying additional typesetting knowledge A prominent example is IATEX by L Lamport a package providing standard formatting styles for several common document classes One of the basic concepts of ATEX but already contained in TEX itself is structured markup the user generally only indicates the logical structure of the document whereas the formatting rules are laid down within descriptions of standard document classes ArabTpX at the user interface needs to add only very few features we provide a way to indicate which parts of the input text are considered to be in Arabic or some other supported language writ
80. tivated by setcode arabtex 7 7 Hebrew transcription systems transtrue activates the standard ZDMG transcription and there are provi sions for additional transcription systems e settrans zaw switches to the conventions of Zeitschrift f r die Alttes tamentliche Wissenschaft recommended e settrans gesenius activates the system used in W Gesenius Hebrew Grammar 26th edition deprecated e settrans standard restores the standard ZDMG transcription 7 8 Hebrew fonts e As a default the fonts hclassic default and hcaption are dis tributed with ArabTEX Switch to hcaption by hp and back by hc 7 8 HEBREW FONTS 61 Ts uo s i oc o om om N D680 D690 D6A0 D6BO D780 D790 D7A0 D7B0 Table 7 4 UNICODE Hebrew 62 CHAPTER 7 HEBREW MODE These fonts have been designed and donated by Joel Hoffman who also wrote several macro packages from which we took a few ideas for posi tioning punctuation There is a variety of other usable fonts on the CTAN archives e If no vowel points are required all the standard fonts DeadSea OldJaffa TelAviv and Jerusalem can be used if locally available They are activated by the commands ds Noj ta jm the command hc switches back to the default hclassic font e The Shalom family of fonts if av
81. u 445 ramahu sIm_aYhum er simahum The short vowel u can be written as a long vowel by _U gt _U1_A Jol sula ULA i Vol uli gt _U1U yal la gt _UlAka gt ulaka _ULA ika eU ulaika Tanwin The indefinite endings un in an are written uN iN aN or aNA Silent alif in an may be indicated by A or omitted if neces sary it is supplied from the context ra guluN Es ragulun ra guliN Je ragulin ra gulaN des ragulan madInaTaN X madinatan gamIlaTaN Le gamilatan i_daN IH idan samA aN l sam an There is a special case ribaNU riban amruNU n camrun amriNU ge camrin however amraN Wa camran Tanwin fatha is traditionally put on the last consonant even if a silent alif follows Some modern conventions and also Persian practice require to put it on the alif in this case This behaviour may be switched on by newtanwin and off by oldtanwin newtanwin mode is the default for Persian ra gulaN ragulan i daN fal dan A silent alif magsura after tanwin is written aNY or aN_A hudaNY SAK hudan fataN_A fatan compare al hudY sig al huda al fat_A ESI al fata 4 1 ASCII TRANSLITERATION ENCODING 25 Ta marbuta is denoted by T kalimaTuN is kalimatun kalimaTiN is kalimatin kalimaTaN kalimatan fatATuN fatatun fatATiN fatatin fatATaN fatatan Hamza is indicated by its carrier is deter
82. xt This mode is activated by transtrue and may be switched off again by transfalse If only the transliteration is wanted you can deactivate the arabic writing by Narabfalse it can be reactivated by arabtrue If both modes are active their output will be interleaved line by line The font used for the transliteration is normally italic it can be changed globally by settransfont font 5 1 ZDMG transliteration style The ZDMG transliteration is in fact a family of closely related but slightly dif ferent transliteration conventions for several languages using the Perso Arabic script Therefore for producing it correctly the appropriate language mode must have been selected For Arabic text the following special cases are handled e after the definite article a double consonant will be assimilated e an initial vowel will be replaced by an apostrophe whenever the preceding word ended with a vowel in this case a wasla appears in the Arabic writing If that is not wanted start with hamza e a silent alif or alif maqs ra after lt N gt tanwin and lt U gt is omitted in the transliteration The same happens after waw if it is written as a cap ital lt W gt e To correctly reproduce some historical writings a silent long vowel af ter lt _a gt is omitted in the transliteration For examples see section 4 1 5 1The former option atrans is no more necessary 39 40 CHAPTER 5 TRANSLITERATION For Persi
83. yawma wa qabla an yutimmu gu hA kalAmahu bada a al himAru yanhaqu fl i s tablihi fa qAla lahu sadIquhu innI asma u himAraka yA gu hA yanhaqu fa qAla lahu gu hA garIbuN amruka yA sadIqI a tu saddiqu al himAra wa tuka_d_dibunI end RLtext end document Figure 1 1 Sample ArabTEX input 10 CHAPTER 1 INTRODUCTION TO ARABTEX oe amp juh wa himaruhu ata sadiqun la guha yatlubu minhu himarahu li yarkabahu fi safratin qasiratin wa qala lahu J s Sas SY ie cite Gas sawfa uiduhu silayka fr I masgi wa Padfasu laka gt ugratan EN aU eh shall d a Ie fa qala guha E Je ana gt asifun giddan anni la castatiu tan gt uhaggiga laka ragbataka fa lhimaru laysa huna l yawma dad al Han des di E abili ivi wa qabla an yutimmu guha kalamahu badaa I himaru a fe stablihi ERE AW ddl iy i 2 ol ss fa qala lahu sadiquhu inni gt asmasu himaraka ya guha yanhaqu _ fa qala lahu guha e gj garibun amruka ya icd a tusaddiqu himara wa tukaddibuni RIS q Sad she G A Cu Figure 1 2 Sample ArabTpX output Chapter 2 Input to ArabTEX ArabTEX is activated by the command input arabtex with Plain TEX or usepackage arabtex with IATEX After activating ArabTFX select one of the language specific Arabic writing styles e g Nsetarab see Section 3 2 Your modified TEX
84. zm 42 kasida 20 26 64 kasra 18 20 21 Khan Iqbal 69 Knuth Donald E 6 Koornwinder Tom 69 Kr ger Eberhard 69 Kurdish 40 la 26 Lakehsar Asif 69 Lamport Leslie 7 language selection 10 15 IXTEX commands 11 12 67 IXTEX environment 67 Lavagnino John 67 Lemberg Werner 68 li 26 Library of Congress 39 ligature 22 27 76 breaking 19 20 22 27 Lodder Jan 69 long vowels 17 20 Lorch Richard 69 Macintosh 73 MacKay Pierre 69 MacOS Arabic 30 madda 19 21 41 44 Maghribi 43 mappiq 53 maggef 54 mathematical insertion 11 matres lectionis 53 Mattes Eberhard 69 Merkel Udo 69 METAFONT 74 meteg 54 58 86 MIT X 74 Module list 65 MS Arabic Windows 33 MS Windows 33 N 20 21 38 naming conflict 66 nasalization 44 Naskh 15 16 73 74 Nasta liq 16 41 43 73 Neamat Allah Fathy 69 nesting 11 13 NFSS 74 nikudot 16 no vowel 54 non Arabic insertion 11 NU 18 23 numbers 27 43 abjad 64 Arabic 11 nun e gunnah 44 O 40 41 0 46 option abjad 64 asmo449 28 iso88596 28 twoblks 78 Ottoman 40 Pandey Anshuman 44 69 Pashto 43 47 patach furtivum 53 PC implementation 73 Persian 40 Persian copula 41 PicIEX 68 picture environment 67 pis 41 Progress report 65 pseudo fonts 15 punctuation 11 quotation Arabic 10 INDEX non Arabic 11 Roman 11 quoting 18 20 21 Qur an

Download Pdf Manuals

image

Related Search

Related Contents

USER MANUAL - A Wind Of Change  manuel d`installation ci-joint  Suitcase Fusion 5 User Guide Mac  Northwood - ITS Networking & Telecommunications  A4 Tech. PK-35N User's Manual  RCA F36688 User's Manual  Samsung SGH-E900 User Manual  IE User manual - Repro Schicker AG  Dräger X-am 5600    

Copyright © All rights reserved.
Failed to retrieve file