Edit | History | Changes Home page | Site map | Search | Recent changes | Help

Arabic IL0->IL1 Annotation Manual

Note: For the primary basic steps refer to the the English manual. This manual only addresses the points that are different from the English manual and the Arabic specific instructions.

I. Keep the top most header node that has the whole sentence in it. This is necessary for TIAMAT to work.

II. In English we remove the articles (a,an,the). Do the same thing here. For example: the arabic Al<iliktroniap (the electronic) becomes like a dicitionary entry: <iliktroniy -- so (1) the Al- goes away and (2) the feminine marker goes away. The feminine marker should only go away for words that can be masculine. for example, AltijArap (commerce) doesn't become masculine.

III. For prepositions, do the same as english and make them all separate including the ones that are not written separately (like b-,k- and l-).

IV. Relations should be specified. Example: Waly Eahod dubay is subject and minTaqap is object, etc. (if it is not subj/obj/iobj it is a modifier).

V. For now, we might have to clean the word manually: Dubai in Arabic is dubay~ (not dub~iy = ursine)


Version 2, Wed 16 Jun 2004 10:01:59 - created Wed 16 Jun 2004 10:01:18
Edit | History | Changes Home page | Site map | Search | Recent changes | Help