
Arabic Corpus
Arabic Annotation Corpus

The Arabic corpus is a subset of the LDC's "Multiple-Translation Arabic (MTA) Part 1". The articles were selected to (1) reflect the domain of focus (economics) and (2) satisfy a global word count of 2000 words; each of the 9 articles is about 200-300 words. The four English translations per article are the best of all the human translations, listed in order from the top: ahd > ahg > ahi > ahh.

The article numbers are listed below together with their headlines.

<Article#>::<LDC#>: <Headline>

A1::artb_004.sgm: Inauguration of free zone in Dubai for e-commerce

A2::artb_030.sgm: World needs at least $600 billion to solve water problems

A3::artb_046.sgm: "Tourism and Shopping Festival" opens in Egypt

A4::artb_056.sgm: Egyptian tourism recovers partially since the Al-Aqsar attack

A5::artb_067.sgm: Egypt's Tourism Minister expects a revival of tourism next autumn

A6::artb_522.sgm: Jordanian monarch meets Syrian Prime Minister

A7::artb_528.sgm: Investment projects in Yemen amount to over 591 billion riyals

A8::artb_556.sgm: Kuwait money market index drops by 23.5 points

A9::artb_564.sgm: Developing external trade in largest Chinese trade center
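
The index lines above follow the <Article#>::<LDC#>: <Headline> pattern, so they can be split mechanically into their three fields. The following is a minimal sketch in Python, assuming that every article number has the form A followed by digits and that the LDC file name contains no spaces or colons; the names ENTRY and parse_entry are hypothetical and are not part of the corpus or of any LDC tooling.

```python
import re

# Assumed shape of one index line: <Article#>::<LDC#>: <Headline>
# ENTRY and parse_entry are illustrative names, not part of the corpus.
ENTRY = re.compile(r"^(A\d+)::([^\s:]+):\s+(.*)$")

def parse_entry(line):
    """Return (article_id, ldc_file, headline), or None if the line does not match."""
    match = ENTRY.match(line.strip())
    return match.groups() if match else None

# Two lines copied from the index above.
index_lines = [
    "A1::artb_004.sgm: Inauguration of free zone in Dubai for e-commerce",
    "A8::artb_556.sgm: Kuwait money market index drops by 23.5 points",
]

for line in index_lines:
    print(parse_entry(line))
```

Lines that do not fit the pattern yield None rather than raising an error, so blank lines between entries can be passed through and filtered out afterwards.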


Version 5, Mon 01 Dec 2003 16:01:45 [NYH] - created Wed 29 Oct 2003 15:07:32 [NYH]