Cross lingual and Multilingual Text Retrieval

General Course Information

This web page provides information (slides, etc.) for the seminar 'Cross lingual and Multilingual Text Retrieval', which will be given during Winter Term 2010/2011 by Farag Ahmed, Marcus Nitsche and Andreas Nürnberger. The prerequisite for attending this course is basic knowledge of computer science (Vordiplom). The course is therefore open to all computer science students who hold a Vordiplom.

Description of topics

The increasing diversity of web sites has created millions of multilingual resources on the World Wide Web. At first glance, it seems that more and more information can be retrieved by non-English-speaking people. However, in order to retrieve information written in these languages, some issues need to be tackled. In this seminar we will deal with different problems related to multilingual text retrieval and analyze approaches that help to overcome issues in areas such as:

  • Word sense disambiguation (Word translation disambiguation)
  • CLIR interaction tools (incl. appropriate user interfaces / interaction sequences)
  • Query expansion
  • Comparable corpora construction
  • Corpora alignment (word or sentence level)
  • Text segmentation (e.g., Chinese text segmentation)
  • POS tagging (part-of-speech tagging)
  • Cross-language information retrieval evaluation
  • Language identification, etc.

Course Schedule and Room Assignments

Time: Monday, 11:00-13:00
Start: 18.10.2010


Course Staff

If you have any questions concerning the lectures or assignments please contact (if possible by email)

Requirements for the 'Schein'

The following criteria have to be fulfilled for the 'Schein':

  • 30-minute presentation on a chosen topic (topics can be chosen in the first lesson)
    delivery of the slides 1 week before the presentation date
  • active participation and reasonable discussion about the presentations
  • a written scientific paper of 7-10 pages (hints will be given at the beginning of the seminar)
    delivery of the paper 1 week before the presentation date
  • participation in the other presentations

If you need a graded certificate ('benoteter Schein'), you have to announce this in the first lesson!

Hints on the delivery

The paper, the slides and the reports can be delivered by email. Please also send the source file (Word or LaTeX document). The paper and the slides have to be sent one week before the presentation date, by Wednesday at 8:00 am. The reports have to be sent one day before the presentation date of the examined talk, by 8:00 am. These deadlines must be strictly adhered to.


We will provide the lecture slides and further material during the course.



Last Modification: 14.11.2011 - Contact Person:

You can send a message to: Prof. Dr.-Ing. Andreas Nürnberger