The workshop took place in Geneva. Proceedings are available in the ACL Anthology, along with those of the other COLING 2004 workshops.
In an ever expanding information society, most information systems are now facing the
"multilingual challenge". Lexical resources play an essential role in these information
systems. Such lexical resources need to provide information on many languages in a
common framework and should be (re)usable in many applications (for automatic
or human use).
Many centers have been involved in national, European and international projects
dedicated to building harmonized language resources and creating expertise in the
maintenance and further development of standardized linguistic data. These resources
include dictionaries, lexicons, thesauri, wordnets, and annotated corpora developed
along the lines of best practices and recommendations. However, since the late 1990s,
most efforts to scale up these resources have remained the responsibility of local
authorities, usually with very little funding (if any) and few opportunities for
academic recognition of this work. Hence, it is not surprising that many
resource holders and developers have become reluctant to give free access to their
latest resources. As a result, the actual status of these language resources is
currently rather unclear.
The goal of this workshop is to study problems involved in the development/management/reuse
of lexical resources in a multilingual context. Moreover, this workshop will provide a forum
for reviewing the state of language resources. The workshop is meant to bring to the
international community qualitative and quantitative information about the most recent
developments in the area of linguistic resources and their use in applications.
As the workshop takes place in Europe, special attention will be given to submissions
on resources for European languages (especially Central and Eastern European languages),
but submission of innovative developments involving any language will be welcome.
The intent of the workshop is not only to review academic developments but also to hear
from industry how these resources are used in new technology developments and what the
current needs are in the field. Presentations, project notes and position papers are
Authors of papers on multilingual linguistic resources are encouraged to apply.
Papers may address, but are not limited to, any of the following topics:
- models for language/application-independent linguistic data
- linguistic data acquisition and maintenance
- building multilingual linguistic resources
- lexical architectures to link data between languages
- use of linguistic resources for applications in a multilingual context
- reuse of existing lexical resources in a specific multilingual application
- human use of lexical resources in a multilingual context
- quality evaluation and standardization
Papers should be original and report on previously unpublished work.
They should emphasize completed rather than intended work.
Papers that have also been submitted to other conferences must
state this fact on the title page.
As the review process will not be blind, papers should contain the names and affiliations of authors.
Submissions must be electronic and should consist of full papers of at most 8 pages (including
references, tables, figures and equations). Project notes and position papers should not exceed
4 pages. Authors should follow the
COLING 2004 formatting guidelines, available at:
All submissions should be made through the MLR2004 workshop web site's submission form (url:
http://mlr2004.imag.fr/). They should
consist of a single PDF file.
Before submitting your document, please verify that all non-standard fonts are correctly
embedded in your PDF document (you can do this by disabling the "Use Local Font" option when
viewing your PDF document with Adobe Reader).
Formatting requirements for the final version of accepted papers will be
posted as soon as they become available.
Deadline for submission of Papers: May 11th 2004
Notification of Acceptance: June 2nd 2004
Deadline for final paper submission: June 16th 2004
Workshop: August 28th 2004
Gilles Sérasset, GETA-CLIPS, IMAG, Université Joseph Fourier - Grenoble I, France
Susan Armstrong, ISSCO, Université de Genève, Switzerland
Christian Boitet, GETA-CLIPS, IMAG, Université Joseph Fourier - Grenoble I, France
Andrei Popescu-Belis, ISSCO, Université de Genève, Switzerland
Dan Tufis, RACAI, University of Bucharest, Romania
Gilles Sérasset GETA CLIPS-IMAG, Université Joseph Fourier - Grenoble I, France (chair)
Susan Armstrong ISSCO, Université de Genève, Switzerland
Pushpak Bhattacharyya IIT, Mumbai, India
Igor Boguslavski IITP, Moscow, Russia
Christian Boitet GETA CLIPS-IMAG, Université Joseph Fourier - Grenoble I, France
Pierrette Bouillon ISSCO, Université de Genève, Switzerland
Jim Breen Monash University, Australia
Nicoletta Calzolari CNR, Pisa, Italy
Dan Cristea University Al.I.Cuza Iasi, Romania
Patrick Drouin OLST, University of Montreal, Canada
Sanae Fujita NTT, Kyoto, Japan
Ulrich Heid IMS-CL, University of Stuttgart, Germany
Hitoshi Isahara CRL, Nara, Japan
Kyo Kageura NII, Tokyo, Japan
Chuah Choy Kim USM, Penang, Malaysia
Mathieu Mangeot NII, Tokyo, Japan
Alain Polguère OLST, University of Montreal, Canada
Andrei Popescu-Belis ISSCO, Université de Genève, Switzerland
Jean Senellart SYSTRAN, France
Mandel Shi Xiamen University, China
Virach Sornlertlamvanich Thai Computational Linguistics Laboratory, CRL, Thailand
Prof. Kumiko Tanaka-Ishii Tokyo University, Japan
Philippe Thoiron CRTT, Université de Lyon 2, France
Dan Tufis RACAI, University of Bucharest, Romania
Michael Zock LIMSI, Orsay, France