Unsupervised segmentation of words
into morphemes -- Challenge 2005
Part of the EU Network of Excellence PASCAL Challenge Program. Participation is open to all.

The objective of the Challenge is to design a statistical machine learning algorithm that segments words into the smallest meaning-bearing units of language, morphemes. Ideally, these are basic vocabulary units suitable for different tasks, such as text understanding, machine translation, information retrieval, and statistical language modeling.

The scientific goals are
to learn of the phenomena underlying word construction in natural languages; to discover approaches suitable for a wide range of languages; and to advance machine learning methodology.

The results will be presented in a workshop arranged in connection with other PASCAL challenges on machine learning. Please read the rules and see the schedule. The datasets are available for download. Instructions on how to submit your camera-ready documents are given on the Workshop page.

We are looking forward to an interesting competition!

Mikko Kurimo, Mathias Creutz and Krista Lagus
Neural Networks Research Centre, Helsinki University of Technology
The organizers

Program committee
Levent Arslan, Boğaziçi University
Samy Bengio, IDIAP
Tolga Cilogu, Middle-East Technical University
John Goldsmith, University of Chicago
Kadri Hacioglu, Colorado University
Chun Yu Kit, City University of Hong Kong
Dietrich Klakow, Saarland University
Jan Nouza, Technical University of Liberec
Erkki Oja, Helsinki University of Technology
Richard Wicentowski, Swarthmore College
Murat Saraclar, Boğaziçi University

References
Mathias Creutz and Krista Lagus (2005). Unsupervised Morpheme Segmentation and Morphology Induction from Text Corpora Using Morfessor 1.0. Publications in Computer and Information Science, Report A81, Helsinki University of Technology, March.

Teemu Hirsimäki, Mathias Creutz, Vesa Siivola, Mikko Kurimo, Janne Pylkkönen, and Sami Virpioja (2005). Unlimited vocabulary speech recognition with morph language models applied to Finnish. Preprint accepted for publication in Computer Speech and Language.
