Home
This conference builds on a series of annual workshops and conferences on statistical machine
translation, going back to 2006:
- the NAACL-2006 Workshop on
Statistical Machine Translation,
- the ACL-2007 Workshop on Statistical
Machine Translation,
- the ACL-2008
Workshop on Statistical Machine Translation,
- the EACL-2009 Workshop on
Statistical Machine Translation,
- the ACL-2010 Workshop on
Statistical Machine Translation
- the EMNLP-2011 Workshop on
Statistical Machine Translation,
- the NAACL-2012 Workshop on
Statistical Machine Translation,
- the ACL-2013 Workshop on
Statistical Machine Translation,
- the ACL-2014 Workshop on
Statistical Machine Translation,
- the EMNLP-2015 Workshop on
Statistical Machine Translation,
- the First Conference on
Machine Translation (at ACL-2016),
- the Second Conference on
Machine Translation (at EMNLP-2017),
- the Third Conference on
Machine Translation (at EMNLP-2018).
- the Fourth Conference on
Machine Translation (at ACL-2019).
- the Fifth Conference on
Machine Translation (at EMNLP-2020).
- the Sixth Conference on
Machine Translation (at EMNLP-2021).
IMPORTANT DATES
Release of training data for shared tasks | February/March, 2022 |
Evaluation periods for shared tasks | June/July, 2022 |
Paper submission deadline | 7th September, 2022 |
Paper notification | 11th October, 2022 |
Camera-ready version due | 20th October, 2022 |
Conference | 7th - 8th December, 2022 |
All deadlines are in AoE (Anywhere on Earth). Dates are specified with respect to EMNLP 2022.
OVERVIEW
This year's conference will feature the following shared tasks:
- a general MT translation task (former News task),
- a biomedical translation task,
- a code-mixing translation task,
- an unsupervised and very low resource translation task,
- an automatic post-editing task,
- a sign language translation task,
- a word-level autocompletion task,
- a metrics task (assess MT quality with or without reference translation),
- a quality estimation task (assess MT quality without access to any reference),
- an MT efficiency task,
- a translation suggestion task (generate the suggestions for incorrect spans of the MT sentence),
- a chat translation task,
- a large-scale multilingual MT task,
- an unsupervised and very low resource supervised translation task
In addition to the shared tasks, the conference will also feature scientific papers on topics related to MT.
Topics of interest include, but are not limited to:
- MT models (neural, statistical etc. )
- analysis of neural models for MT
- using comparable corpora for MT
- selection and preparation of data for MT
- semi-supervised and unsupervised learning for MT, transfer learning
- multilingual MT
- incorporating linguistic information into MT
- MT inference
- manual and automatic methods for evaluating MT
- quality estimation for MT
We encourage authors to evaluate their approaches to the above topics
using the common data sets created for the shared tasks.
REGISTRATION AND VISA INFORMATION
These will both be handled by EMNLP 2022.
SHARED TASKS
WMT has a number of MT-related shared tasks. Please consult the links at the top of the page for more details.
PAPER SUBMISSION INFORMATION
WMT accepts two types of submissions: research papers and system papers. Both
types of papers are submitted electronically, have
the same deadlines, and should
follow EMNLP2022 formatting guidelines.
WMT will participate in the ACL Rolling Review.
Any ARR-reviewed paper that received all of its reviews and
meta-reviews available by October 1, 2022 can be committed to WMT and will be considered for
publication at the conference.
Research Papers
Research papers should describe original research corresponding to the categories listed above.
Research papers that have been or will be submitted to other meetings or publications must indicate this at submission time, and must be withdrawn from the other venues if accepted and published at WMT 2022.
We will not accept for publication papers that overlap significantly in content or results with papers that have been or will be published elsewhere.
It is acceptable to submit work that has been made available as a technical report (or similar, e.g. in arXiv) without citing it.
For the research track, papers should be anonymised, be between 6 and 10 pages in length (excluding references) and may
include supplementary material.
We encourage individuals who are submitting research papers to evaluate
their approaches using the training resources provided by this conference
and past workshops, so that their experiments can be repeated by others
using these publicly available corpora.
System Papers
System papers must describe one or more shared task submissions. System paper submissions that we cannot link to a shared
task submission will be rejected without review. System papers can overlap with other published work, and do not have to follow the double
submission policy. There is no maximum length for system papers, but normally a short paper (4-6 pages) is appropriate. System
papers should not be anonymised.
PRESENTATION AND POSTER FORMAT
System description papers will be presentated as posters. For the in-person poster session, the poster panels are 3.28’ (1m wide) x 8.20’ (2.5m tall). We recommend to have posters that are portrait in orientation.
ANNOUNCEMENTS
If you are participating in any task, please subscribe to the Google groups
mailing list for updates. You can also
consult the list archives on this page.
Note that there have been isues with subscribing to the mailing list in recent years. If you are unable to subscribe, or
if your subscription is not approved, please contact Barry Haddow (address at the foot of the page).
INVITED TALK
Invited Talk by Ondrej Bojar on "Speech Translation: When Two Superhuman Technologies Combined Fail" (video, demo videos)
ORGANIZERS
Loïc Barrault (University of Sheffield)
Rachel Bawden (Inria)
Ondřej Bojar (Charles University)
Fethi Bougares (University of Le Mans)
Rajen Chatterjee (Apple)
Marta R. Costa-jussà (Universitat Politècnica de Catalunya
Anton Dvorkovich (Neurodub)
Christian Federmann (Microsoft)
Mark Fishel (University of Tartu)
Alexander Fraser (LMU Munich)
Markus Freitag (Google)
Yvette Graham (DCU)
Roman Grundkiewicz (Microsoft)
Paco Guzman (Facebook)
Barry Haddow (University of Edinburgh)
Matthias Huck (LMU Munich)
Antonio Jimeno Yepes (IBM Research Australia)
Rebecca Knowles (NRC)
Tom Kocmi (Microsoft)
Philipp Koehn (University of Edinburgh / Johns Hopkins University)
André Martins (Unbabel)
Christof Monz (University of Amsterdam)
Makoto Morishita (NTT)
Masaaki Nagata (NTT)
Toshiaki Nakazawa (University of Tokyo)
Matteo Negri (FBK)
Aurélie Névéol (LIMSI, CNRS)
Mariana Neves (German Federal Institute for Risk Assessment)
Martin Popel (Charles University)
Matt Post (Johns Hopkins University)
Mariya Shmatova (Neurodub )
Marco Turchi (FBK)
Marcos Zampieri (Rochester Institute of Technology)
PROGRAM COMMITTEE
TBC
ANTI-HARASSMENT POLICY
WMT follows the ACL's anti-harassment policy
CONTACT
For general questions, comments, etc. please send email
to phi@jhu.edu.
For task-specific questions, please contact the relevant organisers.