FAUST - Feedback Analysis for User adaptive Statistical Translation

FP7-ICT-2009-4

Publications and Presentations

Journal of Machine Translation

  • N-gram posterior probability confidence measures for statistical machine translation: an empirical study. A. de Gispert, G. Blackwood, G. Iglesias, and W. Byrne. Machine Translation. 2012. http://dx.doi.org/10.1007/s10590-012-9132-2

Prague Bulletin of Mathematical Linguistics

FSMNLP 2012

  • Lattice-based minimum error rate training using weighted finite-state transducers with tropical polynomial weights. A. Waite, G. Blackwood, and W. Byrne. In Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing (FSMNLP 2012), Donostia-San Sebastian, Spain, July 2012. http://aclweb.org/anthology-new/W/W12/W12-6219.pdf

EACL 2012

  • Syntax-Based Word Ordering Incorporating a Large-Scale Language Model. Yue Zhang, Graeme Blackwood and Stephen Clark. Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL-12), pp.736-746, Avignon, France, 2012. [PDF]

COLING 2012

  • Dealing with Input Noise in Statistical Machine Translation. Lluƒ­s Formiga and Josƒ© A.R. Rodrƒ­guez Fonollosa. (Also as TALP Technical Report [PDF])

AMTA 2012

  • A Graph-based Strategy to Streamline Translation Quality Assessments. Daniele Pighin, Lluƒ­s Formiga, Lluƒ­s Mƒ rquez. [PDF]

AMTA MONOMT 2012

  • Improving English to Spanish Out-of-Domain Translations by Morphology Generalization and Generation. Lluƒ­s Formiga, Adolfo Hernandez, Jose B. Marino, and Enric Monte. [PDF (Page 6)]

ACL 2012

  • A Graphical Interface for MT Evaluation and Error Analysis. Meritxell Gonzƒ¡lez and Jesƒºs Gimƒ©nez and Lluƒ­s Mƒ rquez. [PDF]

SSST-6 (2012)

  • Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. Rudolf Rosa, Ond™ej Dušek, David Mareček, and Martin Popel [PDF]

TSD 2012

  • Dependency Relations Labeller for Czech. Rudolf Rosa, David Marecek, and Martin Popel [PDF]

WMT 2012

  • Formemes in English-Czech Deep Syntactic MT. Ondrej Dušek, Zdenek ‚Žabokrtsky, Martin Popel, Martin Majliš, Michal Novƒ¡k, and David Marec‚ek. [PDF]
  • DEPFIX: A System for Automatic Correction of Czech MT Outputs. Rudolf Rosa, David Marec‚ek, Ondrej Dušek. [PDF]
  • The UPC Submission to the WMT 2012 Shared Task on Quality Estimation. Daniele Pighin, Meritxell Gonzƒ¡lez and Lluƒ­s Mƒ rquez
    [PDF]
  • The TALP-UPC phrase-based translation systems for WMT12: Morphology simplification and domain adaptation.Lluis Formiga; Carlos A. Henrƒ¬quez Q.; Adolfo Hernƒ¡ndez; Josƒ¨ B. Mariƒ±o; Enric Monte; Josƒ¨ A. R. Fonollosa
    [PDF]

LREC 2012

  • An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output. Daniele Pighin, Lluƒ­s Mƒ rquez and Jonathan May [PDF]
  • The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output Daniele Pighin, Lluƒ­s Mƒ rquez and Lluƒ­s Formiga [PDF
  • The Joy of Parallelism with CzEng. Ond…™ej Bojar, Zden„›k ‚Žabokrtskƒ½, Ond…™ej Dušek, Petra Galuš„ƒ¡kovƒ¡, Martin Majliš, David Mare„ek, Ji…™ƒ­ Maršƒ­k, Michal Novƒ¡k, Martin Popel and Aleš Tamchyna PDF

Workshop on Iberian Cross-Language NLP Tasks

  • Pivot strategies as an alternative for statistical machine translation tasks involving Iberian languages Carlos A. Henriquez Q., Marta R. Costa-Jussƒ , Rafael Banchs, Lluis Formiga and Josƒ© B. Mariƒ±o. Workshop on Iberian Cross-Language NLP tasks. September 2011. Huelva, Spain [PDF]

EMNLP 2011

  • Hierarchical Phrase-based Translation Representations. G. Iglesias, A. Allauzen, W. Byrne, A. de Gispert and M. Riley. Edinburgh, Scotland, July 2011 [PDF]
  • Syntax-Based Grammaticality Improvement using CCG and Guided Search.Yue Zhang and Stephen Clark. [PDF]
  • Corroborating Text Evaluation Results with Heterogeneous Measures. Enrique Amigƒƒ‚³, Julio Gonzalo, Jesƒƒ‚ºs Gimƒƒ‚©nez and Felisa Verdejo [PDF][bib]

EAMT 2011

  • Hierarchical Phrase-based Translation with Weighted Finite-State Transducers. Presented by EAMT 2010 Best Phd awardee Gonzalo Iglesias at the conference.
  • Deriving translations units using small additional corpora. Carlos A. Henriquez Q., Josƒƒ‚© B. Mariƒƒ‚±o and Rafael E. Banchs. Proceedings of the 15th Annual Conference of the European Association for Machine Tranlation, 121--128, May, 2011. Leuven, Belgium.

WMT 2011

  • Two-step translation with grammatical post-processing . David Mareƒƒ„ƒ‚‚ekek, Rudolf Rosa, Ondƒƒ…™ej Bojar and Petra Galušƒƒ„ƒ‚‚ƒƒ‚¡kovƒƒ‚¡. Will be presented at WMT11. [PDF]
  • Influence of Parser Choice on Dependency-Based MT. Martin Popel, David Mareƒƒ„ƒ‚‚ek, Nathan Green and Zdenƒƒ„›k ƒƒ…ƒ‚‚½abokrtskƒƒ‚½. Will be presented at WMT11. [PDF]

ACL 2011

  • Shift-reduce CCG Parsing [PDF]
    Yue Zhang and Stephen Clark
  • Transition-based Dependency Parsing with Rich Non-local Features [PDF]
    Yue Zhang and Joakim Nivre
  • Automatic Projection of Semantic Structures: an Application to Pairwise Translation Ranking. Daniele Pighin and Lluƒƒ‚­s Mƒƒ rquez. Proceedings of SSST-5, pages 1-9. [PDF],
Journal of Machine Translation
  • Linguistic Measures for Automatic Machine Translation Evaluation. Jesƒƒ‚ºs Gimƒƒ‚©nez and Lluƒƒ‚­s Mƒƒ rquez. Machine Translation, 2010,Volume 24, Numbers 3-4, pages 209-240.[PDF]
The Prague Bulletin of Mathematical Linguistics

CICLING 2011

  • Combining Diverse Word-Alignment Symmetrizations Improves Dependency Tree Projection.David Marecek. [PDF]

FALA2010

  • Multiclass Classification for Morphology Generation in Statistical Machine Translation. Adolfo Hernƒƒ‚¡ndez Huerta, Enric Monte Moreno, Josƒƒ‚© B. Mariƒƒ‚±o Acebal. FALA 2010 "VI Jornadas en Tecnologƒƒ‚­a del Habla" and II Iberian SLTech Workshop, pages 179-182, Vigo, Spain, 10-12 November, 2010.

IWSLT2010

  • UPC-BMIC-VDU system description for the IWSLT 2010: testing several collocation segmentations in a phrase-based SMT system. Carlos A. Henrƒƒ‚­quez Q., Marta R. Costa-jussƒƒ , Vidas Daudaravicius, Rafael E. Banchs and Josƒƒ‚© B. Mariƒƒ‚±o. Proceedings of the International Workshop on Spoken Language Translation, 189--195, December, 2010

Softissimo Initial Analysis of Translation Feedback

Baseline System Descriptions for WMT 2010

  • [UCAM] The CUED HiFST System for the WMT10 Translation Shared Task [PDF] [bib]
    Juan Pino, Gonzalo Iglesias, Adriƒƒ de Gispert, Graeme Blackwood, Jamie Brunning and William Byrne
  • [UPC] Using Collocation Segmentation to Augment the Phrase Table [PDF] [bib]
    Carlos A. Henrƒƒ‚­quez Q., Marta Ruiz Costa-jussƒƒ , Vidas Daudaravicius, Rafael E. Banchs and Josƒƒ‚© B. Mariƒƒ‚±o
  • [CUNI] Maximum Entropy Translation Model in Dependency-Based MT Framework [PDF] [bib]
    Zdenek Zabokrtsky, Martin Popel and David Marecek
  • [CUNI] 2010 Failures in English-Czech Phrase-based MT [PDF] [bib]
    Ondrej Bojar and Kamil Kos

Baseline System Descriptions for Metrics MaTr Challenge at WMT 2010

  • [UPC] Document-Level Automatic MT Evaluation based on Discourse Representations. [PDF] [bib]
    Elisabet Comelles, Jesus Gimenez, Lluis Marquez, Irene Castellon and Victoria Arranz.


COLING 2010

  • [UCAM] Chart Pruning for Fast Lexicalised-Grammar Parsing. [PDF]
    Yue Zhang, Byung-Gyu Ahn, Stephen Clark, Curt Van Wyk, James R. Curran and Laura Rimell

ACL 2010

  • Data Sparseness in Machine Translation Evaluation
    Ondrej Bojar, Kamil Kos, and David Marecek

EAMT 2010

  • FAUST: Feedback Analysis for User adaptive Statistical Translation – First Year Progress.
    project overview poster
  • Robust Estimation of Feature Weights in Statistical Machine Translation.
    Cristina Espaƒƒ‚±a-Bonet and Lluƒƒ‚­s Mƒƒ rquez

EMNLP 2010

  • A fast decoder for joint segmentation and POS tagging. [PDF]
    Yue Zhang and Stephen Clark

Language Technology Days : 22-23 March 2010, Luxembourg