Publikationen
ZORA-Abfrage
ZORA Publikationsliste
Download-Optionen
Publikationen
-
Name Consistency in LLM-based Machine Translation of Historical Texts Proceedings of the Machine Translation Summit. Presented at the 20th Machine Translation Summit, Association for Computational Linguistics.
-
LLM-based Translation for Latin: Summaries Improve Machine Translation Proceedings of the Swiss Text Analytics Conference. Presented at the SwissText 2025, Association for Computational Linguistics.
-
LLM-based Translation Across 500 Years. The Case for Early New High German In: 20th Conference on Natural Language Processing (KONVENS 2024), Wien, Österreich, 10 September 2024 - 13 September 2024. Association for Computational Linguistics, 368-375.
-
SwissText 2024 Shared Task: Automatic Classification of the United Nations’ Sustainable Development Goals (SDGs) and Their Targets in English Scientific Abstracts In: Proceedings of the 9th edition of the Swiss Text Analytics Conference, Chur, Switzerland, 10 June 2024 - 11 June 2024. Association for Computational Linguistics, 193-197.
-
Tracing Linguistic Footprints of ChatGPT Across Tasks, Domains and Personas in English and German In: The 9th edition of the Swiss Text Analytics Conference, Chur, Switzerland, 10 Juni 2024 - 11 Juni 2024. Association for Computational Linguistics, 102-112.
-
SDG Classification Using Instruction-Tuned LLMs In: Proceedings of the 9th edition of the Swiss Text Analytics Conference, Chur, Switzerland, 10 Juni 2024 - 11 Juni 2024. Association for Computational Linguistics, 148-156.
-
LLM-based Machine Translation and Summarization for Latin In: Third Workshop on Language Technologies for Historical and Ancient Languages -- LT4HALA (at LREC/COLING), Torino, 25 May 2024.
-
Multilingual Workflows in Bullinger Digital: Data Curation for Latin and Early New High German Journal of Open Humanities Data, 10(12):12.
-
Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms In: The 24th Nordic Conference on Computational Linguistics (NoDaLiDa), Tórshavn, Faroe Islands, 23 May 2023 - 25 May 2023. University of Tartu Library, 215-227.
-
Bullingers Briefwechsel zugänglich machen: Stand der Handschriftenerkennung In: DHd 2023, Trier, 13 March 2023 - 17 March 2023.
-
European Language Equality - Report on the German Language Berlin, Germany: European Language Equality (ELE).
-
Lessons Learnt from Bullinger Digital In: Open Up Digital Editions Conference 2024, Zurich, 24 January 2024 - 26 January 2024. Center Digital Editions & Edition Analytics (University Library Zurich) and Research and Infrastructure Support RISE (University of Basel), 75-76.
-
The Adaptability of a Transformer-Based OCR Model for Historical Documents In: Document Analysis and Recognition -- ICDAR 2023 Workshops, San José, 2023. Springer, 34-48.
-
Language Report German In: Rehm, Georg; Way, Andy . European Language Equality: A Strategic Agenda for Digital Language Equality. Cham: Springer International Publishing, 147-150.
-
Binomials in Swedish corpora – ‘Ordpar 1965’ revisited In: Volodina, Elena; Dannélls, Dana; Berdicevskis, Aleksandrs; Forsberg, Markus; Virk, Shafqat . Live and Learn : Festschrift in honor of Lars Borin. Göteborg: Department of Swedish, Multilingualism and Language Technology, University of Gothenburg, 139-144.
-
Transformer-based HTR for Historical Documents In: Workshop on Computational Methods in the Humanities 2022, Lausanne, 9 Juni 2022 - 10 Juni 2022.
-
Nunc profana tractemus. Detecting Code-Switching in a Large Corpus of 16th Century Letters In: Proceedings of LREC-2022, Marseille, 21 June 2022 - 26 June 2022, LREC.
-
Evaluation of HTR models without Ground Truth Material In: LREC 2022, Marseille, 21 June 2022 - 23 June 2022, European Language Resources Association.
-
A Multilingual Simplified Language News Corpus In: 2nd Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI) within the 13th Language Resources and Evaluation Conference, Marseille, France, 24 June 2022. European Language Resources Association, 25-30.
-
Improving Specificity in Review Response Generation with Data-Driven Data Filtering In: Proceedings of the Fifth Workshop on e-Commerce and NLP (ECNLP 5), Dublin, Ireland, 26 May 2022, Association for Computational Linguistics.
-
Ein Briefwechsel-Korpus des 16. Jahrhunderts in Frühneuhochdeutsch In: Kupietz, Marc; Schmidt, Thomas . Neue Entwicklungen in der Korpuslandschaft der Germanistik. Tübingen: Narr Francke Attempto GmbH + Co. KG, 33-42.
-
Machine Translation of 16th Century Letters from Latin to German In: Second Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2022), Marseille, 25 Juni 2022. LREC, 43-50.
-
WikiFlash: Generating Flashcards from Wikipedia Articles In: Mantoro, T; et al . Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science. Cham: Springer, 138-149.
-
Binomial adverbs in Germanic and Romance Languages : A corpus-based study In: Lavid-López, Julia; Maíz-Arévalo, Carmen; Zamorano-Mansilla, Juan Rafael . Corpora in Translation and Contrastive Research in the Digital Age : Recent advances and explorations. Amsterdam: John Benjamins, 326-342.
-
Harmonization Sometimes Harms In: Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), Winterthur, 23 June 2020 - 25 June 2020, swisstext-and-konvens-2020.
-
Ranking Georeferences for Efficient Crowdsourcing of Toponym Annotations in a Historical Corpus of Alpine Texts In: 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), Zurich, 23 June 2020 - 25 June 2020. CEUR-WS, online.
-
Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS) Edited by: Ebling, Sarah; Tuggener, Don; Hürlimann, Manuela; Cieliebak, M; Volk, Martin (2020). Zurich: CEUR-ws.
-
How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR In: Proceedings of the 12th Language Resources and Evaluation Conference, Marseille, 1 May 2020 - 2 May 2020. ACL Anthology, 3551-3559.
-
Historical Newspaper Content Mining: Revisiting the impresso Project's Challenges in Text and Image Processing, Design and Historical Scholarship In: Digital Humanities 2020, Ottawa, 22 July 2020 - 2020.
-
Benchmarking Data-driven Automatic Text Simplification for German In: Gala, Nuria; Wilkens, Rodrigo . Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI). Marseille: European Language Resources Association, 41-48.
-
An Empirical Analysis of Linguistic, Typographic, and Structural Features in Simplified German Texts In: Sixth Italian Conference on Computational Linguistics (CLiC-it 2019), Bari, 13 November 2019 - 15 November 2019, CLiC-it.
-
Geotagging a diachronic corpus of alpine texts: comparing distinct approaches to toponym recognition In: RANLP 2019, Workshop on Language technology for digital historical archives with a special focus on Central-, (South-)Eastern Europe, Middle East and North Africa, Varna, Bulgaria, 5 September 2019. RANLP, 11-18.
-
Post-editing Productivity with Neural Machine Translation: An Empirical Assessment of Speed and Quality in the Banking and Finance Domain In: Machine Translation Summit XVII, Dublin, Ireland, 19 August 2019 - 23 August 2019, 267-272.
-
Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 Juli 2019 - 22 Juli 2019.
-
Has Machine Translation Achieved Human Parity? A Case for Document-level Evaluation In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October 2018 - 4 November 2018. Association for Computational Linguistics, 4791-4796.
-
Cutter – a Universal Multilingual Tokenizer In: Swiss Text Analytics Conference, Winterthur, 12 June 2018 - 13 June 2018. CEUR-WS, 75-81.
-
mtrain: A Convenience Tool for Machine Translation In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation, Alacant, Spain, 28 May 2018 - 30 May 2018. Universitat d'Alacant, 357.
-
Parallel Corpora, Terminology Extraction and Machine Translation In: 16. DTT-Symposion. Terminologie und Text(e), Mannheim, 22 March 2018 - 24 March 2018, 3-14.
-
A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish In: Mitkov, Ruslan; Monti, Johanna; Corpas Pastor, Gloria; Seretan, Violeta . Multiword Units in Machine Translation and Translation Technology. Amsterdam: John Benjamins, 125-145.
-
Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus Journal for Language Technology and Computational Linguistics, 33(1):25-47.
-
Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanitie In: Teaching NLP for Digital Humanitie, Berlin, 12 September 2017 - 12 September 2017, 17-22.
-
Multilingwis2 – Explore Your Parallel Corpus In: Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, Gothenburg, Sweden, 22 May 2017 - 24 May 2017. Linköping University Electronic Press, Linköpings universitet, 247-250.
-
Building a Corpus of Multi-lingual and Multi-format International Investment Agreements In: 29th International Conference on Legal Knowledge and Information Systems (Jurix), Nice, December 2016. I O S Press, 203-206.
-
Crossing Sentence Boundaries in Statistical Machine Translation In: MultiLingual, December 2016, p.50-52.
-
Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs In: KONVENS 2016, Bochum, 19 September 2016 - 21 September 2016.
-
Rule-based Automatic Text Simplification for German In: 13th Conference on Natural Language Processing (KONVENS 2016), Bochum, Germany, 19 September 2016 - 21 September 2016, s.n..
-
Building a Parallel Corpus on the World's Oldest Banking Magazine In: KONVENS, Bochum, 19 September 2016 - 21 September 2016, s.n..
-
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23.
-
Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, 23 May 2016 - 28 May 2016. European Language Resources Association (ELRA), 975-982.
-
Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora In: Corpas Pastor, Gloria . Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües. Geneva: Tradulex, n/a.
-
Detecting Document-level Context Triggers to Resolve Translation Ambiguity In: Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon, 17 September 2015, s.n..
-
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German In: ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, 28 July 2015.
-
Pre-reordering for Statistical Machine Translation of Non-fictional Subtitles In: The 18th Annual Conference of the European Association for Machine Translation (EAMT 2015), Antalya, Turkey, May 2015, s.n..
-
Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora In: Gintare, Grigonyte; Clematide, Simon; Utka, Andrius; Volk, Martin . Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania. Linköping, Sweden: Linköping University Electronic Press, Linköpings universitet, 6-16.
-
Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania Edited by: Grigonyte, Gintare; Clematide, Simon; Utka, Andrius; Volk, Martin (2015). Linköping: Linköping University Electronic Press, Linköpings universitet.
-
Detecting Code-Switching in a Multilingual Alpine Heritage Corpus In: Proceedings of the First Workshop on Computational Approaches to Code Switching, Doha, Qatar, 25 October 2014. Association for Computational Linguistics, 24-33.
-
Cleaning the Europarl Corpus for Linguistic Applications In: Konvens 2014, Hildesheim, 8 October 2014 - 10 October 2014, Stiftung Universität Hildesheim.
-
Enforcing Consistent Translation of German Compound Coreferences In: Konvens, Hildesheim, Germany, 8 October 2014 - 10 October 2014, s.n..
-
Evaluating the fully automatic multi-language translation of the Swiss avalanche bulletin In: Fourth Workshop on Controlled Natural Language (CNL), Galway, Ireland, 20 August 2014 - 22 August 2014. s.n., 44-54.
-
Innovations in parallel corpus search tools In: Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, 26 May 2014 - 31 May 2014, European Language Resources Association (ELRA).
-
Proceedings of the twelth workshop on treebanks and linguistic theories (TLT 12) Edited by: Kübler, Sandra; Osenova, Petya; Volk, Martin (2013). Sofia: Institute of Information and Communication Technologies. Bulgarian Academy of Sciences.
-
Reconstructing Complete Lemmas for Incomplete German Compounds In: International Conference of the German Society for Computational Linguistics and Language Technology (GSCL), Darmstadt, 25 September 2013 - 27 September 2013, 1-13.
-
Exploiting Synergies Between Open Resources for German Dependency Parsing, POS-tagging, and Morphological Analysis In: Recent Advances in Natural Language Processing (RANLP 2013), Hissar, Bulgaria, 7 September 2013 - 13 September 2013, 601-609.
-
Statistical machine translation for automobile marketing texts In: Proceedings of the XIV Machine Translation Summit, Nice, 2 September 2013 - 6 September 2013, 265-272.
-
Assessing post-editing efficiency in a realistic translation environment In: MT Summit XIV Workshop on Post-editing Technology and Practice, Nice, 2 September 2013 - 2 September 2013, 83-91.
-
Building a German/Simple German Parallel Corpus for Automatic Text Simplification In: The Second Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR 2013), Sofia, Bulgaria, 8 August 2013.
-
Mining for Domain-specific Parallel Text from Wikipedia In: Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 2013 - August 2013, 112-120.
-
Combining statistical machine translation and translation memories with domain adaptation In: NODALIDA 2013, Nordic Conference of Computational Linguistics, Oslo, Norway, 22 Mai 2013 - 24 Mai 2013. Linköping University Electronic Press, 331-341.
-
Statistical machine translation of subtitles: From OpenSubtitles to TED In: Gurevych, Iryna; Biemann, Chris; Zesch, Torsten . Language Processing and Knowledge in the Web. Berlin Heidelberg: Springer, 132-138.
-
Parallel subtitle corpora and their applications in machine translation and translatology Perspectives: Studies in Translatology, 21(4):595-610.
-
Using parallel treebanks for machine translation evaluation In: The 11th International Workshop on Treebanks and Linguistic Theories, Lisbon, Portugal, 30 November 2012 - 1 December 2012, 145-156.
-
Term evolution: use of biomedical terminologies In: AAAI-2012 Fall Symposium on Information Retrieval and Knowledge Discovery in Biomedical Text, Arlington, 3 November 2012 - 5 November 2012. AAAI, 79-80.
-
Change of biomedical domain terminology over time In: Human Language Technologies – The Baltic Perspective (Baltic HLT 2012), Tartu, 4 October 2012 - 5 October 2012. I O S Press, 74-81.
-
From subtitles to parallel corpora In: The 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, 28 May 2012 - 30 May 2012. European Association for Machine Translation, 3-6.
-
Semi-automatic annotation of semantic relations in a Swiss German sign language lexicon In: 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon, LREC 2012, Istanbul, 21 May 2012 - 27 May 2012, 31-36.
-
Towards a Wikipedia-extracted alpine corpus In: The Fifth Workshop on Building and Using Comparable Corpora, Istanbul, Turkey, 26 May 2012 - 26 May 2012.
-
Digging for names in the mountains: Combined person name recognition and reference resolution for German alpine texts In: 5th Language & Technology Conference, Poznan, Poland, 25 November 2011 - 27 November 2011.
-
Word-aligned parallel text : a new resource for contrastive language studies In: Supporting Digital Humanities, Conference 2011, Copenhagen, Denmark, 17 November 2011 - 18 November 2011.
-
From multilingual web-archives to parallel treebanks in five minutes In: Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011, Hamburg, Germany, 28 September 2011 - 30 September 2011, 57-62.
-
From historic books to annotated XML: Building a large multilingual diachronic corpus In: Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011, Hamburg, Germany, 28 September 2011 - 30 September 2011, 75-80.
-
Reducing OCR errors in Gothic-script documents In: 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), Hisar, 16 September 2011, 97-103.
-
Reducing OCR errors in Gothic-script documents ERCIM News, (86):29-30.
-
The Text+Berg corpus: an alpine french-german parallel resource In: TALN 2011, Montpellier, 27 June 2011 - 1 July 2011.
-
Combining semantic and syntactic generalization in example-based machine translation In: EAMT 2011, Leuven, Belgium, 30 May 2011 - 31 May 2011, 209-216.
-
Iterative, MT-based sentence alignment of parallel texts In: NODALIDA 2011, Nordic Conference of Computational Linguistics, Riga, 11 May 2011 - 13 May 2011.
-
Disambiguation of English contractions for machine translation of TV subtitles In: NODALIDA 2011, Nordic Conference of Computational Linguistics, Riga, 11 May 2011 - 13 May 2011.
-
Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora Edited by: Ahrenberg, L; Tiedemann, J; Volk, Martin (2010). Tartu: Northern European Association for Language Technology.
-
Machine translation of TV subtitles for large scale production In: Second Joint EM+/CNGL Workshop, Denver, 4 November 2010, 53-62.
-
MT-based sentence alignment for OCR-generated parallel texts In: The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010), Denver, 31 October 2010 - 4 November 2010.
-
Reducing OCR errors by combining two OCR systems In: ECAI 2010 Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2010), Lisbon, Portugal, 16 August 2010, 61-65.
-
Combining parallel treebanks and geo-tagging In: Fourth Linguistic Annotation Workshop (LAW IV), Uppsala, 15 July 2010 - 16 July 2010.
-
Challenges in building a multilingual alpine heritage corpus In: seventh international conference on Language Resources and Evaluation (LREC), Malta, 19 May 2010 - 21 May 2010.
-
Towards mapping of alpine route descriptions In: GIR'10: 6th Workshop on Geographic Information Retrieval, Zurich, Switzerland, 18 February 2010 - 19 February 2010, 15-16.
-
Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday Edited by: Clematide, S; Klenner, M; Volk, Martin (2009). Münster, Germany: Monsenstein und Vannerdat.
-
A Target-Driven Evaluation of Morphological Components for German In: Clematide, S; Klenner, M; Volk, Martin . Searching Answers -- Festschrift in Honour of Michael Hess on the Occasion of his 60th birthday. Münster: MV-Wissenschaft, 85-99.
-
A New Hybrid Dependency Parser for German In: Chiarcos, Christian; de Castilho, Richard Eckart; Stede, Manfred . Von der Form zur Bedeutung: Texte automatisch verarbeiten / From Form to Meaning: Processing Texts Automatically. Proceedings of the Biennial GSCL Conference 2009. Tübingen: Narr, 115-124.
-
Maschinelle und computergestützte Übersetzung In: Carstensen, K U . Computerlinguistik und Sprachtechnologie. Eine Einführung. Heidelberg: Spektrum, 642-658.
-
Modelling relevance-driven language evolution In: Clematide, S; Klenner, M; Volk, Martin . Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday. Münster, Germany: Monsenstein und und Vannerdat, 49-56.
-
ACE can be described by itself In: Clematide, S; Klenner, M; Volk, Martin . Searching Answers: Festschrift in honour of Michael Hess on the occasion of his 60th birthday. Münster, Germany: Monsenstein und und Vannerdat, 45-48.
-
Classifying named entity in an alpine heritage corpus Künstliche Intelligenz:40-43.
-
Datorerna råpluggar översättning Språktidningen, 5:58-61.
-
Natural language processing in law - change we need In: Clematide, S; Klenner, M; Volk, Martin . Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday. Münster, Germany: Monsenstein und Vannerdat, 11-19.
-
A morpho-syntactic generation service for German glossary entries In: Clematide, S; Klenner, M; Volk, Martin . Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday. Münster, Germany: Monsenstein und Vannerdat, 33-43.
-
A Quechua-Spanish parallel treebank In: 7th Conference on Treebanks and Linguistic Theories, Groningen, 2009.
-
Using linguistic annotations in statistical machine translation of film subtitles In: Nodalida, Odense, 2009.
-
Detecting Protein-Protein Interactions in Biomedical Literature Using a Parser In: Clematide, Simon; Klenner, Manfred; Volk, Martin . Searching Answers. Münster: MV Verlag, 109-118.
-
A framework for constituent-dependency conversion In: 8th Conference on Treebanks and Linguistic Theories, Milano, 2009.
-
How many mountains are there in Switzerland? Explorations of the SwissTopo name list In: Clematide, S; Klenner, M; Volk, Martin . Searching answers: A festschrift for Michael Hess on the occasion of his 60th birthday. Münster: Monsenstein und Vannerdat, 127-140.
-
Extending the TIGER query language with universal quantification In: KONVENS 2008: 9. Konferenz zur Verarbeitung natürlicher Sprache, Berlin, Germany, 30 September 2008 - 2 October 2008, 5-17.
-
Human judgements in parallel treebank alignment In: COLING Workshop on Human Judgements in Computational Linguistics, Manchester, UK, 23 August 2008.
-
Automatic phrase alignment. Using statistical n-gram alignment for syntactic phrase alignment In: 6th Workshop on Treebanks and Linguistic Theories, Bergen, 2007.
-
Frame-Semantic Annotation on a Parallel Treebank In: Nodalida Workshop on Building Frame Semantics Resources for Scandinavian and Baltic Languages, Tartu, 2007.
-
A search tool for parallel treebanks In: ACL Workshop on Linguistic Annotation, Prague, 2007.
-
Alignment tools for parallel treebanks In: GLDV Frühjahrstagung, Tübingen, Germany, 2007.
-
Using the Stockholm TreeAligner In: 6th Workshop on Treebanks and Linguistic Theories, Bergen, 2007.
-
Comparing French PP-attachment to English, German and Swedish In: Nodalida Workshop on Building Frame Semantics Resources for Scandinavian and Baltic Languages, Tartu, 2007.
-
XML-based phrase alignment in parallel treebanks In: EACL Workshop on Multi-dimensional Markup in Natural Language Processing, Trento, 2006.
-
Phrase alignment in parallel treebanks In: 5th Workshop on Treebanks and Linguistic Theories, Prague, 2006.
-
How bad is the problem of PP-attachment? A comparison of English, German and Swedish In: ACL-SIGSEM Workshop on Prepositions, Trento, 2006.
-
Presentation and representation of parallel treebanks In: Treebanking for Discourse and Speech. Proc. Of the {NODALIDA} 2005 Special Session on Treebanks for Spoken Language and Discourse, Joensuu, Finland, 20 May 2005 - 21 May 2005, 147-159.
-
Teaching treebanking In: Holmboe, H . Nordisk Sprogteknologi. Nordic Language Technology. Årbog for Nordisk Sprogteknologisk Forskningsprogram 2000-2004. Copenhagen: Museum Tusculanums Forlag, 143-159.
-
Requirements for a parallel treebank search tool In: GLDV-Conference, Bonn, Germany, 2005.
-
Treebanks: a whitepaper In: Holmboe, H . Nordisk Sprogteknologi. Nordic Language Technology. Årbog for Nordisk Sprogteknologisk Forskningsprogram 2000-2004. Copenhagen: Museum Tusculanums Forlag, ?-?.
-
Enriching an ontology with WordNet based on similarity measures In: MEANING-2005 Workshop, Trento, 2005.
-
Bootstrapping parallel treebanks In: Workshop on Linguistically Interpreted Corpora (LINC) at COLING, Genève, 2004.
-
Evaluation resources for concept-based cross-lingual information retrieval in the medical domain In: LREC-2004, Lisbon, Portugal, 2004.
-
Automatic node insertion for treebank deepening In: Third Workshop on Treebanks and Linguistic Theories (TLT) 2004, Tübingen, Germany, 2004.
-
Semantic relations in concept-based cross-language medical information retrieval In: Workshop on Adaptive Text Extraction and Mining, Cavtat_Dubrovnik, 2003.
-
A cross-language document retrieval system based on semantic annotation In: EACL, Budapest, Hungary, 2003.
-
German prepositions and their kin. A survey with respect to the resolution of PP attachment ambiguities In: Workshop on The Linguistic Dimensions of Prepositions and their Use in Computational Linguistics Formalisms and Applications, Toulouse, 2003.
-
Ontologies in cross-language information retrieval In: 2nd Conference on Professional Knowledge Management, Luzern, 2003.
-
Combining unsupervised and supervised methods for PP attachment disambiguation In: COLING-2002, Taipeh, 2002.
-
The length factor in automatic bilingual terminology extraction In: 6th International Conference on Terminology and Knowledge Engineering, Nancy, 2002.
-
A systematic evaluation of concept-based cross-language information retrieval in the medical domain In: 3rd Dutch-Belgian Information Retrieval Workshop, Leuven, 2002.
-
Using the web as corpus for linguistic research In: Pajusalu, R; Hennoste, T . ähendusepüüdja. Catcher of the Meaning. A Festschrift for Professor Haldur Õim. Tartu: University of Tartu, 1-10.
-
Semantic annotation for concept-based cross-language medical information retrieval International Journal of Medical Informatics, 67(1-3):97-112.
-
Linguistische und semantische Annotation eines Zeitungskorpus In: GLDV-Jahrestagung, Giessen, 28 March 2001 - 30 March 2001, 201-209.
-
Learn-filter-apply-forget. Mixed approaches to named entity recognition In: 6th International Workshop on Applications of Natural Language for Informations Systems, Madrid, Spain, 2001.
-
Exploiting the WWW as a corpus to resolve PP attachment ambiguities In: Corpus Linguistics, Lancaster, 2001.
-
LUIS - Ein natürlichsprachliches, universitäres Informationssystem In: Unternehmen Hochschule, Wien, 2001, 115-126.
-
Evaluating translation quality as input to product development In: Proc. of Second Int. Conference On Language Resources And Evaluation, Athens, 2000.
-
Scaling up: using the WWW to resolve PP attachment ambiguities In: Proc. of Konvens-2000, Ilmenau, 2000.
-
Choosing the right lemma when analysing German nouns In: Multilinguale Corpora: Codierung, Strukturierung, Analyse. 11. Jahrestagung der GLDV, Frankfurt, 1999.
-
Aspects of the translation of English subordinate clauses into German In: Problems and Potential of English-to-German MT systems. Workshop at the 8th International Conference on Theoretical and Methodological Issues in Machine Translation, Chester, 1999.
-
Comparing a statistical and a rule-based tagger for German In: Proc. of KONVENS-98, Bonn, 1998.
-
Das Evaluieren von Software für die maschinenunterstützte Übersetzung In: Équivalences 97 - Die Akten. Computerwerkzeuge am Übersetzer-Arbeitsplatz: Theorie und Praxis. Bern: ASSTI, 105-117.
-
Zur Problematik der maschinellen Übersetzung von Nebensätzen zwischen den Sprachen Englisch und Deutsch In: Evaluation of the Linguistic Performance of Machine Translation Systems. Proceedings of the Workshop at the KONVENS-98, Bonn, Germany, 1998 - 1998.
-
Markup of a Test Suite with SGML In: Nerbonne, J . Linguistic Databases. Stanford: Center for the Study of Language and Information,, 59-76.
-
Adding manual constraints and lexical look-up to a brill-tagger for German In: ESSLLI-98 Workshop on Recent Advances in Corpus Annotation, Saarbrücken, 1998.
-
Statistische Verfahren zur Zuordnung von Präpositionalphrasen In: KONVENS-98, Bonn, Germany, 1998 - 1998.
-
The automatic translation of idioms. Machine translation vs. translation memory systems In: Weber, Nico . Machine translation: theory, applications, and evaluation. An assessment of the state of the art. St. Augustin: Gardez!-Verlag, 167-192.
-
Experiences with the GTU grammar development environment In: Workshop on Computational Environments For Grammar Development And Linguistic Engineering at the ACL/EACL Joint Conference, Madrid, Spain, 11 July 1997 - 12 July 1997, 107-113.
-
Probing the lexicon in evaluating commercial MT systems In: Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics. Madrid: European Chapter Meeting of the ACL, 112-119.
-
Constraint Logic Programming for Computational Linguistics In: Retoré, C . Logical Aspects of Computational Linguistics. Berlin: Springer, 406-425.
-
Hybride NLP-Systeme und das Problem der PP-Anbindung In: Berichtsband des Workshops, Freiburg, Germany, 1997.
-
Parsing with ID/LP and PS rules In: Natural Language Processing and Speech Technology. Results of the 3rd KONVENS Conference (Bielefeld), Berlin, 1996, 342-353.
-
Die Rolle der Valenz bei der Auflösung von PP-Mehrdeutigkeiten In: Mehl, S . Präpositionalsemantik und PP-Anbindung. Duisburg: Gerhard-Mercator-Universität - GH - Duisburg, Institut für Informatik, 32-38.
-
Einsatz einer Testsatzsammlung im Grammar Engineering 1995, University of Zurich, Faculty of Arts.
-
GTU - A workbench for the development of natural language grammars In: Conference on Practical Applications of Prolog, Paris, France, 1995 - 1995.
-
GTU - Eine Grammatik-Testumgebung In: KONVENS-94, Vienna, Austria, 28 September 1994 - 30 September 1994, 427-430.