- Fokusthemen: Digitale Linguistik, Historische Linguistik, Maschinelle Sprachverarbeitung und Maschinelles Lernen
- Mitglied in UZH-Verbünden: NCCR Evolving Language, UFSP Sprache und Raum (SpuR)
Publikationen
Eigene Publikationsliste und ZORA-Abfrage
ZORA Publikationsliste
Download-Optionen
Publikationen
-
Detecting and Mapping Hate in Religious Contexts In T. Schlag & K. Yadav (Eds.), Religious Communication, Interaction and Transformation in a Culture of Digitality : Insights into the Zurich University Research Priority Program “Digital Religion(s)” (pp. 153–183). De Gruyter. https://doi.org/10.1515/9783111721729
-
Name Consistency in LLM-based Machine Translation of Historical Texts (P. Bouillon, J. Gerlach, S. Girletti, L. Volkart, R. Rubino, R. Sennrich, A. C. Farinha, M. Gaido, J. Daems, H. Moniz, & S. Szoc, Eds.; XX; pp. 204–219). Association for Computational Linguistics. https://aclanthology.org/2025.mtsummit-1.16/
-
LLM-based Translation for Latin: Summaries Improve Machine Translation (J. Gerber, M. Cieliebak, D. Tuggener, & M. Hürlimann, Eds.; No. 10; pp. 75–80). Association for Computational Linguistics. https://aclanthology.org/2025.swisstext-1.7/
-
LLM-based Translation Across 500 Years. The Case for Early New High German Proceedings of the Conference on Natural Language Processing, 368–375. https://aclanthology.org/2024.konvens-main.37.pdf
-
SwissText 2024 Shared Task: Automatic Classification of the United Nations’ Sustainable Development Goals (SDGs) and Their Targets in English Scientific Abstracts 193–197. https://aclanthology.org/2024.swisstext-1.45/
-
Tracing Linguistic Footprints of ChatGPT Across Tasks, Domains and Personas in English and German (C. Capol, M. Cieliebak, A. Weichselbra, C. Musat, & L. Zimmerman, Eds.; pp. 102–112). Association for Computational Linguistics. https://aclanthology.org/2024.swisstext-1.9/
-
SDG Classification Using Instruction-Tuned LLMs 148–156. https://aclanthology.org/2024.swisstext-1.13/
-
LLM-based Machine Translation and Summarization for Latin (R. Sprugnoli & M. Passarotti, Eds.).
-
Multilingual Workflows in Bullinger Digital: Data Curation for Latin and Early New High German Journal of Open Humanities Data, 10, 12. https://doi.org/10.5334/johd.174
-
Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms 215–227. https://aclanthology.org/2023.nodalida-1.22
-
Bullingers Briefwechsel zugänglich machen: Stand der Handschriftenerkennung DHd 2023, Trier. https://doi.org/10.5281/zenodo.7688632
-
Language Report German In G. Rehm & A. Way (Eds.), European Language Equality: A Strategic Agenda for Digital Language Equality (pp. 147–150). Springer International Publishing. https://doi.org/10.1007/978-3-031-28819-7_18
-
The Adaptability of a Transformer-Based OCR Model for Historical Documents Lecture Notes in Computer Science, 34–48. https://doi.org/10.1007/978-3-031-41498-5_3
-
European Language Equality - Report on the German Language (M. Giagkou, S. Piperidis, G. Rehm, & J. Dunne, Eds.). European Language Equality (ELE). https://european-language-equality.eu/wp-content/uploads/2022/03/ELE___Deliverable_D1_16__Language_Report_German_.pdf
-
Lessons Learnt from Bullinger Digital 75–76. https://doi.org/10.5281/zenodo.10400571
-
Binomials in Swedish corpora – ‘Ordpar 1965’ revisited In E. Volodina, D. Dannélls, A. Berdicevskis, M. Forsberg, & S. Virk (Eds.), Live and Learn : Festschrift in honor of Lars Borin (pp. 139–144). Department of Swedish, Multilingualism and Language Technology, University of Gothenburg.
-
Transformer-based HTR for Historical Documents Workshop on Computational Methods in the Humanities 2022, Lausanne. https://wp.unil.ch/llist/files/2022/06/COMHUM_2022_paper_6.pdf
-
Nunc profana tractemus. Detecting Code-Switching in a Large Corpus of 16th Century Letters Proceedings of LREC-2022, Marseille. https://lrec2022.lrec-conf.org/en/
-
Evaluation of HTR models without Ground Truth Material LREC 2022, Marseille. http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.467.pdf
-
A Multilingual Simplified Language News Corpus 25–30. https://aclanthology.org/2022.readi-1.4
-
Improving Specificity in Review Response Generation with Data-Driven Data Filtering Proceedings of the Fifth Workshop on e-Commerce and NLP (ECNLP 5), Dublin. https://doi.org/10.18653/v1/2022.ecnlp-1.15
-
Ein Briefwechsel-Korpus des 16. Jahrhunderts in Frühneuhochdeutsch In M. Kupietz & T. Schmidt (Eds.), Neue Entwicklungen in der Korpuslandschaft der Germanistik (pp. 33–42). Narr Francke Attempto GmbH + Co. KG. https://doi.org/10.24053/9783823396024
-
Machine Translation of 16th Century Letters from Latin to German 43–50. http://www.lrec-conf.org/proceedings/lrec2022/workshops/LT4HALA/pdf/2022.lt4hala2022-1.7.pdf
-
WikiFlash: Generating Flashcards from Wikipedia Articles In T. Mantoro & et al (Eds.), Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science (vol 13111; pp. 138–149). Springer. https://doi.org/10.1007/978-3-030-92273-3_12
-
Binomial adverbs in Germanic and Romance Languages : A corpus-based study. In J. Lavid-López, C. Maíz-Arévalo, & J. R. Zamorano-Mansilla (Eds.), Corpora in Translation and Contrastive Research in the Digital Age : Recent advances and explorations (pp. 326–342). John Benjamins. https://doi.org/10.1075/btl.158.13gra
-
Harmonization Sometimes Harms (S. Ebling, D. Tuggener, M. Hürlimann, & M. Volk, Eds.). swisstext-and-konvens-2020. http://swisstext-and-konvens-2020.org
-
Ranking Georeferences for Efficient Crowdsourcing of Toponym Annotations in a Historical Corpus of Alpine Texts CEUR Workshop Proceedings, online. http://ceur-ws.org/Vol-2624/paper11.pdf
-
Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS) : Vol. Vol-2624. CEUR-ws. http://ceur-ws.org/Vol-2624/
-
How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR 3551–3559. https://www.aclweb.org/anthology/2020.lrec-1.436
-
Benchmarking Data-driven Automatic Text Simplification for German In N. Gala & R. Wilkens (Eds.), Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI) (pp. 41–48). European Language Resources Association. https://www.aclweb.org/anthology/2020.readi-1.7
-
An Empirical Analysis of Linguistic, Typographic, and Structural Features in Simplified German Texts Sixth Italian Conference on Computational Linguistics (CLiC-it 2019), Bari. http://ceur-ws.org/Vol-2481/paper4.pdf
-
Geotagging a diachronic corpus of alpine texts: comparing distinct approaches to toponym recognition 11–18. https://doi.org/10.26615/978-954-452-059-5_003
-
Post-editing Productivity with Neural Machine Translation: An Empirical Assessment of Speed and Quality in the Banking and Finance Domain 267–272. https://www.aclweb.org/anthology/W19-6626
-
Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection (P. Bański, A. Barbaresi, H. Biber, E. Breiteneder, S. Clematide, M. Kupietz, H. Lüngen, & C. Iliadi, Eds.). Leibniz-Institut für Deutsche Sprache. https://doi.org/10.14618/ids-pub-9020
-
Has Machine Translation Achieved Human Parity? A Case for Document-level Evaluation 4791–4796. http://www.aclweb.org/anthology/D18-1512
-
Cutter – a Universal Multilingual Tokenizer In M. Cieliebak, D. Tuggener, & F. Benites (Eds.), CEUR Workshop Proceedings (No. 2226; pp. 75–81). CEUR-WS. http://ceur-ws.org/Vol-2226/
-
mtrain: A Convenience Tool for Machine Translation 357–357. http://rua.ua.es/dspace/handle/10045/76099
-
Parallel Corpora, Terminology Extraction and Machine Translation (P. Drewer, F. Mayer, K.-D. Schmitz, & M. Volk, Eds.; pp. 3–14). s.n.
-
A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish In R. Mitkov, J. Monti, G. Corpas Pastor, & V. Seretan (Eds.), Multiword Units in Machine Translation and Translation Technology (No. 341; pp. 125–145). John Benjamins. https://doi.org/10.1075/cilt.341
-
Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus Journal for Language Technology and Computational Linguistics, 33, 25–47. https://jlcl.org/content/2-allissues/1-heft1-2018/jlcl_2018-1_2.pdf
-
Multi-word Adverbs – How well are they handled in Parsing and Machine Translation? The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017), London.
-
Multilingwis2 – Explore Your Parallel Corpus Linköping Electronic Conference Proceedings, 247–250. http://www.ep.liu.se/ecp/article.asp?issue=131&article=031&volume=#
-
Crossing Sentence Boundaries in Statistical Machine Translation MultiLingual, 50–52.
-
Building a Corpus of Multi-lingual and Multi-format International Investment Agreements In F. Bex & S. Vilata (Eds.), Frontiers in Artificial Intelligence and Applications (No. 294; pp. 203–206). I O S Press. https://doi.org/10.3233/978-1-61499-726-9-203
-
Building a Parallel Corpus on the World’s Oldest Banking Magazine. KONVENS, Bochum.
-
Rule-based Automatic Text Simplification for German 13th Conference on Natural Language Processing (KONVENS 2016), Bochum.
-
Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs KONVENS 2016, Bochum. https://www.linguistics.rub.de/konvens16/program/accepted.html
-
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database (P. Bański, M. Kupietz, H. Lüngen, A. Witt, A. Barbaresi, H. Biber, E. Breiteneder, & S. Clematide, Eds.; pp. 20–23). s.n. http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-CMLC_Proceedings.pdf
-
Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus 975–982. http://www.lrec-conf.org/proceedings/lrec2016/pdf/917_Paper.pdf
-
Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora In G. Corpas Pastor (Ed.), Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües (p. n/a). Tradulex.
-
Detecting Document-level Context Triggers to Resolve Translation Ambiguity Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon.
-
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German ACL-IJCNLP 2015 Student Research Workshop, Beijing.
-
Pre-reordering for Statistical Machine Translation of Non-fictional Subtitles The 18th Annual Conference of the European Association for Machine Translation (EAMT 2015), Antalya. http://www.eamt2015.org/files/downloads/EAMT2015_Proceedings.pdf
-
Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania (Vol. 111). Linköping University Electronic Press, Linköpings universitet. http://www.ep.liu.se/ecp/111/ecp15111.pdf
-
Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora In G. Gintare, S. Clematide, A. Utka, & M. Volk (Eds.), Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania (No. 111; pp. 6–16). Linköping University Electronic Press, Linköpings universitet. http://www.ep.liu.se/ecp_home/index.en.aspx?issue=111
-
Detecting Code-Switching in a Multilingual Alpine Heritage Corpus 24–33. https://doi.org/10.3115/v1/W14-3903
-
Enforcing Consistent Translation of German Compound Coreferences Konvens, Hildesheim.
-
Cleaning the Europarl Corpus for Linguistic Applications Konvens 2014, Hildesheim.
-
Evaluating the fully automatic multi-language translation of the Swiss avalanche bulletin 44–54. https://doi.org/10.1007/978-3-319-10223-8_5
-
Innovations in parallel corpus search tools Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik. http://www.lrec-conf.org/proceedings/lrec2014/pdf/504_Paper.pdf
-
Proceedings of the twelth workshop on treebanks and linguistic theories (TLT 12) Institute of Information and Communication Technologies. Bulgarian Academy of Sciences. http://www.bultreebank.org/TLT12/TLT12Proceedings.pdf
-
Reconstructing Complete Lemmas for Incomplete German Compounds 1–13. https://doi.org/10.1007/978-3-642-40722-2_1
-
Exploiting Synergies Between Open Resources for German Dependency Parsing, POS-tagging, and Morphological Analysis 601–609. http://www.aclweb.org/anthology/R/R13/R13-1079.pdf
-
Statistical machine translation for automobile marketing texts In K. Sima’an, M. L. Forcada, D. Grasmick, & H. Depraetere (Eds.), Machine Translation Summit XIV (Nice, September 2–6, 2013): main conference proceedings (pp. 265–272). http://www.mtsummit2013.info/files/proceedings/main/proceedings.pdf
-
Assessing post-editing efficiency in a realistic translation environment In S. O’Brien, M. Simard, & L. Specia (Eds.), Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (Nice, September 2, 2013) (pp. 83–91). European Association for Machine Translation.
-
Building a German/Simple German Parallel Corpus for Automatic Text Simplification The Second Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR 2013), Sofia.
-
Mining for Domain-specific Parallel Text from Wikipedia 112–120. http://www.aclweb.org/anthology/W13-2514
-
Combining statistical machine translation and translation memories with domain adaptation In S. Oepen, K. Hagen, & J. B. Johannesse (Eds.), Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013), May 22–24, 2013, Oslo University, Norway (pp. 331–341). Linköping University Electronic Press. http://www.ep.liu.se/ecp/085/030/ecp1385030.pdf
-
Parallel subtitle corpora and their applications in machine translation and translatology Perspectives: Studies in Translatology, 21, 595–610. https://doi.org/10.1080/0907676X.2013.831920
-
Statistical machine translation of subtitles: From OpenSubtitles to TED In I. Gurevych, C. Biemann, & T. Zesch (Eds.), Language Processing and Knowledge in the Web (No. 8105; Vol. 8105, pp. 132–138). Springer. https://doi.org/10.1007/978-3-642-40722-2_14
-
Using parallel treebanks for machine translation evaluation 145–156. http://tlt11.clul.ul.pt/
-
Change of biomedical domain terminology over time In A. Tavast, K. Muischnek, & M. Koit (Eds.), Frontiers in Artificial Intelligence and Applications (No. 247; pp. 74–81). I O S Press. https://doi.org/10.3233/978-1-61499-133-5-74
-
From subtitles to parallel corpora Proceedins of the 16th Annual Conference of the European Association for Machine Translation EAMT’2012, 3–6. http://www.mt-archive.info/EAMT-2012-Fishel.pdf
-
Semi-automatic annotation of semantic relations in a Swiss German sign language lexicon Proceedings of the 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon, LREC 2012, 31–36. http://www.lrec-conf.org/proceedings/lrec2012/workshops/24.Proceedings_SignLanguage.pdf
-
Towards a Wikipedia-extracted alpine corpus The Fifth Workshop on Building and Using Comparable Corpora, Istanbul. http://www.lrec-conf.org/proceedings/lrec2012/workshops/16.BUCC2012%20Proceedings.pdf
-
Parallel treebanking Spanish-Quechua: How and how well do they align? (7/13). online. http://elanguage.net/journals/lilt/article/view/2695
-
Digging for names in the mountains: Combined person name recognition and reference resolution for German alpine texts 5th Language & Technology Conference, Poznan. https://doi.org/10.1007/978-3-319-08958-4_16
-
Word-aligned parallel text : a new resource for contrastive language studies. Supporting Digital Humanities, Conference 2011, Copenhagen.
-
From multilingual web-archives to parallel treebanks in five minutes (H. Hedeland, T. Schmidt, & K. Wörner, Eds.; No. 96; pp. 57–62). Universität Hamburg. http://www.corpora.uni-hamburg.de/gscl2011/en/?download=AZM96.pdf
-
From historic books to annotated XML: Building a large multilingual diachronic corpus (No. 96). 75–80. http://www.corpora.uni-hamburg.de/gscl2011/downloads/AZM96.pdf
-
Reducing OCR errors in Gothic-script documents ERCIM News, 29–30. http://ercim-news.ercim.eu/
-
The Text+Berg corpus: an alpine french-german parallel resource TALN 2011, Montpellier.
-
Combining semantic and syntactic generalization in example-based machine translation (M. L. Forcada, H. Depraetere, & V. Vandeghinste, Eds.; pp. 209–216). http://www.ccl.kuleuven.be/EAMT2011/proceedings/pdf/eamt2011proceedings.pdf
-
Iterative, MT-based sentence alignment of parallel texts NODALIDA 2011, Nordic Conference of Computational Linguistics, Riga.
-
Disambiguation of English contractions for machine translation of TV subtitles NODALIDA 2011, Nordic Conference of Computational Linguistics, Riga.
-
Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora (Vol. 10). Northern European Association for Language Technology. http://dspace.utlib.ee/dspace/handle/10062/15893
-
Machine translation of TV subtitles for large scale production 53–62. http://web.me.com/emcnglworkshop/JEC2010/Program.html
-
MT-based sentence alignment for OCR-generated parallel texts The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010), Denver. http://amta2010.amtaweb.org/AMTA/papers/2-14-SennrichVolk.pdf
-
Reducing OCR errors by combining two OCR systems 61–65. http://ilk.uvt.nl/LaTeCH2010/paperlist.html
-
Combining parallel treebanks and geo-tagging Fourth Linguistic Annotation Workshop (LAW IV), Uppsala.
-
Challenges in building a multilingual alpine heritage corpus seventh international conference on Language Resources and Evaluation (LREC), Malta.
-
Towards mapping of alpine route descriptions In R. S. Purves, P. Clough, & C. B. Jones (Eds.), Proceedings of the 6th Workshop on Geographic Information Retrieval (GIR’10) (pp. 15–16). https://doi.org/10.1145/1722080.1722083
-
Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
-
Nominal anaphora. Can we tame the beasts? In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers : Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 77–84). Monsenstein und Vannerdat.
-
A Target-Driven Evaluation of Morphological Components for German In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers -- Festschrift in Honour of Michael Hess on the Occasion of his 60th birthday (pp. 85–99). MV-Wissenschaft.
-
Modelling relevance-driven language evolution In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 49–56). Monsenstein und und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
-
Detecting Protein-Protein Interactions in Biomedical Literature Using a Parser In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers (pp. 109–118). MV Verlag.
-
Classifying named entity in an alpine heritage corpus Künstliche Intelligenz, 40–43.
-
How many mountains are there in Switzerland? Explorations of the SwissTopo name list In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching answers: A festschrift for Michael Hess on the occasion of his 60th birthday (pp. 127–140). Monsenstein und Vannerdat.
-
Maschinelle und computergestützte Übersetzung In K. U. Carstensen (Ed.), Computerlinguistik und Sprachtechnologie. Eine Einführung (pp. 642–658). Spektrum.
-
Datorerna råpluggar översättning Språktidningen, 5, 58–61. http://www.spraktidningen.se/art.lasso?id=09558
-
A New Hybrid Dependency Parser for German In C. Chiarcos, R. E. de Castilho, & M. Stede (Eds.), Von der Form zur Bedeutung: Texte automatisch verarbeiten / From Form to Meaning: Processing Texts Automatically. Proceedings of the Biennial GSCL Conference 2009 (pp. 115–124). Narr.
-
A framework for constituent-dependency conversion 8th Conference on Treebanks and Linguistic Theories, Milano.
-
ACE can be described by itself In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in honour of Michael Hess on the occasion of his 60th birthday (pp. 45–48). Monsenstein und und Vannerdat.
-
A morpho-syntactic generation service for German glossary entries In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 33–43). Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
-
Natural language processing in law - change we need In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 11–19). Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
-
A Quechua-Spanish parallel treebank 7th Conference on Treebanks and Linguistic Theories, Groningen.
-
Extending the TIGER query language with universal quantification 5–17. https://files.ifi.uzh.ch/cl/volk/papers/Marek_Lundborg_Volk__KONVENS_2008.pdf
-
Human judgements in parallel treebank alignment COLING Workshop on Human Judgements in Computational Linguistics, Manchester. http://www.aclweb.org/anthology-new/W/W08/W08-1208.pdf
-
The automatic translation of film subtitles: a machine translation success story? In J. Nivre, M. Dahllöf, & B. Megyesi (Eds.), Resourceful Language Technology: Festschrift in Honor of Anna Sågvall Hein (pp. 202–214). Uppsala University. http://publications.uu.se/abstract.xsql?dbid=8933
-
Comparing French PP-attachment to English, German and Swedish Nodalida Workshop on Building Frame Semantics Resources for Scandinavian and Baltic Languages, Tartu.
-
Evaluating MT with translations or translators: what is the difference? MT-Summit, Copenhagen.
-
Alignment tools for parallel treebanks GLDV Frühjahrstagung, Tübingen.
-
Automatic phrase alignment. Using statistical n-gram alignment for syntactic phrase alignment 6th Workshop on Treebanks and Linguistic Theories, Bergen.
-
Using the Stockholm TreeAligner 6th Workshop on Treebanks and Linguistic Theories, Bergen.
-
A search tool for parallel treebanks ACL Workshop on Linguistic Annotation, Prague.
-
Frame-Semantic Annotation on a Parallel Treebank Nodalida Workshop on Building Frame Semantics Resources for Scandinavian and Baltic Languages, Tartu.
-
XML-based phrase alignment in parallel treebanks EACL Workshop on Multi-dimensional Markup in Natural Language Processing, Trento.
-
How bad is the problem of PP-attachment? A comparison of English, German and Swedish ACL-SIGSEM Workshop on Prepositions, Trento.
-
Phrase alignment in parallel treebanks 5th Workshop on Treebanks and Linguistic Theories, Prague.
-
Treebanks: a whitepaper In H. Holmboe (Ed.), Nordisk Sprogteknologi. Nordic Language Technology. Årbog for Nordisk Sprogteknologisk Forskningsprogram 2000-2004 (p. ?-?). Museum Tusculanums Forlag.
-
Teaching treebanking In H. Holmboe (Ed.), Nordisk Sprogteknologi. Nordic Language Technology. Årbog for Nordisk Sprogteknologisk Forskningsprogram 2000-2004 (pp. 143–159). Museum Tusculanums Forlag.
-
Enriching an ontology with WordNet based on similarity measures MEANING-2005 Workshop, Trento.
-
Requirements for a parallel treebank search tool GLDV-Conference, Bonn.
-
Automatic node insertion for treebank deepening Third Workshop on Treebanks and Linguistic Theories (TLT) 2004, Tübingen.
-
Bootstrapping parallel treebanks Workshop on Linguistically Interpreted Corpora (LINC) at COLING, Genève.
-
German prepositions and their kin. A survey with respect to the resolution of PP attachment ambiguities Workshop on The Linguistic Dimensions of Prepositions and their Use in Computational Linguistics Formalisms and Applications, Toulouse.
-
Ontologies in cross-language information retrieval 2nd Conference on Professional Knowledge Management, Luzern.
-
Semantic relations in concept-based cross-language medical information retrieval Workshop on Adaptive Text Extraction and Mining, Cavtat_Dubrovnik.
-
Using the web as corpus for linguistic research In R. Pajusalu & T. Hennoste (Eds.), ähendusepüüdja. Catcher of the Meaning. A Festschrift for Professor Haldur Õim (pp. 1–10). University of Tartu.
-
The length factor in automatic bilingual terminology extraction 6th International Conference on Terminology and Knowledge Engineering, Nancy.
-
Semantic annotation for concept-based cross-language medical information retrieval International Journal of Medical Informatics, 67, 97–112. https://doi.org/10.1016/S1386-5056(02)00058-8
-
Combining unsupervised and supervised methods for PP attachment disambiguation COLING-2002, Taipeh.
-
A systematic evaluation of concept-based cross-language information retrieval in the medical domain 3rd Dutch-Belgian Information Retrieval Workshop, Leuven.
-
Learn-filter-apply-forget. Mixed approaches to named entity recognition 6th International Workshop on Applications of Natural Language for Informations Systems, Madrid.
-
Exploiting the WWW as a corpus to resolve PP attachment ambiguities Corpus Linguistics, Lancaster.
-
Evaluating translation quality as input to product development Proc. of Second Int. Conference On Language Resources And Evaluation, Athens.
-
Scaling up: using the WWW to resolve PP attachment ambiguities Proc. of Konvens-2000, Ilmenau.
-
Aspects of the translation of English subordinate clauses into German Problems and Potential of English-to-German MT systems. Workshop at the 8th International Conference on Theoretical and Methodological Issues in Machine Translation, Chester.
-
Choosing the right lemma when analysing German nouns Multilinguale Corpora: Codierung, Strukturierung, Analyse. 11. Jahrestagung der GLDV, Frankfurt.
-
The automatic translation of idioms. Machine translation vs. translation memory systems In N. Weber (Ed.), Machine translation: theory, applications, and evaluation. An assessment of the state of the art (No. 1; pp. 167–192). Gardez!-Verlag.
-
Zur Problematik der maschinellen Übersetzung von Nebensätzen zwischen den Sprachen Englisch und Deutsch Evaluation of the Linguistic Performance of Machine Translation Systems. Proceedings of the Workshop at the KONVENS-98, Bonn.
-
Statistische Verfahren zur Zuordnung von Präpositionalphrasen KONVENS-98, Bonn.
-
Adding manual constraints and lexical look-up to a brill-tagger for German ESSLLI-98 Workshop on Recent Advances in Corpus Annotation, Saarbrücken.
-
Das Evaluieren von Software für die maschinenunterstützte Übersetzung In Équivalences 97 - Die Akten. Computerwerkzeuge am Übersetzer-Arbeitsplatz: Theorie und Praxis (pp. 105–117). ASSTI.
-
Markup of a Test Suite with SGML In J. Nerbonne (Ed.), Linguistic Databases (No. 77; pp. 59–76). Center for the Study of Language and Information,.
-
Comparing a statistical and a rule-based tagger for German Proc. of KONVENS-98, Bonn.
-
Constraint Logic Programming for Computational Linguistics In C. Retoré (Ed.), Logical Aspects of Computational Linguistics (No. 1328; pp. 406–425). Springer. https://doi.org/10.1007/BFb0052169
-
Probing the lexicon in evaluating commercial MT systems In Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics (pp. 112–119). European Chapter Meeting of the ACL. https://doi.org/10.3115/979617.979632
-
Hybride NLP-Systeme und das Problem der PP-Anbindung Hybride konnektionistische, statistische und symbolische Ansätze zur Verarbeitung natürlicher Sprache. Berichtsband des Workshops, Freiburg. http://www.dfki.de/~busemann/ki97/ki97-ws03.html
-
Die Rolle der Valenz bei der Auflösung von PP-Mehrdeutigkeiten In S. Mehl (Ed.), Präpositionalsemantik und PP-Anbindung (No. 16; pp. 32–38). Gerhard-Mercator-Universität - GH - Duisburg, Institut für Informatik.
-
Parsing with ID/LP and PS rules 342–353.
-
Einsatz einer Testsatzsammlung im Grammar Engineering (Dissertation, University of Zurich)
-
GTU - A workbench for the development of natural language grammars Conference on Practical Applications of Prolog, Paris.
-
Was ist Linguistic Engineering? Künstliche Intelligenz, 8, 15–22.
-
GTU - Eine Grammatik-Testumgebung 427–430.