Martin Volk

Martin Volk, Prof. Dr.

Professor of Computational Linguistics

Tel.: +41 44 63 54317

Fokusthemen: Digitale Linguistik, Historische Linguistik, Maschinelle Sprachverarbeitung und Maschinelles Lernen
Mitglied in UZH-Verbünden: NCCR Evolving Language, UFSP Sprache und Raum (SpuR)

Publikationen
Eigene Publikationsliste und ZORA-Abfrage

ZORA Publikationsliste

Download-Optionen

Format für Download Link

Download alsCSV Download alsRIS Download alsBIBTEX

Publikationen

Schneider, G., Volk, M., & Goldzycher, J. (2025). Detecting and Mapping Hate in Religious Contexts In T. Schlag & K. Yadav (Eds.), Religious Communication, Interaction and Transformation in a Culture of Digitality : Insights into the Zurich University Research Priority Program “Digital Religion(s)” (pp. 153–183). De Gruyter. https://doi.org/10.1515/9783111721729
Fischer, D. P., & Volk, M. (2025). Name Consistency in LLM-based Machine Translation of Historical Texts (P. Bouillon, J. Gerlach, S. Girletti, L. Volkart, R. Rubino, R. Sennrich, A. C. Farinha, M. Gaido, J. Daems, H. Moniz, & S. Szoc, Eds.; XX; pp. 204–219). Association for Computational Linguistics. https://aclanthology.org/2025.mtsummit-1.16/
Fischer, D. P., & Volk, M. (2025). LLM-based Translation for Latin: Summaries Improve Machine Translation (J. Gerber, M. Cieliebak, D. Tuggener, & M. Hürlimann, Eds.; No. 10; pp. 75–80). Association for Computational Linguistics. https://aclanthology.org/2025.swisstext-1.7/
Volk, M., Fischer, D. P., Scheurer, P., Schwitter, R., & Ströbel, P. (2024). LLM-based Translation Across 500 Years. The Case for Early New High German Proceedings of the Conference on Natural Language Processing, 368–375. https://aclanthology.org/2024.konvens-main.37.pdf
Clematide, S., Volk, M., Fankhauser, T., Hilty, L., & Bernard, J. (2024). SwissText 2024 Shared Task: Automatic Classification of the United Nations’ Sustainable Development Goals (SDGs) and Their Targets in English Scientific Abstracts 193–197. https://aclanthology.org/2024.swisstext-1.45/
Shaitarova, A., Bauer, N., Vamvas, J., & Volk, M. (2024). Tracing Linguistic Footprints of ChatGPT Across Tasks, Domains and Personas in English and German (C. Capol, M. Cieliebak, A. Weichselbra, C. Musat, & L. Zimmerman, Eds.; pp. 102–112). Association for Computational Linguistics. https://aclanthology.org/2024.swisstext-1.9/
Fankhauser, T., Clematide, S., & Volk, M. (2024). SDG Classification Using Instruction-Tuned LLMs 148–156. https://aclanthology.org/2024.swisstext-1.13/
Volk, M., Fischer, D. P., Fischer, L., Scheurer, P., & Ströbel, P. (2024). LLM-based Machine Translation and Summarization for Latin (R. Sprugnoli & M. Passarotti, Eds.).
Ströbel, P. B., Fischer, L., Müller, R., Scheurer, P., Schroffenegger, B., Suter, B., & Volk, M. (2024). Multilingual Workflows in Bullinger Digital: Data Curation for Latin and Early New High German Journal of Open Humanities Data, 10, 12. https://doi.org/10.5334/johd.174
Shaitarova, A., Göhring, A., & Volk, M. (2023). Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms 215–227. https://aclanthology.org/2023.nodalida-1.22
Ströbel, P. B., Hodel, T., Fischer, A., Scius-Bertrand, A., Wolf, B., Janka, A., Widmer, J., Scheurer, P., & Volk, M. (2023, March 17). Bullingers Briefwechsel zugänglich machen: Stand der Handschriftenerkennung DHd 2023, Trier. https://doi.org/10.5281/zenodo.7688632
Hegele, S., Heinisch, B., Popp, A., Marheinecke, K., Rios, A., Gromann, D., Volk, M., & Rehm, G. (2023). Language Report German In G. Rehm & A. Way (Eds.), European Language Equality: A Strategic Agenda for Digital Language Equality (pp. 147–150). Springer International Publishing. https://doi.org/10.1007/978-3-031-28819-7_18
Ströbel, P. B., Hodel, T., Boente, W., & Volk, M. (2023). The Adaptability of a Transformer-Based OCR Model for Historical Documents Lecture Notes in Computer Science, 34–48. https://doi.org/10.1007/978-3-031-41498-5_3
Hegele, S., Heinisch, B., Popp, A., Marheinecke, K., Rios, A., Gromann, D., Volk, M., & Rehm, G. (2023). European Language Equality - Report on the German Language (M. Giagkou, S. Piperidis, G. Rehm, & J. Dunne, Eds.). European Language Equality (ELE). https://european-language-equality.eu/wp-content/uploads/2022/03/ELE___Deliverable_D1_16__Language_Report_German_.pdf
Ströbel, P., Scheurer, P., & Volk, M. (2023). Lessons Learnt from Bullinger Digital 75–76. https://doi.org/10.5281/zenodo.10400571
Volk, M., & Graën, J. (2022). Binomials in Swedish corpora – ‘Ordpar 1965’ revisited In E. Volodina, D. Dannélls, A. Berdicevskis, M. Forsberg, & S. Virk (Eds.), Live and Learn : Festschrift in honor of Lars Borin (pp. 139–144). Department of Swedish, Multilingualism and Language Technology, University of Gothenburg.
Ströbel, P. B., Clematide, S., Hodel, T., & Volk, M. (2022, June 10). Transformer-based HTR for Historical Documents Workshop on Computational Methods in the Humanities 2022, Lausanne. https://wp.unil.ch/llist/files/2022/06/COMHUM_2022_paper_6.pdf
Volk, M., Fischer, L., Scheurer, P., Schwitter, R., Ströbel, P., & Suter, B. (2022, June). Nunc profana tractemus. Detecting Code-Switching in a Large Corpus of 16th Century Letters Proceedings of LREC-2022, Marseille. https://lrec2022.lrec-conf.org/en/
Ströbel, P. B., Clematide, S., Volk, M., Schwitter, R., Hodel, T., & Schoch, D. (2022, June). Evaluation of HTR models without Ground Truth Material LREC 2022, Marseille. http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.467.pdf
Hauser, R., Vamvas, J., Ebling, S., & Volk, M. (2022). A Multilingual Simplified Language News Corpus 25–30. https://aclanthology.org/2022.readi-1.4
Kew, T., & Volk, M. (2022, May). Improving Specificity in Review Response Generation with Data-Driven Data Filtering Proceedings of the Fifth Workshop on e-Commerce and NLP (ECNLP 5), Dublin. https://doi.org/10.18653/v1/2022.ecnlp-1.15
Scheurer, P., Raphael, M., Bernard, S., Ströbel, P. B., Benjamin, S., & Volk, M. (2022). Ein Briefwechsel-Korpus des 16. Jahrhunderts in Frühneuhochdeutsch In M. Kupietz & T. Schmidt (Eds.), Neue Entwicklungen in der Korpuslandschaft der Germanistik (pp. 33–42). Narr Francke Attempto GmbH + Co. KG. https://doi.org/10.24053/9783823396024
Fischer, L., Scheurer, P., Schwitter, R., & Volk, M. (2022). Machine Translation of 16th Century Letters from Latin to German 43–50. http://www.lrec-conf.org/proceedings/lrec2022/workshops/LT4HALA/pdf/2022.lt4hala2022-1.7.pdf
Cheng, Y., Ding, Y., Foucher, S., Pascual, D., Richter, O., Volk, M., & Wattenhofer, R. (2021). WikiFlash: Generating Flashcards from Wikipedia Articles In T. Mantoro & et al (Eds.), Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science (vol 13111; pp. 138–149). Springer. https://doi.org/10.1007/978-3-030-92273-3_12
Graën, J., & Volk, M. (2021). Binomial adverbs in Germanic and Romance Languages : A corpus-based study. In J. Lavid-López, C. Maíz-Arévalo, & J. R. Zamorano-Mansilla (Eds.), Corpora in Translation and Contrastive Research in the Digital Age : Recent advances and explorations (pp. 326–342). John Benjamins. https://doi.org/10.1075/btl.158.13gra
Klenner, M., Göhring, A., & Amsler, M. (2020). Harmonization Sometimes Harms (S. Ebling, D. Tuggener, M. Hürlimann, & M. Volk, Eds.). swisstext-and-konvens-2020. http://swisstext-and-konvens-2020.org
Goldzycher, J., Meraner, I., Volk, M., & Clematide, S. (2020). Ranking Georeferences for Efficient Crowdsourcing of Toponym Annotations in a Historical Corpus of Alpine Texts CEUR Workshop Proceedings, online. http://ceur-ws.org/Vol-2624/paper11.pdf
Ebling, S., Tuggener, D., Hürlimann, M., Cieliebak, M., & Volk, M. (Eds.). (2020). Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS) : Vol. Vol-2624. CEUR-ws. http://ceur-ws.org/Vol-2624/
Ströbel, P. B., Clematide, S., & Volk, M. (2020). How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR 3551–3559. https://www.aclweb.org/anthology/2020.lrec-1.436
Ehrmann, M., Bunout, E., Clematide, S., Düring, M., Fickers, A., Kalyakin, R., Kaplan, F., Romanello, M., Schroeder, P., Ströbel, P. B., van Beek, T., Volk, M., & Wieneke, L. (2020). Historical Newspaper Content Mining: Revisiting the impresso Project’s Challenges in Text and Image Processing, Design and Historical Scholarship. Digital Humanities 2020, Ottawa.
Säuberli, A., Ebling, S., & Volk, M. (2020). Benchmarking Data-driven Automatic Text Simplification for German In N. Gala & R. Wilkens (Eds.), Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI) (pp. 41–48). European Language Resources Association. https://www.aclweb.org/anthology/2020.readi-1.7
Battisti, A., Ebling, S., & Volk, M. (2019, November 15). An Empirical Analysis of Linguistic, Typographic, and Structural Features in Simplified German Texts Sixth Italian Conference on Computational Linguistics (CLiC-it 2019), Bari. http://ceur-ws.org/Vol-2481/paper4.pdf
Kew, T., Shaitarova, A., Meraner, I., Clematide, S., Goldzycher, J., & Volk, M. (2019). Geotagging a diachronic corpus of alpine texts: comparing distinct approaches to toponym recognition 11–18. https://doi.org/10.26615/978-954-452-059-5_003
Läubli, S., Amrhein, C., Düggelin, P., Gonzalez, B., Zwahlen, A., & Volk, M. (2019). Post-editing Productivity with Neural Machine Translation: An Empirical Assessment of Speed and Quality in the Banking and Finance Domain 267–272. https://www.aclweb.org/anthology/W19-6626
Graën, J., Kew, T., Shaitarova, A., & Volk, M. (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection (P. Bański, A. Barbaresi, H. Biber, E. Breiteneder, S. Clematide, M. Kupietz, H. Lüngen, & C. Iliadi, Eds.). Leibniz-Institut für Deutsche Sprache. https://doi.org/10.14618/ids-pub-9020
Läubli, S., Sennrich, R., & Volk, M. (2018). Has Machine Translation Achieved Human Parity? A Case for Document-level Evaluation 4791–4796. http://www.aclweb.org/anthology/D18-1512
Graën, J., Bertamini, M., & Volk, M. (2018). Cutter – a Universal Multilingual Tokenizer In M. Cieliebak, D. Tuggener, & F. Benites (Eds.), CEUR Workshop Proceedings (No. 2226; pp. 75–81). CEUR-WS. http://ceur-ws.org/Vol-2226/
Läubli, S., Müller, M., Horat, B., & Volk, M. (2018). mtrain: A Convenience Tool for Machine Translation 357–357. http://rua.ua.es/dspace/handle/10045/76099
Volk, M. (2018). Parallel Corpora, Terminology Extraction and Machine Translation (P. Drewer, F. Mayer, K.-D. Schmitz, & M. Volk, Eds.; pp. 3–14). s.n.
Clematide, S., Lehner, S., Graën, J., & Volk, M. (2018). A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish In R. Mitkov, J. Monti, G. Corpas Pastor, & V. Seretan (Eds.), Multiword Units in Machine Translation and Translation Technology (No. 341; pp. 125–145). John Benjamins. https://doi.org/10.1075/cilt.341
Clematide, S., Furrer, L., & Volk, M. (2018). Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus Journal for Language Technology and Computational Linguistics, 33, 25–47. https://jlcl.org/content/2-allissues/1-heft1-2018/jlcl_2018-1_2.pdf
Volk, M., & Graën, J. (2017, November 14). Multi-word Adverbs – How well are they handled in Parsing and Machine Translation? The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017), London.
Clematide, S., Meraner, I., Bubenhofer, N., & Volk, M. (2017). Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanitie 17–22.
Graën, J., Sandoz, D., & Volk, M. (2017). Multilingwis2 – Explore Your Parallel Corpus Linköping Electronic Conference Proceedings, 247–250. http://www.ep.liu.se/ecp/article.asp?issue=131&article=031&volume=#
Mascarell, L., Rios, A., & Volk, M. (2016, December). Crossing Sentence Boundaries in Statistical Machine Translation MultiLingual, 50–52.
Sugisaki, K., Volk, M., Polanco, R., Alschner, W., & Skougarevskiy, D. (2016). Building a Corpus of Multi-lingual and Multi-format International Investment Agreements In F. Bex & S. Vilata (Eds.), Frontiers in Artificial Intelligence and Applications (No. 294; pp. 203–206). I O S Press. https://doi.org/10.3233/978-1-61499-726-9-203
Volk, M., Amrhein, C., Aepli, N., Müller, M., & Ströbel, P. (2016, September 21). Building a Parallel Corpus on the World’s Oldest Banking Magazine. KONVENS, Bochum.
Suter, J., Ebling, S., & Volk, M. (2016, September 21). Rule-based Automatic Text Simplification for German 13th Conference on Natural Language Processing (KONVENS 2016), Bochum.
Volk, M., Clematide, S., Graën, J., & Ströbel, P. (2016, September 21). Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs KONVENS 2016, Bochum. https://www.linguistics.rub.de/konvens16/program/accepted.html
Graën, J., Clematide, S., & Volk, M. (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database (P. Bański, M. Kupietz, H. Lüngen, A. Witt, A. Barbaresi, H. Biber, E. Breiteneder, & S. Clematide, Eds.; pp. 20–23). s.n. http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-CMLC_Proceedings.pdf
Clematide, S., Furrer, L., & Volk, M. (2016). Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus 975–982. http://www.lrec-conf.org/proceedings/lrec2016/pdf/917_Paper.pdf
Clematide, S., Graën, J., & Volk, M. (2016). Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora In G. Corpas Pastor (Ed.), Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües (p. n/a). Tradulex.
Mascarell, L., Fishel, M., & Volk, M. (2015, September 17). Detecting Document-level Context Triggers to Resolve Translation Ambiguity Second Workshop on Discourse in Machine Translation (DiscoMT), Lisbon.
Pu, X., Mascarell, L., Popescu-Belis, A., Fishel, M., Luong, N.-Q., & Volk, M. (2015, July 28). Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German ACL-IJCNLP 2015 Student Research Workshop, Beijing.
Plamada, M., Linder, G., Ströbel, P., & Volk, M. (2015, May). Pre-reordering for Statistical Machine Translation of Non-fictional Subtitles The 18th Annual Conference of the European Association for Machine Translation (EAMT 2015), Antalya. http://www.eamt2015.org/files/downloads/EAMT2015_Proceedings.pdf
Grigonyte, G., Clematide, S., Utka, A., & Volk, M. (Eds.). (2015). Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania (Vol. 111). Linköping University Electronic Press, Linköpings universitet. http://www.ep.liu.se/ecp/111/ecp15111.pdf
Clematide, S. (2015). Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora In G. Gintare, S. Clematide, A. Utka, & M. Volk (Eds.), Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania (No. 111; pp. 6–16). Linköping University Electronic Press, Linköpings universitet. http://www.ep.liu.se/ecp_home/index.en.aspx?issue=111
Volk, M., & Clematide, S. (2014). Detecting Code-Switching in a Multilingual Alpine Heritage Corpus 24–33. https://doi.org/10.3115/v1/W14-3903
Mascarell, L., Fishel, M., Korchagina, N., & Volk, M. (2014, October 10). Enforcing Consistent Translation of German Compound Coreferences Konvens, Hildesheim.
Graën, J., Batinić, D., & Volk, M. (2014, October 10). Cleaning the Europarl Corpus for Linguistic Applications Konvens 2014, Hildesheim.
Winkler, K., Kuhn, T., & Volk, M. (2014). Evaluating the fully automatic multi-language translation of the Swiss avalanche bulletin 44–54. https://doi.org/10.1007/978-3-319-10223-8_5
Volk, M., Graën, J., & Callegaro, E. (2014, May 31). Innovations in parallel corpus search tools Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik. http://www.lrec-conf.org/proceedings/lrec2014/pdf/504_Paper.pdf
Kübler, S., Osenova, P., & Volk, M. (Eds.). (2013). Proceedings of the twelth workshop on treebanks and linguistic theories (TLT 12) Institute of Information and Communication Technologies. Bulgarian Academy of Sciences. http://www.bultreebank.org/TLT12/TLT12Proceedings.pdf
Aepli, N., & Volk, M. (2013). Reconstructing Complete Lemmas for Incomplete German Compounds 1–13. https://doi.org/10.1007/978-3-642-40722-2_1
Sennrich, R., Volk, M., & Schneider, G. (2013). Exploiting Synergies Between Open Resources for German Dependency Parsing, POS-tagging, and Morphological Analysis 601–609. http://www.aclweb.org/anthology/R/R13/R13-1079.pdf
Läubli, S., Fishel, M., Weibel, M., & Volk, M. (2013). Statistical machine translation for automobile marketing texts In K. Sima’an, M. L. Forcada, D. Grasmick, & H. Depraetere (Eds.), Machine Translation Summit XIV (Nice, September 2–6, 2013): main conference proceedings (pp. 265–272). http://www.mtsummit2013.info/files/proceedings/main/proceedings.pdf
Läubli, S., Fishel, M., Massey, G., Ehrensberger-Dow, M., & Volk, M. (2013). Assessing post-editing efficiency in a realistic translation environment In S. O’Brien, M. Simard, & L. Specia (Eds.), Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (Nice, September 2, 2013) (pp. 83–91). European Association for Machine Translation.
Klaper, D., Ebling, S., & Volk, M. (2013, August 8). Building a German/Simple German Parallel Corpus for Automatic Text Simplification The Second Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR 2013), Sofia.
Plamada, M., & Volk, M. (2013). Mining for Domain-specific Parallel Text from Wikipedia 112–120. http://www.aclweb.org/anthology/W13-2514
Läubli, S., Fishel, M., Volk, M., & Weibel, M. (2013). Combining statistical machine translation and translation memories with domain adaptation In S. Oepen, K. Hagen, & J. B. Johannesse (Eds.), Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013), May 22–24, 2013, Oslo University, Norway (pp. 331–341). Linköping University Electronic Press. http://www.ep.liu.se/ecp/085/030/ecp1385030.pdf
Bywood, L., Volk, M., Fishel, M., & Georgakopoulou, P. (2013). Parallel subtitle corpora and their applications in machine translation and translatology Perspectives: Studies in Translatology, 21, 595–610. https://doi.org/10.1080/0907676X.2013.831920
Müller, M., & Volk, M. (2013). Statistical machine translation of subtitles: From OpenSubtitles to TED In I. Gurevych, C. Biemann, & T. Zesch (Eds.), Language Processing and Knowledge in the Web (No. 8105; Vol. 8105, pp. 132–138). Springer. https://doi.org/10.1007/978-3-642-40722-2_14
Plamada, M., & Volk, M. (2012). Using parallel treebanks for machine translation evaluation 145–156. http://tlt11.clul.ul.pt/
Grigonyte, G., Rinaldi, F., & Volk, M. (2012). Term evolution: use of biomedical terminologies 79–80.
Grigonyte, G., Rinaldi, F., & Volk, M. (2012). Change of biomedical domain terminology over time In A. Tavast, K. Muischnek, & M. Koit (Eds.), Frontiers in Artificial Intelligence and Applications (No. 247; pp. 74–81). I O S Press. https://doi.org/10.3233/978-1-61499-133-5-74
Fishel, M., Georgakopoulou, Y., Penkale, S., Petukhova, V., Rojc, M., Volk, M., & Way, A. (2012). From subtitles to parallel corpora Proceedins of the 16th Annual Conference of the European Association for Machine Translation EAMT’2012, 3–6. http://www.mt-archive.info/EAMT-2012-Fishel.pdf
Ebling, S., Tissi, K., & Volk, M. (2012). Semi-automatic annotation of semantic relations in a Swiss German sign language lexicon Proceedings of the 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon, LREC 2012, 31–36. http://www.lrec-conf.org/proceedings/lrec2012/workshops/24.Proceedings_SignLanguage.pdf
Plamada, M., & Volk, M. (2012, May 26). Towards a Wikipedia-extracted alpine corpus The Fifth Workshop on Building and Using Comparable Corpora, Istanbul. http://www.lrec-conf.org/proceedings/lrec2012/workshops/16.BUCC2012%20Proceedings.pdf
Rios, A., Göhring, A., & Volk, M. (2012). Parallel treebanking Spanish-Quechua: How and how well do they align? (7/13). online. http://elanguage.net/journals/lilt/article/view/2695
Ebling, S., Sennrich, R., Klaper, D., & Volk, M. (2011, November 27). Digging for names in the mountains: Combined person name recognition and reference resolution for German alpine texts 5th Language & Technology Conference, Poznan. https://doi.org/10.1007/978-3-319-08958-4_16
Volk, M., Göhring, A., Lehner, S., Rios, A., Sennrich, R., & Uibo, H. (2011, November 18). Word-aligned parallel text : a new resource for contrastive language studies. Supporting Digital Humanities, Conference 2011, Copenhagen.
Killer, M., Sennrich, R., & Volk, M. (2011). From multilingual web-archives to parallel treebanks in five minutes (H. Hedeland, T. Schmidt, & K. Wörner, Eds.; No. 96; pp. 57–62). Universität Hamburg. http://www.corpora.uni-hamburg.de/gscl2011/en/?download=AZM96.pdf
Jitca, M., Sennrich, R., & Volk, M. (2011). From historic books to annotated XML: Building a large multilingual diachronic corpus (No. 96). 75–80. http://www.corpora.uni-hamburg.de/gscl2011/downloads/AZM96.pdf
Furrer, L., & Volk, M. (2011). Reducing OCR errors in Gothic-script documents 97–103.
Furrer, L., & Volk, M. (2011). Reducing OCR errors in Gothic-script documents ERCIM News, 29–30. http://ercim-news.ercim.eu/
Göhring, A., & Volk, M. (2011, July 1). The Text+Berg corpus: an alpine french-german parallel resource TALN 2011, Montpellier.
Ebling, S., Way, A., Volk, M., & Naskar, S. K. (2011). Combining semantic and syntactic generalization in example-based machine translation (M. L. Forcada, H. Depraetere, & V. Vandeghinste, Eds.; pp. 209–216). http://www.ccl.kuleuven.be/EAMT2011/proceedings/pdf/eamt2011proceedings.pdf
Sennrich, R., & Volk, M. (2011, May 13). Iterative, MT-based sentence alignment of parallel texts NODALIDA 2011, Nordic Conference of Computational Linguistics, Riga.
Volk, M., & Sennrich, R. (2011, May 13). Disambiguation of English contractions for machine translation of TV subtitles NODALIDA 2011, Nordic Conference of Computational Linguistics, Riga.
Ahrenberg, L., Tiedemann, J., & Volk, M. (Eds.). (2010). Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora (Vol. 10). Northern European Association for Language Technology. http://dspace.utlib.ee/dspace/handle/10062/15893
Volk, M., Sennrich, R., Hardmeier, C., & Tidström, F. (2010). Machine translation of TV subtitles for large scale production 53–62. http://web.me.com/emcnglworkshop/JEC2010/Program.html
Sennrich, R., & Volk, M. (2010, November 4). MT-based sentence alignment for OCR-generated parallel texts The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010), Denver. http://amta2010.amtaweb.org/AMTA/papers/2-14-SennrichVolk.pdf
Volk, M., Marek, T., & Sennrich, R. (2010). Reducing OCR errors by combining two OCR systems 61–65. http://ilk.uvt.nl/LaTeCH2010/paperlist.html
Volk, M., Goehring, A., & Marek, T. (2010, July 16). Combining parallel treebanks and geo-tagging Fourth Linguistic Annotation Workshop (LAW IV), Uppsala.
Volk, M., Bubenhofer, N., Althaus, A., Bangerter, M., Furrer, L., & Ruef, B. (2010, May 21). Challenges in building a multilingual alpine heritage corpus seventh international conference on Language Resources and Evaluation (LREC), Malta.
Piotrowski, M., Läubli, S., & Volk, M. (2010). Towards mapping of alpine route descriptions In R. S. Purves, P. Clough, & C. B. Jones (Eds.), Proceedings of the 6th Workshop on Geographic Information Retrieval (GIR’10) (pp. 15–16). https://doi.org/10.1145/1722080.1722083
Clematide, S., Klenner, M., & Volk, M. (Eds.). (2009). Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
Klenner, M. (2009). Nominal anaphora. Can we tame the beasts? In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers : Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 77–84). Monsenstein und Vannerdat.
Mahlow, C., & Piotrowski, M. (2009). A Target-Driven Evaluation of Morphological Components for German In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers -- Festschrift in Honour of Michael Hess on the Occasion of his 60th birthday (pp. 85–99). MV-Wissenschaft.
Höfler, S. (2009). Modelling relevance-driven language evolution In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 49–56). Monsenstein und und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
Schneider, G. (2009). Detecting Protein-Protein Interactions in Biomedical Literature Using a Parser In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers (pp. 109–118). MV Verlag.
Volk, M., Bubenhofer, N., Althaus, A., & Bangerter, M. (2009). Classifying named entity in an alpine heritage corpus Künstliche Intelligenz, 40–43.
Volk, M. (2009). How many mountains are there in Switzerland? Explorations of the SwissTopo name list In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching answers: A festschrift for Michael Hess on the occasion of his 60th birthday (pp. 127–140). Monsenstein und Vannerdat.
Hardmeier, C., & Volk, M. (2009). Using linguistic annotations in statistical machine translation of film subtitles Nodalida, Odense.
Jekat, S., & Volk, M. (2009). Maschinelle und computergestützte Übersetzung In K. U. Carstensen (Ed.), Computerlinguistik und Sprachtechnologie. Eine Einführung (pp. 642–658). Spektrum.
Volk, M., & Warin, M. (2009). Datorerna råpluggar översättning Språktidningen, 5, 58–61. http://www.spraktidningen.se/art.lasso?id=09558
Sennrich, R., Schneider, G., Volk, M., & Warin, M. (2009). A New Hybrid Dependency Parser for German In C. Chiarcos, R. E. de Castilho, & M. Stede (Eds.), Von der Form zur Bedeutung: Texte automatisch verarbeiten / From Form to Meaning: Processing Texts Automatically. Proceedings of the Biennial GSCL Conference 2009 (pp. 115–124). Narr.
Marek, T., Schneider, G., & Volk, M. (2009). A framework for constituent-dependency conversion 8th Conference on Treebanks and Linguistic Theories, Milano.
Fuchs, N. E., Kaljurand, K., & Kuhn, T. (2009). ACE can be described by itself In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in honour of Michael Hess on the occasion of his 60th birthday (pp. 45–48). Monsenstein und und Vannerdat.
Clematide, S. (2009). A morpho-syntactic generation service for German glossary entries In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 33–43). Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
Bünzli, A. (2009). Natural language processing in law - change we need In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 11–19). Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
Rios, A., Göhring, A., & Volk, M. (2009). A Quechua-Spanish parallel treebank 7th Conference on Treebanks and Linguistic Theories, Groningen.
Marek, T., Lundborg, J., & Volk, M. (2008). Extending the TIGER query language with universal quantification 5–17. https://files.ifi.uzh.ch/cl/volk/papers/Marek_Lundborg_Volk__KONVENS_2008.pdf
Volk, M., Marek, T., & Samuelsson, Y. (2008, August 23). Human judgements in parallel treebank alignment COLING Workshop on Human Judgements in Computational Linguistics, Manchester. http://www.aclweb.org/anthology-new/W/W08/W08-1208.pdf
Volk, M. (2008). The automatic translation of film subtitles: a machine translation success story? In J. Nivre, M. Dahllöf, & B. Megyesi (Eds.), Resourceful Language Technology: Festschrift in Honor of Anna Sågvall Hein (pp. 202–214). Uppsala University. http://publications.uu.se/abstract.xsql?dbid=8933
Volk, M., & Tidström, F. (2007). Comparing French PP-attachment to English, German and Swedish Nodalida Workshop on Building Frame Semantics Resources for Scandinavian and Baltic Languages, Tartu.
Volk, M., & Harder, S. (2007). Evaluating MT with translations or translators: what is the difference? MT-Summit, Copenhagen.
Samuelsson, Y., & Volk, M. (2007). Alignment tools for parallel treebanks GLDV Frühjahrstagung, Tübingen.
Samuelsson, Y., & Volk, M. (2007). Automatic phrase alignment. Using statistical n-gram alignment for syntactic phrase alignment 6th Workshop on Treebanks and Linguistic Theories, Bergen.
Lundborg, J., Marek, T., Mettler, M., & Volk, M. (2007). Using the Stockholm TreeAligner 6th Workshop on Treebanks and Linguistic Theories, Bergen.
Volk, M., Lundborg, J., & Mettler, M. (2007). A search tool for parallel treebanks ACL Workshop on Linguistic Annotation, Prague.
Volk, M., & Samuelsson, Y. (2007). Frame-Semantic Annotation on a Parallel Treebank Nodalida Workshop on Building Frame Semantics Resources for Scandinavian and Baltic Languages, Tartu.
Volk, M., Gustafson-Capková, S., Lundborg, J., Marek, T., Samuelsson, Y., & Tidström, F. (2006). XML-based phrase alignment in parallel treebanks EACL Workshop on Multi-dimensional Markup in Natural Language Processing, Trento.
Volk, M. (2006). How bad is the problem of PP-attachment? A comparison of English, German and Swedish ACL-SIGSEM Workshop on Prepositions, Trento.
Samuelsson, Y., & Volk, M. (2006). Phrase alignment in parallel treebanks 5th Workshop on Treebanks and Linguistic Theories, Prague.
Samuelsson, Y., & Volk, M. (2005). Presentation and representation of parallel treebanks 147–159.
Nivre, J., De Smet, K., & Volk, M. (2005). Treebanks: a whitepaper In H. Holmboe (Ed.), Nordisk Sprogteknologi. Nordic Language Technology. Årbog for Nordisk Sprogteknologisk Forskningsprogram 2000-2004 (p. ?-?). Museum Tusculanums Forlag.
Volk, M., Gusafson-Capková, S., Hagstrand, D., & Uibo, H. (2005). Teaching treebanking In H. Holmboe (Ed.), Nordisk Sprogteknologi. Nordic Language Technology. Årbog for Nordisk Sprogteknologisk Forskningsprogram 2000-2004 (pp. 143–159). Museum Tusculanums Forlag.
Warin, M., Oxhammar, H., & Volk, M. (2005). Enriching an ontology with WordNet based on similarity measures MEANING-2005 Workshop, Trento.
Merz, C., & Volk, M. (2005). Requirements for a parallel treebank search tool GLDV-Conference, Bonn.
Stocker, C., Macher, D., Studler, R., Bubenhofer, N., Crvelin, D., Liniger, R., & Volk, M. (2004). Studien-CD Linguistik: multimediale Einführungen und interaktive Übungen zur germanistischen Sprachwissenschaft Niemeyer Verlag.
Samuelsson, Y., & Volk, M. (2004). Automatic node insertion for treebank deepening Third Workshop on Treebanks and Linguistic Theories (TLT) 2004, Tübingen.
Volk, M., & Samuelsson, Y. (2004). Bootstrapping parallel treebanks Workshop on Linguistically Interpreted Corpora (LINC) at COLING, Genève.
Buitelaar, P., Steffen, D., Volk, M., Widdows, D., Sacaleanu, B., Vintar, S., Peters, S., & Uszkoreit, H. (2004). Evaluation resources for concept-based cross-lingual information retrieval in the medical domain LREC-2004, Lisbon.
Volk, M. (2003). German prepositions and their kin. A survey with respect to the resolution of PP attachment ambiguities Workshop on The Linguistic Dimensions of Prepositions and their Use in Computational Linguistics Formalisms and Applications, Toulouse.
Volk, M., Vintar, S., & Buitelaar, P. (2003). Ontologies in cross-language information retrieval 2nd Conference on Professional Knowledge Management, Luzern.
Vintar, S., Buitelaar, P., & Volk, M. (2003). Semantic relations in concept-based cross-language medical information retrieval Workshop on Adaptive Text Extraction and Mining, Cavtat_Dubrovnik.
Sacaleanu, B., Buitelaar, P., & Volk, M. (2003). A cross-language document retrieval system based on semantic annotation EACL, Budapest.
Volk, M. (2002). Using the web as corpus for linguistic research In R. Pajusalu & T. Hennoste (Eds.), ähendusepüüdja. Catcher of the Meaning. A Festschrift for Professor Haldur Õim (pp. 1–10). University of Tartu.
Volk, M., Pantli, A.-K., & Malka, A. M. (2002). The length factor in automatic bilingual terminology extraction 6th International Conference on Terminology and Knowledge Engineering, Nancy.
Volk, M., Ripplinger, B., Vintar, S., Buitelaar, P., Raileanu, D., & Sacaleanu, B. (2002). Semantic annotation for concept-based cross-language medical information retrieval International Journal of Medical Informatics, 67, 97–112. https://doi.org/10.1016/S1386-5056(02)00058-8
Volk, M. (2002). Combining unsupervised and supervised methods for PP attachment disambiguation COLING-2002, Taipeh.
Volk, M., & Buitelaar, P. (2002). A systematic evaluation of concept-based cross-language information retrieval in the medical domain 3rd Dutch-Belgian Information Retrieval Workshop, Leuven.
Clematide, S., & Volk, M. (2001). Linguistische und semantische Annotation eines Zeitungskorpus 201–209.
Volk, M., & Clematide, S. (2001). Learn-filter-apply-forget. Mixed approaches to named entity recognition 6th International Workshop on Applications of Natural Language for Informations Systems, Madrid.
Arnold, T., Clematide, S., Nespeca, R., Roth, J., & Volk, M. (2001). LUIS - Ein natürlichsprachliches, universitäres Informationssystem 115–126.
Volk, M. (2001). Exploiting the WWW as a corpus to resolve PP attachment ambiguities Corpus Linguistics, Lancaster.
Bohan, N., Breidt, E., & Volk, M. (2000). Evaluating translation quality as input to product development Proc. of Second Int. Conference On Language Resources And Evaluation, Athens.
Volk, M. (2000). Scaling up: using the WWW to resolve PP attachment ambiguities Proc. of Konvens-2000, Ilmenau.
Mehl, S., & Volk, M. (1999). Aspects of the translation of English subordinate clauses into German Problems and Potential of English-to-German MT systems. Workshop at the 8th International Conference on Theoretical and Methodological Issues in Machine Translation, Chester.
Volk, M. (1999). Choosing the right lemma when analysing German nouns Multilinguale Corpora: Codierung, Strukturierung, Analyse. 11. Jahrestagung der GLDV, Frankfurt.
Volk, M. (1998). The automatic translation of idioms. Machine translation vs. translation memory systems In N. Weber (Ed.), Machine translation: theory, applications, and evaluation. An assessment of the state of the art (No. 1; pp. 167–192). Gardez!-Verlag.
Mehl, S., Heidemann, B., & Volk, M. (1998). Zur Problematik der maschinellen Übersetzung von Nebensätzen zwischen den Sprachen Englisch und Deutsch Evaluation of the Linguistic Performance of Machine Translation Systems. Proceedings of the Workshop at the KONVENS-98, Bonn.
Mehl, S., Langer, H., & Volk, M. (1998). Statistische Verfahren zur Zuordnung von Präpositionalphrasen KONVENS-98, Bonn.
Schneider, G., & Volk, M. (1998). Adding manual constraints and lexical look-up to a brill-tagger for German ESSLLI-98 Workshop on Recent Advances in Corpus Annotation, Saarbrücken.
Volk, M. (1998). Das Evaluieren von Software für die maschinenunterstützte Übersetzung In Équivalences 97 - Die Akten. Computerwerkzeuge am Übersetzer-Arbeitsplatz: Theorie und Praxis (pp. 105–117). ASSTI.
Volk, M. (1998). Markup of a Test Suite with SGML In J. Nerbonne (Ed.), Linguistic Databases (No. 77; pp. 59–76). Center for the Study of Language and Information,.
Volk, M., & Schneider, G. (1998). Comparing a statistical and a rule-based tagger for German Proc. of KONVENS-98, Bonn.
Volk, M., & Richarz, D. (1997). Experiences with the GTU grammar development environment 107–113.
Stolzenburg, F., Höhne, S., Koch, U., & Volk, M. (1997). Constraint Logic Programming for Computational Linguistics In C. Retoré (Ed.), Logical Aspects of Computational Linguistics (No. 1328; pp. 406–425). Springer. https://doi.org/10.1007/BFb0052169
Volk, M. (1997). Probing the lexicon in evaluating commercial MT systems In Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics (pp. 112–119). European Chapter Meeting of the ACL. https://doi.org/10.3115/979617.979632
Langer, H., Mehl, S., & Volk, M. (1997). Hybride NLP-Systeme und das Problem der PP-Anbindung Hybride konnektionistische, statistische und symbolische Ansätze zur Verarbeitung natürlicher Sprache. Berichtsband des Workshops, Freiburg. http://www.dfki.de/~busemann/ki97/ki97-ws03.html
Volk, M. (1996). Die Rolle der Valenz bei der Auflösung von PP-Mehrdeutigkeiten In S. Mehl (Ed.), Präpositionalsemantik und PP-Anbindung (No. 16; pp. 32–38). Gerhard-Mercator-Universität - GH - Duisburg, Institut für Informatik.
Volk, M. (1996). Parsing with ID/LP and PS rules 342–353.
Volk, M. (1995). Einsatz einer Testsatzsammlung im Grammar Engineering (Dissertation, University of Zurich)
Volk, M., Jung, M., & Richarz, D. (1995). GTU - A workbench for the development of natural language grammars Conference on Practical Applications of Prolog, Paris.
Volk, M., Fitschen, A., Pieper, S., & van Luijt, R. (1994). Was ist Linguistic Engineering? Künstliche Intelligenz, 8, 15–22.
Jung, M., Richarz, D., & Volk, M. (1994). GTU - Eine Grammatik-Testumgebung 427–430.

Quicklinks

Hauptnavigation

Martin Volk, Prof. Dr.

ZORA Publikationsliste

Download-Optionen

Publikationen