Publikationen

2017

Bański, P.; Kupietz, M.; Lüngen, H.; Rayson, P.; Biber, H.; Breiteneder, E.; Clematide, S.; Mariani, J.; Stevenson, M.; Sick, T. (2017): The Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing (CMLC-5+BigNLP) 2017 including the papers from the Web-as-Corpus (WAC-XI) guest section. Mannheim: Institut für Deutsche Sprache. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/6243.

Cassidy, S.; Schmidt, T. (2017): Tools for Multimodal Annotation. In: N. Ide & J. Pustejovsky (Eds.), Handbook of Linguistic Annotation. 209-227. Springer Dordrecht. https://doi.org/10.1007/978-94-024-0881-2_7.

Eckart, T.; Goldhahn, D.; Quasthoff, U. (2017): Using Corpus Query Engines for Facilitating Lexicographical Analysis of African Languages. In: Workshop of the African Association for Lexicography (AFRILEX), CLASA 2017, Grahamstown, SA.

Eckart, T.; Goosen, T.; Haaf, S.; Hedeland, H.; Ohren, O.; van Uytvanck, D.; Windhouwer, M. (2017): Component Metadata Infrastructure Best Practices for CLARIN. In: CLARIN Annual Conference 2017, Budapest, Hungary. Verfügbar unter https://www.clarin.eu/sites/default/files/Eckart-CLARIN2017_paper_9.pdf.

Eckart, T.; Gradl, T. (2017): Working towards a Metadata Federation of CLARIN and DARIAH-DE. In: CLARIN Annual Conference 2017, Budapest, Hungary. Verfügbar unter https://www.clarin.eu/sites/default/files/Eckart-Gradl-CLARIN2017_paper_2.pdf.

Fürbacher, M.; Váradi, T.; Witt, A. (2017): Digitale Forschungsinfrastrukturen: Ihre Nutzung durch die Mitglieder der Europäischen Föderation nationaler Sprachinstitutionen. In: J. Dąbrowska-Burkhardt, L. M. Eichinger & U. Itakura (Eds.), Deutsch: lokal – regional – global. Festschrift für Gerhard Stickel zum 80. Geburtstag (= Studien zur Deutschen Sprache 77). 103-114. Narr, 2017, Tübingen.

Gloning, T. (2017): Ernst Jünger Publizistik der 1920er Jahre. Befunde zum Wortgebrauchsprofil. In: A. Benedetti & L. Hagestedt (Eds.), Totalität als Faszination: Systematisierung des Heterogenen im Werk Ernst Jüngers. De Gruyter, Berlin.

Grallert, T.; Tiepmar, J.; Eckart, T.; Goldhahn, D.; Kuras, C. (2017): Digital Muqtabas CTS Integration in CLARIN. In: CLARIN Annual Conference 2017, Budapest, Hungary. Verfügbar unter https://www.clarin.eu/sites/default/files/Grallert-etal-CLARIN2017_paper_21.pdf.

Haaf, S. (2017): Das DTA-Basisformat in neuem Gewand. In: Im Zentrum Sprache. Untersuchungen zur deutschen Sprache. Verfügbar unter http://sprache.hypotheses.org/.

Haaf, S.; Thomas, C. (2017): Enabling the Encoding of Manuscripts within the DTABf. Extensions and Modularization of the Format.  In: Journal of the Text Encoding Initiative (jTEI) 10. Verfügbar unter http://jtei.revues.org/1650.

Harrington, J.; Schiel, F. (2017): /u/-fronting and agent-based modelling - The relationship between the origin and spread of sound change.  In: Language, 93  (2),  414-445. Verfügbar unter http://www.phonetik.uni-muenchen.de/forschung/publikationen/HarringtonSchiel-LANGUAGE2017.pdf.

Hinrichs, E.; Trippel, T. (2017): CLARIN-D: eine Forschungsinfrastruktur für die sprachbasierte Forschung in den Geistes- und Sozialwissenschaften.  In: Bibliothek - Forschung und Praxis, 1  (41). Verfügbar unter https://www.degruyter.com/downloadpdf/j/bfup.2017.41.issue-1/bfp-2017-0015/bfp-2017-0015.pdf.

Kisler, T.; Reichel, U.; Schiel, F. (2017): Multilingual processing of speech via web services. In: Computer, Speech and Language, 45, 326-347. Verfügbar unter http://www.phonetik.uni-muenchen.de/forschung/publikationen/KislerReichelSchiel_CSL2017.pdf.

Knappen, J.; Fischer, S.; Kermes, H.; Teich, E.; Fankhauser, P. (2017): The Making of the Royal Society Corpus. In: G. Bouma & Y. Adesam (Eds.), Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language. 7-11. Linköping University Electronic Press, Linköpings universitet. Verfügbar unter http://www.ep.liu.se/ecp/article.asp?issue=133&article=003&volume=.

Kupietz, M.; Diewald, N.; Hanl, M.; Margaretha, E. (2017): Möglichkeiten der Erforschung grammatischer Variation mithilfe von KorAP. In: M. Konopka & A. Wöllstein (Eds.), Grammatische Variation. Empirische Zugänge und theoretische Modellierung. Jahrbuch des Instituts für Deutsche Sprache 2016. 319-329. de Gruyter, Berlin/Boston. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5968.

Kupietz, M.; Witt, A.; Bański, P.; Tufiş, D.; Cristea, D.; Váradi, T. (2017): EuReCo – Joining Forces for a European Reference Corpus as a sustainable base for cross-linguistic research. In: P. Bański et al. (Eds.): The Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing (CMLC-5+BigNLP) 2017 including the papers from the Web-as-Corpus (WAC-XI) guest section, Birmingham, UK, 15-19. Verfügbar unter https://ids-pub.bsz-bw.de/files/6243/4.+Kupietz_etal_EuReCo_2017.pdf.

Kuras, C.; Eckart, T. (2017): Prozessmodellierung mittels BPMN in Forschungsinfrastrukturen der Digital Humanities. In: INFORMATIK 2017, Lecture Notes in Informatics (LNI). Verfügbar unter https://dl.gi.de/bitstream/handle/20.500.12116/3873/B14-1.pdf?sequence=1&isAllowed=y.

Lüngen, H. (2017): DeReKo - Das Deutsche Referenzkorpus. Schriftkorpora der deutschen Gegenwartssprache am Institut für Deutsche Sprache in Mannheim.  In: Zeitschrift für germanistische Linguistik, 45  (1),  161-170. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/6106.

Lüngen, H.; Kupietz, M. (2017): CMC Corpora in DeReKo. In: P. Bański et al. (Eds.), Proceedings of the Workshop on Challenges in the Management of Large Corpora and Big Data and Natural Language Processing (CMLC-5+BigNLP) 2017 including the papers from the Web-as-Corpus (WAC-XI) guest section. 20-24. Mannheim: Institut für Deutsche Sprache, Birmingham, UK. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/6259.

Schmidt, T. (2017): Construction and Dissemination of a Corpus of Spoken Interaction - Tools and Workflows in the FOLK project.  In: Corpus Linguistic Software Tools. Journal for Language Technology and Computational Linguistics (JLCL), 31  (1),  127-154. Verfügbar unter http://www.jlcl.org/2016_Heft1/jlcl-2016-1-7Schmidt.pdf.

Schmidt, T.; Hedeland, H.; Jettka, D. (2017): Conversion and Annotation Web Services for Spoken Language Data in CLARIN. In: L. Borin (Ed.): Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, France, 113-130. Verfügbar unter https://www.clarin.eu/sites/default/files/schmidt-etal-CLARIN2016_paper_27.pdf.

Tiepmar, J.; Eckart, T.; Goldhahn, D.; Kuras, C. (2017): Integrating Canonical Text Services into CLARIN's Search Infrastructure.  In: Linguistics and Literature Studies 5(2),  99-104. Verfügbar unter http://www.hrpub.org/download/20170228/LLS5-19308447.pdf.

Tiepmar, J.; Heyer, G. (2017): An Overview of Canonical Text Services.  In: Linguistics and Literature Studies, 5(2),  132-148. Verfügbar unter http://asv.informatik.uni-leipzig.de/publication/file/378/LLS9-19308836.pdf.

2016

Bański, P.; Frick, E.; Witt, A. (2016): Corpus Query Lingua Franca (CQLF). In: N. Calzolari et al. (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). 2804-2809, Paris: European Language Resources Association (ELRA). Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5040.

Bański, P.; Kupietz, M.; Lüngen, H.; Witt, A.; Barbaresi, A.; Biber, H.; Breiteneder, E.; Clematide, S. (2016): The 4th Workshop on Challenges in the Management of Large Corpora. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-CMLC_Proceedings.pdf.

Beißwenger, M.; Chanier, T.; Erjavec, T.; Fišer, D.; Herold, A.; Ljubešić, N.; Lüngen, H.; Poudat, C.; Stemle, E.; Storrer, A.; Wigham, C. (2016): Closing a gap in the language resources landscape: Groundwork and best practices from projects on computer-mediated communication in four European countries. In: CLARIN Annual Conference 2016, Aix-en-Provence.

Beißwenger, M.; Ehrhardt, E.; Herold, A.; Lüngen, H.; Storrer, A. (2016a): (Best) Practices for Annotating and Representing CMC and Social Media Corpora in CLARIN-D. In: D. Fišer & M. Beißwenger (Eds.), Proceedings of the 4th Conference on CMC and Social Media Corpora for the Humanities 7-11. Academic Publishing Division of the Faculty of Arts of the University of Ljubljana, 2016, Ljubljana.

Beißwenger, M.; Ehrhardt, E.; Herold, A.; Lüngen, H.; Storrer, A. (2016b): Converting and Representing Social Media Corpora into TEI: Schema and best practices from CLARIN-D. In: C. Resch et al. (Eds.), TEI Conference and Members' Meeting 2016. Book of Abstracts. 39-41. Austrian Academy of Sciences, Austrian Centre for Digital Humanities, 2016, Wien.

Beißwenger, M.; Herold, A.; Lüngen, H.; Storrer, A. (2016): Das Dortmunder Chat-Korpus in CLARIN-D: Modellierung und Mehrwerte. nisaba 2016, Duisburg.

Blessing, A.; Kuhn, J. (2016): Crosslingual Textual Emigration Analysis. In: Digital Humanities 2016: Conference Abstracts, Jagiellonian University & Pedagogical University, Kraków 744-745. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/pdf/1009_Paper.pdf.

Butt, M.; Heyer, G.; Holzinger, K.; Kantner, C.; Keim, D.; Kuhn, J.; Schaal, G.; Blessing, A.; Dumm, S.; El-Assady, M.; Gold, V.; Hautli-Janisz, A.; Lemke, M.; Müller, M.; Niekler, A.; Overbeck, M.; Wiedemann, G. (2016): Argumentanalyse in digitalen Textkorpora. In: DHd 2016, Leipzig.

Cosma, R.; Cristea, D.; Kupietz, M.; Tufiş, D.; Witt, A. (2016): DRuKoLa - Towards Contrastive German-Romanian Research based on Comparable Corpora. In: P. Bański et al. (Eds.): 4th Workshop on Challenges in the Management of Large Corpora. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, 28-32.

Degaetano-Ortlieb, S.; Kermes, H.; Khamis, A.; Knappen, J.; Teich, E. (2016): The Royal Society Corpus: From Uncharted Data to Corpus. In: LREC 2016, Portorož, Slovenia 2016. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/792_Paper.pdf.

Diewald, N.; Hanl, M.; Margaretha, E.; Bingel, J.; Kupietz, M.; Bański, P.; Witt, A. (2016): KorAP Architecture - Diving in the Deep Sea of Corpus Data. In: N. Calzolari et al. (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Paris: European Language Resources Association (ELRA) 2016, Portoroz, Slovenia. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/243_Paper.pdf.

Diewald, N.; Margaretha, E. (2016): Krill: KorAP search and analysis engine.  In: Corpus Linguistic Software Tools. Journal for Language Technology and Computational Linguistics (JLCL) 31  (1). Verfügbar unter http://www.jlcl.org/2016_Heft1/jlcl-2016-1-4DiewaldMargaretha.pdf.

Eckart, T. (2016): Einsatz und Bewertung komponentenbasierter Metadaten in einer föderierten Infrastruktur für Sprachressourcen am Beispiel der CMDI. Universität Leipzig. Verfügbar unter https://core.ac.uk/download/pdf/45272098.pdf.

Eckart, T.; Kuras, C.; Quasthoff, U. (2016): Features for Generic Corpus Querying. In: International Conference on Language Resources and Evaluation (LREC). Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/503_Paper.pdf.

Fandrych, C.; Frick, E.; Hedeland, H.; Iliash, A.; Jettka, D.; Meißner, C.; Schmidt, T.; Wallner, F.; Weigert, K. (2016): Wer bist du, Nutzer? DHd 2016. 122-125. Verfügbar unter http://www.dhd2016.de/abstracts/vortr%C3%A4ge-053.html.

Fandrych, C.; Frick, E.; Hedeland, H.; Iliash, A.; Jettka, D.; Meißner, C.; Schmidt, T.; Wallner, F.; Weigert, K.; Westpfahl, S. (2016): User, who art thou? User profiling for oral corpus platforms. In: N. Calzolari et al. (Eds.), Tenth International Conference on Language Resources and Evaluation (LREC 2016). 280-287, Portorož, Slovenia. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5077.

Fankhauser, P. (2016): Kuration und Exploration des Korpus "Diskurs in der Weimarer Republik". In: Digital Humanities im deutschsprachigen Raum. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5687.

Fankhauser, P.; Knappen, J.; Teich, E. (2016): Topical Diversification over Time in the Royal Society Corpus. In: DH 2016, Kraków, Poland.

Fischer, P. M.; Diewald, N.; Kupietz, M.; Witt, A. (2016): Aufbau einer Korpusinfrastruktur für die Beobachtung des Schreibgebrauchs. In: DHd 2016. Modellierung - Vernetzung - Visualisierung. Die Digital Humanities als fächerübergreifendes Forschungsparadigma. Konferenzabstracts, Universität Leipzig, 310-312. Verfügbar unter http://www.dhd2016.de/abstracts/posters-074.html.

Freudenberg, M.; Brümmer, M.; Rücknagel, J.; Ulrich, R.; Eckart, T.; Kontokostas, D.; Hellmann, S. (2016): The Metadata Ecosystem of DataID. Springer International Publishing. Verfügbar unter https://svn.aksw.org/papers/2016/MSOR_DataID2/public.pdf.

Haaf, S. (2016): Corpus Analysis based on Structural Phenomena in Texts. Exploiting TEI Encoding for Linguistic Research. In: N. Calzolari et al. (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 23.-28. Mai 2016, Portorož (Slovenia), Paris: ELRA, 2016. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/1154_Paper.pdf.

Haaf, S.; Thomas, C. (2016): Die Historischen Korpora des Deutschen Textarchivs als Grundlage für sprachgeschichtliche Forschungen. In: H. R. V. H. L. Schiwek (Ed.), Sprachgeschichte des Deutschen. Positionierungen in Forschung, Studium, Schule. 217-234, Stuttgart. Verfügbar unter http://www.deutschestextarchiv.de/files/Haaf_Thomas_FAHNEN.pdf

Hedeland, H.; Lehmberg, T.; Wagner-Nagy, B. (2016): Digitale Workflows in Langzeitprojekten am Beispiel einer Infrastruktur zur Dokumentation indigener nordeuropäischer Sprachen (INEL). In: DHd 2016 152-154. Verfügbar unter http://www.dhd2016.de/abstracts/vortr%C3%A4ge-044.html.

Hinrichs, E.; Hinrichs, M.; Trippel, T. (2016): Proceedings of: Language Technology Resources and Tools for Digital Humanities (LT4DH): Workshop at COLING 2016. Verfügbar unter https://www.clarin-d.de/images/lt4dh/LT4DH-2016.pdf.

Jurish, B. (2016a): Diachronic Collocations and Genre. In: R. J. Whitt (Ed.), Diachronic Corpora, Genre, and Language Change. 22-24, Nottingham. Verfügbar unter http://www.deutschestextarchiv.de/files/jurish2016genre.pdf.

Jurish, B. (2016b): Tools, Toys, and Filters. In: Rechtsgeschichte - Legal History Rg 24. 347-348. Verfügbar unter http://dx.doi.org/10.12946/rg24/347-348; http://www.deutschestextarchiv.de/files/jurish2016tools_draft.pdf.

Jurish, B.; Geyken, A.; Werneke, T. (2016): DiaCollo. Diachronen Kollokationen auf der Spur. In: DHd 2016. Modellierung - Vernetzung - Visualisierung, Leipzig, 172-175. Verfügbar unter http://www.deutschestextarchiv.de/files/jgw2016diacollo.pdf

Jurish, B.; Maletti, A.; Springmann, U.; Würzner, K.-M. (2016): SIGFSM Workshop on Statistical NLP and Weighted Automata at the 54th Annual Meeting of the Association for Computational Linguistics: Proceedings.

Kamocki, P. (2016): "Allow Mining!" - The Argument for "Orthogonal Uses" of Intellectual Works in the Debate on Text and Data Mining. In: Revue Internationale du Droit d'Auteur 4-85. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5475.

Kamocki, P.; Ketzan, E.; Witt, A. (2016): Lizenzauswahlwerkzeuge für die digitalen Geisteswissenschaften. In: DHd 2016. Modellierung – Vernetzung – Visualisierung. Die Digital Humanities als fächerübergreifendes Forschungsparadigma. Konferenzabstracts. Universität Leipzig 7. bis 12. März 2016. Duisburg: nisaba, 2016 336-337. Verfügbar unter http://www.dhd2016.de/abstracts/posters-023.html.

Kamocki, P.; O'Regan, J. (2016): Privacy Issues in Online Machine Translation Services - European Perspective. In: Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 4458-4462. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5043.

Kamocki, P.; O'Regan, J.; Stauch, M. (2016): "All your data belongs to us". European Perspectives on Privacy Issues in 'Free' Online Machine Translation Services. In: A. e. al. (Ed.), Privacy and Identity Management: Time for a Revolution? 265-280. Springer.

Kamocki, P.; Stranák, P.; Sedlák, M. (2016): The Public License Selector: Making Open Licensing Easier. In: N. Calzolari et al. (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). 2533-2538. ELRA, Portorož, Slovenia. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/880_Paper.pdf.

Kisler, T.; Reichel, U.; Schiel, F.; Draxler, C.; Jackl, B.; Poerner, N. (2016): BAS Speech Science Web Services - an Update of Current Developments. In: 10th International Conference on Language Resources and Evaluation (LREC 2016). 3880-3885, Portorož, Slovenia.

Kupietz, M.; Geyken, A. (2016): Corpus Linguistic Software Tools.  In: Journal for Language Technology and Computational Linguistics (JLCL), 31(1). Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/6218.

Le, D.-T.; Vu, N. T.; Blessing, A. (2016): Towards a text analysis system for political debates. In: LaTeCH@ACL, Berlin: The Association for Computer Linguistics, Berlin. Verfügbar unter http://anthology.aclweb.org/W16-2117.

Lemke, M.; Wiedemann, G. (2016): Text Mining in den Sozialwissenschaften. Grundlagen und Anwendungen zwischen qualitativer und quantitativer Diskursanalyse. Springer VS 2016. Verfügbar unter http://www.springer.com/de/book/9783658072230.

Lüngen, H.; Beißwenger, M.; Ehrhardt, E.; Herold, A.; Storrer, A. (2016): Integrating corpora of computer-mediated communication in CLARIN-D: Results from the curation project ChatCorpus2CLARIN. In: Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016). 156-164. Verfügbar unter https://www.linguistics.rub.de/konvens16/pub/20_konvensproc.pdf.

Martínez Martínez, J. M. (2016): Saarbrücken Cookbook Corpus: a recipe for a diachronic study à la CLARIN-D, Universität des Saarlandes, Deutschland. Verfügbar unter http://chozelinek.github.io/sacoco/2016.

Martínez Martínez, J. M. (2016): Working with Corpora (WwC). Verfügbar unter http://fedora.clarin-d.uni-saarland.de/unserwiki/doku.php?id=training:working_with_corpora.

Martínez Martínez, J. M.; Vela, M. (2016): SubCo: A Learner Translation Corpus of Human and Machine Subtitles. In: LREC 2016, Portorož, Slovenia. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/1244_Paper.pdf.

Poerner, N.; Schiel, F. (2016): An automatic chunk segmentation tool for long transcribed speech recordings. In: Phonetik & Phonologie, Munich.

Schmidt, T. (2016): Good practices in the compilation of FOLK, the Research and Teaching Corpus of Spoken German. In: Compilation, transcription, markup and annotation of spoken corpora. Special Issue of the International Journal of Corpus Linguistics (IJCL), 21(3), 396-418. Verfügbar unter http://www.jbe-platform.com/content/journals/10.1075/ijcl.21.3.05sch.

Schmidt, T.; Wörner, K.; Hedeland, H. (2016): Elemente eines Geschäftsmodells für etablierte wissenschaftliche Software.  In: FORGE 2016,  17-18.

Tiepmar, J.; Eckart, T.; Goldhahn, D.; Kuras, C. (2016): Canonical Text Services in CLARIN - Reaching out to the Digital Classics and beyond. In: CLARIN Annual Conference 2016.

Trippel, T. (2016): CLARIN: Forschungsinfrastruktur für die Geistes- und Sozialwissenschaften.  In: Akademie Aktuell: Zeitschrift der Bayerischen Akademie der Wissenschaften, 1  (56),  89-91. Verfügbar unter http://badw.de/fileadmin/pub/akademieAktuell/2016/56/0116_20_Trippel_V05.pdf.

Trippel, T.; Zinn, C. (2016): Enhancing the Quality of Metadata by using Authority Control. In: 5th Workshop on Linked Data in Linguistics, Portorož, Slovenia.

Westpfahl, S.; Schmidt, T. (2016): FOLK-Gold - A GOLD standard for Part-of-Speech-Tagging of Spoken German. In: Proceedings of the Tenth Conference on International Language Resources and Evaluation (LREC’16), Portorož, Slovenia. 1493-1499. Paris: European Language Resources Association (ELRA). Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5078.

Wiedemann, G. (2016): Text Mining for Qualitative Data Analysis in the Social Sciences.  Springer 2016. Verfügbar unter www.springer.com/de/book/9783658153083.

Zinn, C.; Trippel, T.; Kaminski, S.; Dima, E. (2016): Crosswalking from CMDI to Dublin Core and MARC 21. In: N. Calzolari et al. (Eds.), Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association (ELRA), Portorož, Slovenia. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2016/pdf/543_Paper.pdf.

2015

Bański, P.; Biber, H.; Breiteneder, E.; Kupietz, M.; Lüngen, H.; Witt, A. (2015): The 3rd Workshop on Challenges in the Management of Large Corpora (CMLC-3). Mannheim: Institut für Deutsche Sprache. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2012/workshops/05.CMLC-Proceedings.pdf.

Beißwenger, M.; Ehrhardt, E.; Horbach, A.; Lüngen, H.; Steffen, D.; Storrer, A. (2015): Adding Value to CMC Corpora: CLARINification and Part-of-Speech Annotation of the Dortmund Chat Corpus. In: 2nd Workshop on Natural Language Processing for Computer-Mediated Communication / Social Media (NLP4CMC2015), Essen, Germany. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/4365.

Bingel, J.; Diewald, N. (2015): KoralQuery - a General Corpus Query Protocol. In: Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015. Linköping University Electronic Press, 2015, Linköping, Sweden. Verfügbar unter http://www.ep.liu.se/ecp/111/001/ecp15111001.pdf.

Blessing, A.; Glaser, A.; Kuhn, J. (2015): Biographical Data Exploration as a Test-bed for a Multi-view, Multi-method Approach in the Digital Humanities. In: First Conference on Biographical Data in a Digital World 2015, Amsterdam, The Netherlands, April 9, 2015. 53-60. Verfügbar unter http://ceur-ws.org/Vol-1399/paper9.pdf.

Blessing, A.; Kliche, F.; Heid, U.; Kantner, C.; Kuhn, J. (2015): Computerlinguistische Werkzeuge zur Erschließung und Exploration großer Textsammlungen aus der Perspektive fachspezifischer Theorien.  In: Grenzen und Möglichkeiten der Digital Humanities. Hg. von Constanze Baum / Thomas Stäcker. 2015 (= Sonderband der Zeitschrift für digitale Geisteswissenschaften, 1). Verfügbar unter http://www.zfdg.de/printpdf/128.

Cap, F.; Rösiger, I.; Kuhn, J. (2015): A Pilot Experiment on Exploiting Translations for Literary Studies on Kafka’s “Verwandlung”. In: 4th Workshop on Computational Linguistics for Literature (CLFL) at NAACL, Denver/Colorado, USA, 48. Verfügbar unter http://www.aclweb.org/anthology/W15-0706.

Çöltekin, Ç. (2015a): A Grammar-Book Treebank of Turkish. In: International Workshop on Treebanks and Linguistic Theories (TLT14), Warsaw, Poland. Verfügbar unter http://coltekin.net/cagri/papers/coltekin2015tlt.pdf.

Çöltekin, Ç. (2015b): Turkish nlp web services in the weblicht environment. In: CLARIN Annual Conference, Wroclaw, Poland. Verfügbar unter http://coltekin.net/cagri/papers/coltekin2015clarin.pdf.

De Kok, D. (2015a): Bootstrapping a neural net dependency parser for German using CLARIN resources. In: CLARIN Annual Conference, Wroclaw, Poland.

De Kok, D. (2015b): A Poor Man’s Morphology for German Transition-Based Dependency Parsing. In: International Workshop on Treebanks and Linguistic Theories (TLT14), Warsaw, Poland, 50.

De Kok, D.; Qiu, W.; Hinrichs, M. (2015): WebLicht: Bombardieren bevor die Services explodieren. In: DHd Konferenz, Graz.

Degaetano-Ortlieb, S.; Kermes, H.; Khamis, A.; Knappen, J.; Ordan, N.; Teich, E. (2015): Information Density in Scientific Writing: A Diachronic Perspective. In: "Challenging Boundaries" - 42nd International Systemic Functional Congress (ISFCW2015), RWTH Aachen University.

Degaetano-Ortlieb, S.; Kermes, H.; Khamis, A.; Ordan, N.; Teich, E. (2015): The taming of the data: Using text mining in building a corpus for diachronic analysis. In: Varieng - From Data to Evidence (d2e), University of Helsinki. Verfügbar unter http://www.ashrafkhamis.com/d2e2015.pdf.

Dima, C.; Hinrichs, E. (2015): Automatic Noun Compound Interpretation using Deep Neural Networks and Word Embeddings. In: IWCS 2015 173.

Draxler, C.; Kleiner, S. (2015): A Cross-Database Comparison of two Large German Speech Databases. In: Proceedings of ICPhS, Glasgow, UK.

Eckart, K.; Gärtner, M.; Schweitzer, K. (2015): Leaving Traces - Workflow Tracking in Computational Linguistics. In: DGfS 2015 CL-Postersession.

Eckart, T.; Hellwig, A.; Goosen, T. (2015): Influence of Interface Design on User Behaviour in the VLO. In: CLARIN Annual Conference, Wroclaw, Poland.

Gärtner, M.; Schweitzer, K.; Eckart, K.; Kuhn, J. (2015): Multi-modal Visualization and Search for Text and Prosody Annotations. In: 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2015), Beijing, China, 25.

Geyken, A.; Gloning, T. (2015): A living text archive of 15th-19th-century German. Corpus strategies, technology, organization. In: J. Gippert & R. Gehrke (Eds.), Historical Corpora. Challenges and Perspectives. 165-180. Narr Francke Attempto, Tübingen.

Goldhahn, D.; Eckart, T.; Gloning, T.; Dreßler, K.; Heyer, G. (2015): Operationalisation of Research Questions of the Humanities within the CLARIN Infrastructure – An Ernst Jünger Use Case. In: CLARIN Annual Conference, Wroclaw, Poland.

Gupta, R.; Orasan, C.; Zampieri, M.; Vela, M.; Genabith, J. V. (2015): Can Translation Memories afford not to use paraphrasing? In: 18th Annual Conference of the European Association for Machine Translation (EAMT) 35-42. Verfügbar unter https://pdfs.semanticscholar.org/1894/8166bf1a36cebc2ed385e1f69293085f8a4b.pdf.

Haaf, S.; Thomas, C. (2015): Introducing the DTABf-M: A Manuscript-specific Extension to the DTA >Base Format< (DTABf).  In: Journal of the Text Encoding Initiative (jTEI) 10: 2015 Conference Issue.

Henrich, V. (2015): Word Sense Disambiguation with GermaNet. PhD Thesis, University of Tübingen.

Henrich, V.; Hinrichs, E. (2015): Special Issue: Computational, Cognitive and Linguistic Approaches to the Analysis of Compounds and Collocations.  In: Journal of Cognitive Science, 16  (3). Verfügbar unter http://cogsci.snu.ac.kr/jcs/issue/vol16/no3/01+Introduction+to+the+Special+IssueOK.pdf.

Herzog, G.; Heid, U.; Trippel, T.; Bański, P.; Romary, L.; Schmidt, T.; Witt, A.; Eckart, K. (2015): Recent Initiatives towards New Standards for Language Resources. In: International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2015), Essen, Germany, 154-156. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/4358.

Hinrichs, E. (2015): Digitale Geisteswissenschaften und Informatik – Modelle der Zusammenarbeit. In: DHd Konferenz, Graz, Austria.

Hinrichs, E.; de Kok, D.; Çöltekin, Ç. (2015): Treebank Data and Query Tools for Rare Syntactic Constructions. In: International Workshop on Treebanks and Linguistic Theories (TLT14) 106.

Horbach, A.; Thater, S.; Steffen, D.; Fischer, P. M.; Witt, A.; Pinkal, M. (2015): Internet Corpora: A Challenge for Linguistic Processing.  In: Datenbank-Spektrum, 15  (1),  41-47. Verfügbar unter https://link.springer.com/article/10.1007/s13222-014-0172-z.

Jettka, D.; Hedeland, H.; Lehmberg, T. (2015): Vom digitalen Hausmeister zum international vernetzten Forschungsdatenzentrum. Die Entwicklung der digitalen Infrastruktur des Hamburger Zentrums für Sprachkorpora. In: FORGE 2015, Hamburg.

John, M.; Koch, S.; Heimerl, F.; Müller, A.; Ertl, T.; Kuhn, J. (2015): Interactive Visual Analysis Of German Poetics. In: Global Digital Humanities, Sydney, Australia.

Jügler, J.; Zimmerer, F.; Möbius, B.; Draxler, C. (2015): The effect of high-variability training on the perception and production of French stops by German native speakers. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, 806-810. Verfügbar unter http://www.isca-speech.org/archive/interspeech_2015/i15_0806.html.

Jurish, B. (2015): DiaCollo: On the trail of diachronic collocations. In: CLARIN Annual Conference, Wroclaw, Poland, 28-31. Verfügbar unter http://www.deutschestextarchiv.de/files/jurish2015diacollo-clarin.pdf.

Jurish, B.; Ast, H. (2015): Using an Alignment-based Lexicon for Canonicalization of Historical Text. In: J. Gippert & R. Gehrke (Eds.), Historical Corpora. Challenges and Perspectives. 197-208. Narr Francke Attempto, Tübingen, Germany. Verfügbar unter https://www.semanticscholar.org/paper/Using-an-Alignment-based-Lexicon-for-Canonicalizat-Jurish-Ast/b4a4d525bd4faa3dd0c82210e720686833df20fa.

Kamocki, P.; Ketzan, E. (2015): "What's new in Creative Commons 4.0?". In: CLARIN Legal Issues Committee (CLIC), White Paper Series.

Khamis, A.; Degaetano-Ortlieb, S.; Kermes, H.; Knappen, J.; Ordan, N.; Teich, E. (2015): A resource for the diachronic study of scientific English: Introducing the Royal Society Corpus. In: Corpus Linguistics 2015, Lancaster. Verfügbar unter http://www.ashrafkhamis.com/cl2015.pdf.

Kisler, T.; Schiel, F.; Reichel, U. D.; Draxler, C. (2015): Phonetic/linguistic web services at BAS. In: Interspeech-2015, Dresden, Germany, 2609-2610. Verfügbar unter https://epub.ub.uni-muenchen.de/25255/1/KSRDIS2015.pdf.

Kuhn, J.; Reiter, N. (2015): A Plea for a Method-driven Agenda in the Digital Humanities. In: Global Digital Humanities, Sydney, Australia.

Kunz, K.; Lapshinova-Koltunski, E.; Martínez Martínez, J. M.; Menzel, K.; Steiner, E. (2015a): Lexical cohesion in a contrastive perspective (English - German). In: ICAME36, Trier, Germany.

Kunz, K.; Lapshinova-Koltunski, E.; Martínez Martínez, J. M.; Menzel, K.; Steiner, E. (2015b): Lexical cohesion in a contrastive perspective (English - German): some instantial patterns. In: 42nd International Systemic Functional Congress, RWTH Aachen University, Germany.

Kunz, K. A.; Lapshinova-Koltunski, E.; Martínez Martínez, J. M.; Menzel, K.; Steiner, E. (2015): GECCo: Corpus to Analyse German-English Contrasts in Cohesion. In: 1st Action Conference. Textlink: Structuring Discourse in Multilingual Europe, Louvain, Belgium. Verfügbar unter http://www.gecco.uni-saarland.de/GECCo/files/GECCo_Korpus_Overview.pdf.

Kupietz, M. (2015): Constructing a Corpus. In: P. Durkin (Ed.), The Oxford Handbook of Lexicography. 62-75. Oxford University Press, Oxford, UK.

Kupietz, M.; Schmidt, T. (2015): Schriftliche und mündliche Korpora am IDS als Grundlage für die empirische Forschung. In: L. M. Eichinger (Ed.), Sprachwissenschaft im Fokus. Positionsbestimmungen und Perspektiven (= Jahrbuch des Instituts für Deutsche Sprache 2014). 297-322. de Gruyter, Berlin/Boston. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/3482.

Lapshinova-Koltunski, E.; Vela, M. (2015): Measuring "Registerness" in Human and Machine Translation: A Text Classification Approach. In: EMNLP 2015 Workshop on Discourse in Machine Translation, Lisbon, Portugal. Verfügbar unter http://aclweb.org/anthology/W15-2517.

Lehmberg, T. (2015): Wissenstransfer und Wissensressourcen: Support und Helpdesk in den Digital Humanities. In: FORGE 2015, Hamburg, Germany.

Lemke, M.; Niekler, A.; Schaal, G. S.; Wiedemann, G. (2015): Content Analysis between Quality and Quantity. Fulfilling Blended-Reading Requirements for the Social Sciences with a Scalable Text Mining Infrastructure. Springer Berlin Heidelberg 2015. Verfügbar unter https://link.springer.com/article/10.1007/s13222-014-0174-x?sa_campaign=email/event/articleAuthor/onlineFirst.

Lemnitzer, L.; Geyken, A. (2015): Semantic Modeling of Collocations for Lexicographic Purposes. In: Journal of Cognitive Science, 16 (3), 200-223. Verfügbar unter http://cogsci.snu.ac.kr/jcs/index.php/issues/?uid=187&mod=document.

Lyse, G. I.; Meurer, P.; De Smedt, K. (2015): COMEDI: A component metadata editor. In: Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014 (8), Soesterberg, The Netherlands, 82-98. Verfügbar unter https://www.clarin.eu/sites/default/files/cac2014_submission_13_0.pdf.

Mahlow, C.; Piotrowski, M. (2015): Systems and Frameworks for Computational Morphology - Fourth International Workshop, SFCM 2015, Proceedings. Springer International Publishing, Cham/Berlin/Heidelberg.

Müller, A.; John, M.; Koch, S.; Ertl, T.; Kuhn, J. (2015): Erkennung und Visualisierung attribuierter Phrasen in Poetiken. In: Digital Humanities im deutschsprachigen Raum 2015.

Pal, S.; Vela, M.; Naskar, S. K.; van Genabith, J. (2015): USAAR-SAPE: An English--Spanish Statistical Automatic Post-Editing System. In: 10th Workshop on Statistical Machine Translation (WMT), Lisbon, Portugal. Verfügbar unter https://pdfs.semanticscholar.org/3261/a73cbae4fe25bd067febb6f76f05c35d10bb.pdf.

Quasthoff, U.; Goldhahn, D. (2015): Was sind IT-basierte Forschungsinfrastrukturen für die Geistes- und Sozialwissenschaft und wie können sie genutzt werden? In: Information - Wissenschaft & Praxis. Verfügbar unter https://www.degruyter.com/view/j/iwp.2015.66.issue-5-6/iwp-2015-0054/iwp-2015-0054.xml.

Quasthoff, U.; Goldhahn, D.; Eckart, T. (2015): Building Large Resources for Text Mining: The Leipzig Corpora Collection. In: Text Mining - From Ontology Learning to Automated Text Processing Applications. Springer.

Reiter, N.; Kuhn, J. (2015): Annotation as a Pivot in DH Experiments. In: Digital Humanities Experiments #DHIHA6, Open Peer Review-Blog. Verfügbar unter https://medium.com/@nilsreiter/experiments-in-digital-humanities-f0e225d26fe2.

Sandberg, M.; Baxter, R.; Lecarpentier, D.; Kamocki, P. (2015): Open Data - What Do Research Communities Really Think About it? In: ERCIM (The European Research Consortium for Informatics and Mathematics) News, 2015 (100), 19-20.

Scarton, C.; Zampieri, M.; Vela, M.; van Genabith, J.; Specia, L. (2015): Searching for Context: a Study on Document-Level Labels for Translation Quality Estimation. In: 18th Annual Conference of the European Association for Machine Translation (EAMT), Antalya, Turkey.

Schiel, F. (2015): A statistical model for predicting pronunciation. In: International Conference on Phonetic Sciences, Glasgow, United Kingdom.

Schröck, J.; Lüngen, H. (2015): Building and Annotating a Corpus of German-Language Newsgroups. In: M. Beißwenger & T. Zesch (Eds.), NLP4CMC 2015. 2nd Workshop on Natural Language Processing for Computer-Mediated Communication / Social Media. Proceedings of the Workshop. 17-22. German Society for Computational Linguistics & Language Technology (GSCL) 2015, University of Duisburg-Essen, Campus Essen. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/4364.

Schweitzer, K.; Gärtner, M.; Riester, A.; Rösiger, I.; Eckart, K.; Kuhn, J.; Dogil, G. (2015): Analysing automatic descriptions of intonation with ICARUS. In: Sixteenth Annual Conference of the International Speech Communication Association.

Stührenberg, M.; Diewald, N.; Gleim, R. (2015): Considerations for a linguistic network markup language. In: A. Mehler et al. (Eds.), Towards a theoretical framework for analyzing complex linguistic networks. (= Understanding Complex Systems). 299-330. Springer, 2015, Heidelberg.

Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E. (2015): The linguistic construal of disciplinarity: A data-mining approach using register features. In: Journal of the Association for Information Science and Technology. DOI 10.1002/asi.23457.

Telljohann, H.; Hinrichs, E. W.; Kübler, S.; Zinsmeister, H.; Beck, K. (2015): Stylebook for the Tübingen treebank of written German (TüBa-D/Z). Verfügbar unter http://www.sfs.uni-tuebingen.de/resources/tuebadz-stylebook-1201.pdf.

Thomas, C.; Wiegand, F. (2015): Making great work even better: Appraisal and Digital Curation of widely dispersed Electronic Textual Resources (c. 15th–19th cent.) in CLARIN-D. In: J. Gippert & R. Gehrke (Eds.), Historical Corpora. Challenges and Perspectives. Narr Francke Attempto, Tübingen. Verfügbar unter https://edoc.bbaw.de/opus4-bbaw/frontdoor/index/index/docId/2005.

Trippel, T.; Zinn, C. (2015a): Describing Research Data with CMDI. In: B. Lust et al. (Eds.), Proceedings of the Workshop: Development of Linguistic Linked open Data Resources for Collaborative Data-Intensive Research in the Language Sciences, University of Chicago.

Trippel, T.; Zinn, C. (2015b): DMPTY - A Wizard For Generating Data Management Plans. In: CLARIN Annual Conference, Wroclaw, Poland. Verfügbar unter http://www.ep.liu.se/ecp/123/006/ecp15123006.pdf.

Trouvain, J.; Möbius, B. (2015): BAS Resources for Phonetic Learner Corpora. In, Glasgow. Verfügbar unter http://www.ifcasl.org/docs/Draxler_final.pdf.

Vela, M.; Lapshinova-Koltunski, E. (2015): A Classification Approach to Register-based Machine Translation Evaluation. In: 15th MT Summit, Miami, Florida.

Vela, M.; Tan, L. (2015): Predicting Machine Translation Adequacy with Document Embeddings. In: 10th Workshop on Statistical Machine Translation (WMT), Lisbon, Portugal, 382-390. Verfügbar unter http://www.statmt.org/wmt15/pdf/WMT51.pdf.

Vela, M.; van Genabith, J. (2015): Re-assessing the WMT2013 Human Evaluation with Professional Translators Trainees? In: 18th Annual Conference of the European Association for Machine Translation (EAMT), Antalya, Turkey. Verfügbar unter https://pdfs.semanticscholar.org/2912/003e5b7db644ec50f7e289e529bce0c98be0.pdf?_ga=2.104353600.1121109587.1509626851-481376915.1509021444.

Wagner-Nagy, B.; Hedeland, H.; Lehmberg, T.; Rießler, M. (2015): INEL – ein Infrastrukturvorhaben zur Dokumentation indigener nordeurasischer Sprachen. In: FORGE 2015, Hamburg, Germany.

Würzner, K.-M.; Jurish, B. (2015): Dsolve. Morphological segmentation for German using conditional random fields. In: C. Mahlow & M. Piotrowski (Eds.): Fourth International Workshop on Systems and Frameworks for Computational Morphology (SFCM), Stuttgart, 94-103. Verfügbar unter http://www.deutschestextarchiv.de/files/wj2015dsolve-draft.pdf.

Würzner, K.-M.; Jurish, B. (2015): Gramophone. A hybrid approach to grapheme-phoneme conversion. In: 12th International Conference on Finite State Methods and Natural Language Processing (FSMNLP), Düsseldorf. Verfügbar unter http://fsmnlp2015.phil.hhu.de/wp-content/uploads/2015/06/wuerzner_jurish-grapheme_phoneme.pdf.

Zielke, D.; Jettka, D. (2015): Towards depositor-controlled data ingest in digital repositories for linguistic corpora: automation of metadata processing. In: CLARIN Annual Conference, Wroclaw, Poland.

2014

Bański, P.; Diewald, N.; Hanl, M.; Kupietz, M.; Witt, A. (2014): Access Control by Query Rewriting. The Case of KorAP. In: Proceedings of the Ninth Conference on International Language Resources and Evaluation (LREC'14). 3817-3822. European Language Resources Association (ELRA).

Bański, P.; Frick, E.; Hanl, M.; Kupietz, M.; Schnober, C.; Witt, A. (2014): Robust corpus architecture: a new look at virtual collections and data access. In: Corpus Linguistics 2013. Abstract Book. 23-25. Lancaster: UCREL, Lancaster, UK.

Beißwenger, M.; Lüngen, H.; Margaretha, E.; Pölitz, C. (2014): Mining corpora of computer-mediated communication: Analysis of linguistic features in Wikipedia talk pages using Machine Learning methods. In: Proceedings of KONVENS 2014 (Konferenz zur Verarbeitung natürlicher Sprachen). 42-47. Universität Hildesheim 2014, Hildesheim.

Beňuš, Š.; Reichel, U. D.; Mády, K. (2014): Modelling accentual phrase intonation in Slovak and Hungarian. In: Speech Prosody 7, Dublin, Republic of Ireland.

Berkling, K.; Reichel, U. D.; Lavalley, R. (2014): Untersuchung der Eignung von Fibeln für einen systematischen Schriftspracherwerb. In: GEBF Tagung - Die Perspektiven verbinden, Frankfurt a.M., Germany, 163.

Beurskens, M.; Kamocki, P.; Ketzan, E. (2014): Implied Consent: A Silent Revolution in Digital Copyright Law. U.S., German and French Perspectives. In: Revue Internationale du Droit d'Auteur. 2-108. Verfügbar unter http://www.la-rida.com/en/.

Björkelund, A.; Eckart, K.; Riester, A.; Schauffler, N.; Schweitzer, K. (2014): The extended DIRNDL corpus as a resource for coreference and bridging resolution. In: Ninth International Conference on Language Resources and Evaluation (LREC 2014). Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/pdf/891_Paper.pdf.

Björkelund, A.; Kuhn, J. (2014): Learning Structured Perceptrons for Coreference Resolution with Latent Antecedents and Non-local Features. In: Association for Computational Linguistics (Ed.): 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, Maryland, 47-57. Verfügbar unter https://aclweb.org/anthology/P/P14/P14-1005.pdf.

Blessing, A. (2014): Information Extraction for the Geospatial Domain. Stuttgart, Germany. Verfügbar unter http://elib.uni-stuttgart.de/opus/volltexte/2014/9522/.

Blessing, A.; Kuhn, J. (2014): Textual Emigration Analysis (TEA). In: Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/pdf/1009_Paper.pdf.

Blumtritt, J.; Elbers, W.; Goosen, T.; Sallé, M.; Windhouwer, M. (2014): User Delegation in the CLARIN Infrastructure. In: CLARIN Annual Conference, Soesterberg, The Netherlands. Verfügbar unter http://www.clarin.eu/sites/default/files/cac2014_submission_7_0.pdf.

Boehlke, V. (2014): „CLARINifizierung“ von Ressourcen – Anforderungen, Beispiele und Erfahrungen. In: DTA- & CLARIN-D-Konferenz und Workshop: Textkorpora in Infrastrukturen für die Geistes- und Sozialwissenschaften BBAW, Berlin, Germany.

Bosch, S.; Eckart, K.; Faaß, G.; Heid, U.; Lee, K.; Pareja-Lora, A.; Pretorius, L.; Romary, L.; Witt, A.; Zeldes, A.; Zipser, F. (2014): From <tiger2/> to ISOTiger - Community Driven Developments for Syntax Annotation in SynAF. In: Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT13), Tübingen, Germany, 258-264. Verfügbar unter http://tlt13.sfs.uni-tuebingen.de/tlt13-proceedings.pdf.

De Kok, D. (2014): Build your own Treebank. In: CLARIN Annual Conference. Verfügbar unter http://www.clarin.eu/sites/default/files/cac2014_submission_8_0.pdf.

Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.; Ordan, N.; Teich, E. (2014): Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers. In: 9th Edition of the Language Resources and Evaluation Conference LREC 2014, Reykjavik, Iceland. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/pdf/291_Paper.pdf.

Degaetano-Ortlieb, S.; Teich, E. (2014): Register diversification in evaluative language: The case of scientific writing. In: G. Thompson & L. Alba-Juez (Eds.), Evaluation in Context. 241–258. John Benjamins Publishing Company, Amsterdam, The Netherlands.

Deppermann, A.; Schmidt, T. (2014): Gesprächsdatenbanken als methodisches Instrument der Interaktionalen Linguistik - Eine exemplarische Untersuchung auf Basis des Korpus FOLK in der Datenbank für Gesprochenes Deutsch (DGD2). In: Korpora in der Linguistik - Perspektiven und Positionen zu Daten und Datenerhebung [= Mitteilungen des Deutschen Germanistenverbandes 1/2014], 1/2014, 4-17. Verfügbar unter https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/2222.

Drude, S.; Broeder, D.; Trilsbeek, P. (2014): The Language Archive and its solutions for sustainable endangered languages corpora.

Drude, S.; Trilsbeek, P.; Sloetjes, H.; Broeder, D. (2014): Best practices in the creation, archiving and dissemination of speech corpora at the Language Archive. In: Ş. Ruhi et al. (Eds.), Best Practices for Spoken Corpora in Linguistic Research. 183-207. Newcastle upon Tyne: Cambridge Scholars Publishing. Verfügbar unter http://pubman.mpdl.mpg.de/pubman/item/escidoc:2051339:4/component/escidoc:2051340/Chapter11_DRUDEetal.pdf.

Ďurčo, M.; Windhouwer, M. (2014): From CLARIN Component Metadata to Linked Open Data. In: LREC, Reykjavik, Iceland. Verfügbar unter https://pure.knaw.nl/portal/files/779607/CMD2RDF.pdf.

Eckart de Castilho, R.; Biemann, C.; Gurevych, I.; Yimam, S. M. (2014): WebAnno: a flexible, web-based annotation tool for CLARIN. In: Proceedings of the CLARIN Annual Conference (CAC) 2014, Soesterberg, The Netherlands. Verfügbar unter http://www.aclweb.org/anthology/P13-4001.pdf.

Eckart, K.; Heid, U. (2014): Resource interoperability revisited. In: 12. Konferenz zur Verarbeitung natürlicher Sprache, KONVENS 2014, Hildesheim, Germany, 116-126. Verfügbar unter http://www.uni-hildesheim.de/konvens2014/data/konvens2014-conference-proceedings.pdf.

Fankhauser, P.; Kermes, H.; Teich, E. (2014): Combining Macro- and Microanalysis for Exploring the Construal of Scientific Disciplinarity. In: Digital Humanities, Lausanne, Switzerland. Verfügbar unter http://www.informatik.uni-leipzig.de/~stjaenicke/Survey.pdf.

Fankhauser, P.; Knappen, J.; Teich, E. (2014): Exploring and Visualizing Variation in Language Resources. In: N. Calzolari et al. (Eds.): Ninth International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik, Iceland. Verfügbar unter http://lrec2014.lrec-conf.org/media/filer_public/2014/06/17/conference_book_of_abstracts_lrec2014_upd.pdf.

Feiser, H. S.; Draxler, C. (2014): Perceptual voice similarity of related speakers: telephone and microphone recordings. In: 23rd Annual Conference of the International Association for Forensic Phonetics and Acoustics, IAFPA 2014, Zurich, Switzerland, 2-9.

Gärtner, M.; Björkelund, A.; Thiele, G.; Seeker, W.; Kuhn, J. (2014): Visualization, Search, and Error Analysis for Coreference Annotations. In: 52nd Annual Meeting of the Association for Computational Linguistics - System Demonstrations, Association for Computational Linguistics, Baltimore, Maryland, US, 7-12. Verfügbar unter https://aclweb.org/anthology/P/P14/P14-5002.pdf.

Goosen, T.; Eckart, T. (2014): Virtual Language Observatory 3.0: What’s New? In: CLARIN Annual Conference 2014, Soesterberg, The Netherlands. Verfügbar unter http://www.clarin.nl/sites/default/files/CLARIN Annual Report 2014 FINAL.pdf.

Goosen, T.; Windhouwer, M.; Ohren, O.; Herold, A.; Eckart, T.; Ďurčo, M.; Schonefeld, O. (2014): CMDI 1.2: Improvements in the CLARIN Component Metadata Infrastructure. In: CLARIN Annual Conference, Soesterberg, The Netherlands. Verfügbar unter https://www.clarin.eu/sites/default/files/cac2014_submission_5_0.pdf.

Haaf, S.; Fankhauser, P.; Trippel, T.; Eckart, K.; Eckart, T.; Hedeland, H.; Herold, A.; Knappen, J.; Schiel, F.; Stegmann, J.; Uytvanck, D. V. (2014): CLARIN's Virtual Language Observatory (VLO) under scrutiny -- The VLO taskforce of the CLARIN-D centres. In: CLARIN Annual Conference, Soesterberg, Netherlands. Verfügbar unter http://www.clarin.eu/sites/default/files/cac2014_submission_30_0.pdf.

Haaf, S.; Schulz, M. (2014): Historical Newspapers & Journals for the DTA. In: Language Resources and Technologies for Processing and Linking Historical Documents and Archives – Deploying Linked Open Data in Cultural Heritage – LRT4HDA, Reykjavik, Iceland, 50-54. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/workshops/LREC2014Workshop-LRT4HDA Proceedings.pdf - page=57.

Hedeland, H.; Jettka, D.; Lehmberg, T. (2014): Vernetzung statt Vereinheitlichung – Digitale Forschungsinfrastrukturen in den Geisteswissenschaften. In: b.i.t.online - Zeitschrift für Bibliothek, Information und Technologie, 17 (5), 446-455. Verfügbar unter http://www.b-i-t-online.de/heft/2014-05/fachbeitrag-hedeland.pdf.

Heinrich, C.; Schiel, F. (2014): The influence of alcoholic intoxication on the short-time energy function of speech. In: Journal of the Acoustical Society of America, 135 (5), 2942-2951. Verfügbar unter https://www.phonetik.uni-muenchen.de/forschung/publikationen/JASA_HeinrichSchiel_2014.pdf.

Henrich, V.; Hinrichs, E. (2014a): Consistency of Manual Sense Annotation and Integration into the TüBa-D/Z Treebank. In: 13th International Workshop on Treebanks and Linguistic Theories, Tübingen, Germany. Verfügbar unter http://tlt13.sfs.uni-tuebingen.de/tlt13-proceedings.pdf.

Henrich, V.; Hinrichs, E. (2014b): Proceedings of the Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations. University of Tübingen, Department of Linguistics. Verfügbar unter http://www.sfs.uni-tuebingen.de/~vhenrich/cclcc_2014/CCLCC_2014_workshop_proceedings.pdf.

Henrich, V.; Hinrichs, E.; Barkey, R. (2014): Aligning Word Senses in GermaNet and the DWDS Dictionary of the German Language. In: Proceedings of the 7th Global WordNet Conference (GWC 2014). 63-70, Tartu, Estonia. Verfügbar unter http://www.sfs.uni-tuebingen.de/GermaNet/documents/publications/Henrich-et-al_GermaNet-DWDS-Mapping_GWC2014.pdf.

Henrich, V.; Hinrichs, E.; Vodolazova, T. (2014): Aligning GermaNet Senses with Wiktionary Sense Definitions. In: Z. Vetulani & J. Mariani (Eds.), Human Language Technology: Challenges for Computer Science and Linguistics. 329-342. Verfügbar unter http://link.springer.com/chapter/10.1007/978-3-319-08958-4_27.

Heyer, G.; Niekler, A.; Wiedemann, G. (2014): Brauchen die Digital Humanities eine eigene Methodologie? Überlegungen zur systematischen Nutzung von Text Mining Verfahren in einem politikwissenschaftlichen Projekt. In: Jahrestagung der Digital Humanities im deutschsprachigen Raum (DHd 2014). Verfügbar unter http://asv.informatik.uni-leipzig.de/publication/file/255/Heyer-Brauchen_die_Digital_Humanities_eine_eigene_Methodologie__berlegungen-1451050.pdf.

Hinrichs, E. (2014): Auxiliary Fronting in German Subordinate Clauses. A Synchronic and Diachronic Corpus Study Using the CLARIN Research Infrastructure. In: CLARIN Annual Conference, Soesterberg, The Netherlands.

Hinrichs, E.; Kisler, T.; Trippel, T. (2014): Research infrastructures and resources for processing natural language. In: KONVENS, Hildesheim, Germany. Special session.

Hinrichs, E.; Krauwer, S. (2014a): The CLARIN Research Infrastructure: Resources and Tools for E-Humanities Scholars. In: N. Calzolari et al. (Eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). 1525–1531. ELRA, Reykjavík, Iceland. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/pdf/415_Paper.pdf.

Hinrichs, E.; Krauwer, S. (2014b): CLARIN: Resources, Tools, and Services for Digital Humanities Research. In: Digital Humanities, Lausanne, Switzerland.

Jettka, D.; Stein, D. (2014): The HZSK Repository: Implementation, Features, and Use Cases of a Repository for Spoken Language Corpora. In: D-Lib Magazine, 20 (9/10).

Jurish, B.; Thomas, C.; Wiegand, F. (2014): Querying the Deutsches Textarchiv. In: U. Kruschwitz et al. (Eds.), Proceedings of the Workshop MindTheGap 2014: Beyond Single-Shot Text Queries: Bridging the Gap(s) between Research Communities. 25-30. Verfügbar unter http://ceur-ws.org/Vol-1131/mindthegap14_7.pdf.

Kamocki, P. (2014): The Liability of Service Providers in e-Research Infrastructures: Killing the Messenger? In: N. Calzolari et al. (Eds.), The Ninth International Conference on Language Resources and Evaluation (LREC'14). 4220-4224. European Language Resources Association (ELRA), Reykjavik, Iceland. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/summaries/202.html.

Kamocki, P.; Ketzan, E. (2014): Creative Commons and Language Resources: General Issues and What's New in CC 4.0. In: CLARIN Legal Issues Committee (CLIC) White Paper Series.

Kermes, H.; Knappen, J.; Martinez, J. M. M.; Teich, E.; Vela, M. (2014): TeLeMaCo - A tool for the dissemination of teaching and learning materials. In: CLARIN Annual Conference, Soesterberg, Netherlands.

Kessler, W.; Kuhn, J. (2014): Detecting Comparative Sentiment Expressions - A Case Study in Annotation Design Decisions. In: KONVENS 2014, Hildesheim, Germany. Verfügbar unter http://hildok.bsz-bw.de/frontdoor/index/index/year/2014/docId/258.

Kuhn, J. (2014): The Challenge for the Computational Sciences in Digital Humanities: Establishing a Common Meta-Methodological Framework. In: DHd Workshop "Informatik und die Digital Humanities", Leipzig, Germany.

Kunz, K.; Teich, E.; Hansen-Schirra, S.; Neumann, S.; Daut, P. (2014): Caught in the Middle - Language Use and Translation: A Festschrift for Erich Steiner on the Occasion of his 60th Birthday. Universitätsverlag des Saarlandes, Saarbrücken, Germany. Verfügbar unter http://universaar.uni-saarland.de/monographien/volltexte/2014/122/pdf/Kunz_etal_Festschrift_Steiner.pdf.

Kupietz, M. (2014): Der Programmbereich Korpuslinguistik am IDS: Gegenwart und Zukunft. In: Institut für Deutsche Sprache (Ed.), Ansichten und Einsichten. 50 Jahre Institut für Deutsche Sprache. 320-328. Institut für Deutsche Sprache Mannheim, Mannheim.

Kupietz, M.; Belica, C.; Lüngen, H.; Perkuhn, R. (2014): Zwischen Empirie und Ästhetik - Ansätze zur korpuslinguistischen Untersuchung und Bewertung von Sprachwandel. In: A. Plewnia & A. Witt (Eds.), Sprachverfall? Dynamik - Wandel - Variation (= Jahrbuch des Instituts für Deutsche Sprache 2013). 149-169. de Gruyter 2014, Berlin/Boston.

Kupietz, M.; Biber, H.; Lüngen, H.; Bański, P.; Breiteneder, E.; Mörth, K.; Witt, A.; Takhsha, J. (2014): LREC 2014 Workshop Challenges in the Management of Large Corpora (CMLC2). European Language Resources Association (ELRA).

Kupietz, M.; Lüngen, H. (2014): Recent Developments in DeReKo. In: N. Calzolari (Ed.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). 2378-2385. European Language Resources Association (ELRA) 2014.

Kupietz, M.; Lüngen, H.; Bański, P.; Belica, C. (2014): Maximizing the Potential of Very Large Corpora. In: Proceedings of the LREC-2014-Workshop Challenges in the Management of Large Corpora (CMLC2). European Language Resources Association (ELRA).

Kupietz, M.; Lüngen, H.; Mair, C. (2014): Korpuslinguistik und interdisziplinäre Perspektiven auf Sprache (CLIP).

Lüngen, H.; Kupietz, M. (2014): Das Deutsche Referenzkorpus DeReKo im Jubiläumsjahr 2014. In: Sprachreport 2/2014. 24-26. Institut für Deutsche Sprache, Mannheim.

Mády, K.; Reichel, U. D.; Beňuš, Š. (2014): Accentual phrases in Slovak and Hungarian. In: Campbell et al. (Eds.): 7th International Conference on Speech Prosody, Dublin, Ireland, 752-756. Verfügbar unter https://www.phonetik.uni-muenchen.de/~reichelu/publications/MadyReichelBenusSP2014.pdf.

Mahlow, C. (2014): Verbal and Nominal Components of German and Spanish Phrasemes. In: Workshop on MWEs, Frankfurt am Main, Germany.

Mahlow, C.; Eckart, K.; Stegmann, J.; Blessing, A.; Thiele, G.; Gärtner, M.; Kuhn, J. (2014): Resources, Tools, and Applications at the CLARIN Center Stuttgart. In: 12th Konferenz zur Verarbeitung natürlicher Sprache (KONVENS 2014), 11-21.

Margaretha, E.; Lüngen, H. (2014): Building Linguistic Corpora from Wikipedia Articles and Discussions. In: Building and Annotating Corpora of Computer-mediated Communication: Issues and Challenges at the Interface between Computational and Corpus Linguistics. Journal for Language Technology and Computational Linguistics (JLCL), 29 (2), 59-82.

Martens, S.; Passarotti, M. (2014): Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D. In: N. Calzolari et al. (Eds.): Ninth International Conference on Language Resources and Evaluation (LREC-2014), Reykjavík, Iceland. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2014/pdf/70_Paper.pdf.

Mixdorff, H.; Hönemann, A.; Niebuhr, O.; Draxler, C. (2014): Perceived Prominence Reflected by Imitations of Words with and without F0 Continuity. In: Proc. of 7th Speech Prosody, Dublin, Ireland.

Neumeyer, V.; Schiel, F.; Hoole, P. (2014): Speech of cochlear implant patients: An acoustic analysis of vowel production. In: 15th ICPLA Conference (International Clinical Linguistics and Phonetics Association), Stockholm, Sweden, 97. Verfügbar unter http://phonetik.uni-muenchen.de/forschung/publikationen/AbstractICPLA2014.pdf.

Neumeyer, V.; Schiel, F.; Hoole, P.; Schuster, M.; Zirn, S. (2014): Acoustic analysis of vowel production during the first year after cochlear implantation. In: 13th International Conference on Cochlear Implants, Munich, Germany. Verfügbar unter http://phonetik.uni-muenchen.de/forschung/publikationen/CI2014Abstract.pdf.

Pfefferkorn, O.; Fankhauser, P. (2014): On the Role of Historical Newspapers in Disseminating Foreign Words in German. In: Ninth conference on International Language Resources and Evaluation (LREC’14), Reykjavik, Iceland.

Reichel, U. D. (2014): Linking bottom-up intonation stylization to discourse structure. In: Computer, Speech, and Language, 28, 1340-1365.

Reichel, U. D.; Kisler, T. (2014): Language-independent grapheme-phoneme conversion and word stress assignment as a web service. In: Elektronische Sprachverarbeitung 2014, 42-49.

Reichel, U. D.; Mády, K. (2014): Comparing parameterizations of pitch register and its discontinuities at prosodic boundaries for Hungarian. In: Proceedings of Interspeech 2014.

Reichel, U. D.; Markó, A.; Mády, K. (2014): Parameterization and automatic labeling of Hungarian intonation. In: Proceedings Speech Prosody. 738-742.

Ruhi, Ş.; Haugh, M.; Schmidt, T.; Wörner, K. (2014): Best Practices for Spoken Corpora in Linguistic Research. Cambridge Scholars Publishing, Newcastle, UK.

Schaal, G. S.; Kath, R. (2014): Zeit für einen Paradigmenwechsel in der politischen Theorie? In: A. Brodocz et al. (Eds.), Die Verfassung des Politischen: Festschrift für Hans Vorländer. 331-350. Springer Fachmedien Wiesbaden, Wiesbaden, Germany. DOI 10.1007/978-3-658-04784-9_20.

Schiel, F. (2014): CLARIN: Prosodic/Gesture Corpora - Existing Resources, Best Practice and Formats. In: PAGE Workshop 2014, Ludwig-Maximilians-Universität München, Germany.

Schiel, F.; Kisler, T. (2014): German Alcohol Language Corpus - the Question of Dialect. In: Ninth International Conference on Language Resources and Evaluation (LREC'14), 353-356.

Schmidt, T. (2014a): The Database for Spoken German - DGD2. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland: European Language Resources Association (ELRA).

Schmidt, T. (2014b): Gesprächskorpora und Gesprächsdatenbanken am Beispiel von FOLK und DGD. In: Gesprächsforschung - Online-Zeitschrift zur verbalen Interaktion 15, 169-233.

Schmidt, T. (2014c): The Research and Teaching Corpus of Spoken German - FOLK. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland: European Language Resources Association (ELRA).

Schonefeld, O.; Witt, A. (2014): Forschungsinfrastrukturen am IDS: Gegenwart und Zukunft. In: Institut für Deutsche Sprache (Ed.), Ansichten und Einsichten. 50 Jahre Institut für Deutsche Sprache. 329-336. Institut für Deutsche Sprache, 2014, Mannheim.

Schuller, B.; Steidl, S.; Batliner, A.; Schiel, F.; Krajewski, J. (2014): Introduction to the Special Issue on Broadening the View on Speaker Analysis. In: Computer, Speech and Language, 28 (2), 343-345.

Schuller, B.; Steidl, S.; Batliner, A.; Schiel, F.; Krajewski, J.; Weninger, F.; Eyben, F. (2014): Medium-term speaker states—A review on intoxication, sleepiness and the first challenge. In: Computer, Speech and Language, 346-374.

Sorokin, D.; Dima, C.; Hinrichs, E. (2014): Classification of Semantic Relations in German Nominal Compounds Using SVMs. In: V. Henrich & E. Hinrichs (Eds.): ESSLLI 2014 Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations, Tübingen, Germany. Verfügbar unter http://www.sfs.uni-tuebingen.de/~vhenrich/cclcc_2014/CCLCC_2014_workshop_proceedings.pdf.

Stift, U.-M.; Schmidt, T. (2014): Mündliche Korpora am IDS: Vom Deutschen Spracharchiv zur Datenbank für Gesprochenes Deutsch. In: Institut für Deutsche Sprache (Ed.), Ansichten und Einsichten. 50 Jahre Institut für Deutsche Sprache. 360-375. Institut für Deutsche Sprache, 2014, Mannheim.

Strunk, J.; Schiel, F.; Seifart, F. (2014): Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS. In: Ninth International Conference on Language Resources and Evaluation (LREC'14), 3940-3947.

Tan, L.; Schumann, A.-K.; Martínez, J. M. M.; Bond, F. (2014): Sensible: L2 translation assistance by emulating the manual post-editing process. In: 8th International Workshop on Semantic Evaluation (SemEval 2014), Dublin, Ireland, 541–545. Verfügbar unter http://www.aclweb.org/anthology/S14-2094.

Thiele, G.; Seeker, W.; Gärtner, M.; Björkelund, A.; Kuhn, J. (2014): A Graphical Interface for Automatic Error Mining in Corpora. In: 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden, 57-60.

Vela, M.; Schumann, A.-K.; Wurm, A. (2014a): Beyond Linguistic Equivalence. An Empirical Study of Translation Evaluation in a Translation Learner Corpus. In: Workshop on Humans and Computer-assisted Translation (HaCaT 2014).

Vela, M.; Schumann, A.-K.; Wurm, A. (2014b): Human Translation Evaluation and its Coverage by Automatic Scores. In: LREC Workshop on Automatic and Manual Metrics for Operational Translation Evaluation (MTE), Reykjavik, Iceland.

Wälischmiller, L.; Schiel, F.; Hoole, P.; Neumeyer, V.; Zirn, S. (2014): Intonation of speakers with cochlear implant. In: CI 2014, Munich, Germany.

Wright, S. E.; Windhouwer, M.; Schuurman, I.; Broeder, D. (2014): Segueing from a Data Category Registry to a Data Concept Registry. In: 11th International Conference on Terminology and Knowledge Engineering (TKE 2014), Berlin, Germany. Verfügbar unter https://hal.archives-ouvertes.fr/hal-01005840/document.

Wynne, M.; Trippel, T.; Draxler, C. (2014): Using CLARIN for Digital Research. In: Digital Humanities, Lausanne, Switzerland. Verfügbar unter https://www.clarin.eu/sites/default/files/DH2014TutorialWorkshop_handout_0.pdf.

Zampieri, M.; Vela, M. (2014): Quantifying the Influence of MT Output in the Translators' Performance: A Case Study in Technical Translation. In: EACL Workshop on Humans and Computer-assisted Translation (HaCaT), 93-98.

2013

Amoia, M. (2013): Interactive Video-Training for Medical Professionals. In: The 5th International Conference on Computer Supported Education (CSEDU 2013).

Amoia, M.; Kermes, H.; Knappen, J.; Martínez, J. M.; Teich, E.; Vela, M. (2013): TeLeMaCo - a collaborative repository for training and teaching materials in linguistics. In: The International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2013).

Beck, K.; Hinrichs, E.; Trippel, T.; Zastrow, T. (2013): CLARIN-D – eine web- und zentrenbasierte Forschungsinfrastruktur für die Geistes- und Sozialwissenschaften. In: 35. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft – Informationsstruktur, Potsdam, Germany.

Blessing, A.; Sonntag, J.; Kliche, F.; Heid, U.; Kuhn, J.; Stede, M. (2013): Towards a Tool for Interactive Concept Building for Large Scale Analysis in the Humanities. In: 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, Sofia, Bulgaria, 55-64.

Boehlke, V. (2013): CLARIN-D User Relations. In: CLARIN Annual Meeting, Prague, Czech Republic.

Boehlke, V.; Eckart, T.; Kuras, C. (2013): Tools and Resources of the CLARIN D Centre Leipzig. In: CLARIN-D M24 Workshop, Nijmegen, Netherlands. Verfügbar unter http://clarin-d.de/joomla/images/m24/DEMOSLAM.pdf.

Boehlke, V.; Heyer, G.; Wittenburg, P. (2013): IT-Based Research Infrastructures for the Humanities and Social Sciences - Developments, Examples, Standards, and Technology.  In: Information Technology.

Ďurčo, M.; Windhouwer, M. (2013): Semantic Mapping in CLARIN Component Metadata. In: Metadata and Semantics Research (MTSR), Thessaloniki, Greece, 163-168. https://doi.org/10.1007/978-3-319-03437-9_17.

Eckart, T. (2013): TEI-Integrator. In: 2. CLARIN-D Disseminationsworkshop, Leipzig, Germany. Verfügbar unter http://clarin-d.de/joomla/images/m24/DEMOSLAM.pdf.

Fankhauser, P.; Fiedler, N.; Witt, A. (2013): Forschungsdatenmanagement in den Geisteswissenschaften am Beispiel der germanistischen Linguistik.  In: Zeitschrift für Bibliothekswesen und Bibliographie (ZfBB), 60  (6),  296-306.

Gärtner, M.; Thiele, G.; Seeker, W.; Björkelund, A.; Kuhn, J. (2013): ICARUS - An Extensible Graphical Search Tool for Dependency Treebanks. In: 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Sofia, Bulgaria, 55-60. Verfügbar unter https://aclweb.org/anthology/P/P13/P13-4010.pdf.

Geyken, A.; Haaf, S.; Jurish, B.; Schulz, M.; Thomas, C.; Wiegand, F. (2013): DTA-/CLARIN-D-Konferenz und -Workshops: Historische Textkorpora für die Geistes- und Sozialwissenschaften. Fragestellungen und Nutzungsperspektiven. In: CLARIN-D-Newsletter 14/15. Verfügbar unter https://www.clarin-d.de/images/newsletter/CLARIN-D-Newsletter-2013-4.pdf.

Gloning, T. (2013): Digitale Ressourcen und sprachgeschichtliche Forschung. Am Beispiel des "Polytechnischen Journals" (1820-1831). In: Jahrestagung der Gesellschaft für germanistische Sprachgeschichte (GGSG), Universität Kassel.

Gloning, T.; Neuber, F.; Stäcker, T.; Seim, S.; Thomas, C. (2013): Zwischenbericht des CLARIN-D Kurationsprojekts 1 der F-AG 1; Berichtszeitraum 01.09.2012–28.02.2013.

Haaf, S.; Wiegand, F.; Geyken, A. (2013): Measuring the Correctness of Double-Keying: Error Classification and Quality Control in a Large Corpus of TEI-Annotated Historical Text. In: Journal of the Text Encoding Initiative. https://doi.org/10.4000/jtei.739.

Heyer, G.; Boehlke, V. (2013): Text Mining in the Humanities – A Plea for Research Infrastructures. In: J. Gippert & R. Gehrke (Eds.), "Historical Corpora". Corpus Linguistics and Interdisciplinary Perspectives on Language (CLIP). Gunter Narr Publishing House, Tübingen, Germany. Verfügbar unter http://asv.informatik.uni-leipzig.de/staff/Volker_Boehlke.

Hinrichs, E.; Henrich, V.; Barkey, R. (2013): Using Part-Whole Relations for Automatic Deduction of Compound-Internal Relations in GermaNet.  In: Language Resources and Evaluation, 47  (3),  839-858.

Jurish, B. (2013): Canonicalizing the Deutsches Textarchiv. In: I. Hafemann (Ed.): Perspektiven einer corpusbasierten historischen Linguistik und Philologie. Internationale Tagung des Akademienvorhabens „Altägyptisches Wörterbuch“ an der Berlin-Brandenburgischen Akademie der Wissenschaften, Berlin, Germany.

Jurish, B.; Drotschmann, M.; Ast, H. (2013): Constructing a Canonicalized Corpus of Historical German by Text Alignment.  In: New Methods in Historical Corpora, Volume 3 of Corpus Linguistics and Interdisciplinary Perspectives on Language (CLIP), 3.

Jurish, B.; Würzner, K.-M. (2013a): Multi-threaded composition of finite-state automata. In: 11th International Conference on Finite State Methods and Natural Language Processing, St Andrews, Scotland, 81-89. Verfügbar unter http://www.aclweb.org/anthology/W13-1813.

Jurish, B.; Würzner, K.-M. (2013b): Word and Sentence Tokenization with Hidden Markov Models. In: Journal for Language Technology and Computational Linguistics (JLCL), 28 (2), 61-83. Verfügbar unter http://www.jlcl.org/2013_Heft2/3Jurish.pdf.

Kupietz, M.; Frick, E. (2013): Korpusanalyseplattform der nächsten Generation. In: I. Kratochvílová & N. R. Wolf (Eds.), Grundlagen einer sprachwissenschaftlichen Quellenkunde (= Studien zur Deutschen Sprache 66). 27-36. Narr 2013, Tübingen.

Lapshinova, E.; Kunz, K. (2013): Cross-linguistic analysis of cohesion: Variation across production types and registers. In: Workshop at ICAME34 "Cross-linguistic studies at the interface between lexis and grammar", Santiago de Compostela, Spain.

Lapshinova-Koltunski, E. (2013): VARTRA: A Comparable Corpus for Analysis of Translation Variation. In: Proceedings of the 6th Workshop on Building and Using Comparable Corpora (BUCC), Sofia, Bulgaria.

Lapshinova-Koltunski, E.; Degaetano, S.; Teich, E.; Kermes, H. (2013): Cohesive conjunctions in English and German: Systemic contrasts and textual differences. In: F. Poppi & W. Cheng (Eds.), Linguistic evolution of conjunctive relations in emerging scientific registers. Cambridge Scholars Publishing.

Lapshinova-Koltunski, E.; Degaetano-Ortlieb, S.; Kermes, H.; Teich, E. (2013): Usefulness of Corpora Enriched with Annotations on Abstract Linguistic Levels. In: Genre- and Register-Related Text and Discourse Features in Multilingual Corpora, Brussels, Belgium.

Lapshinova-Koltunski, E.; Kunz, K. (2013): Detecting Cohesion: semi-automatic annotation procedures. In: Corpus Linguistics, Lancaster, UK.

Mair, C. (2013): Developing digital infrastructure for language documentation: some personal experiences, some lessons learnt, and perspectives from CLARIN-D. In: Applying Documentary Linguistics, Freiburg.

Martens, S. (2013): TüNDRA: A Web Application for Treebank Search and Visualization. In: 12th International Workshop on Treebanks and Linguistic Theories (TLT 2013), Sofia. Verfügbar unter http://www.bultreebank.org/TLT12.

Neuber, F.; Thomas, C. (2013): Kurz vor dem Ziel: Das Kurationsprojekt 1 der F-AG 1 "Deutsche Philologie".  In: CLARIN-D-Newsletter, 5,  29-32.

Reichel, U. D. (2013): Das BAS-Repository.  In: CLARIN-D Newsletter, 5,  22-26. Verfügbar unter http://clarin07.ims.uni-stuttgart.de/joomla/images/newsletter/CLARIN-D-Newsletter-2013-5.pdf.

Schmidt, T.; Dickgießer, S.; Gasch, J. (2013): Die Datenbank für Gesprochenes Deutsch - DGD2.

Schmidt, T.; Wörner, K.; Hedeland, H.; Lehmberg, T. (2013): Leitfaden zur Beurteilung von Aufbereitungsaufwand und Nachnutzbarkeit von Korpora gesprochener Sprache. 1-22. Verfügbar unter http://ids-pub.bsz-bw.de/frontdoor/index/index/docId/1331.

Sperberg-McQueen, C. M.; Schonefeld, O.; Kupietz, M.; Lüngen, H.; Witt, A. (2013): Igel: Comparing document grammars using XQuery. In: The Markup Conference 2013 (Balisage Series on Markup Technologies 10), Montréal, Canada.

Teich, E. (2013): Choices in analyzing choice: methods and techniques for register analysis. In: L. Fontaine et al. (Eds.), Systemic Functional Linguistics: Exploring Choice. 417-431. Cambridge University Press.

Teich, E.; Degaetano-Ortlieb, S.; Kermes, H.; Lapshinova-Koltunski, E. (2013): Scientific registers and disciplinary diversification: a comparable corpus approach. In: The 6th Workshop on Building and Using Comparable Corpora (BUCC), Sofia, Bulgaria.

Westpfahl, S.; Schmidt, T. (2013): POS für(s) FOLK - Part of Speech Tagging des Forschungs- und Lehrkorpus Gesprochenes Deutsch.  In: Journal for Language Technology and Computational Linguistics (JLCL),  (1),  139-156.

Zastrow, T.; Hinrichs, E.; Hinrichs, M.; Beck, K. (2013): Scientific Visualization as CLARIN-D Web Applications. In: Digital Humanities. Verfügbar unter http://dh2013.unl.edu/abstracts/files/downloads/DH2013_conference_abstracts_print.pdf.

2012

Amoia, M.; Bretaudiere, T.; Denis, A.; Gardent, C.; Perez-Beltrachini, L. (2012): A Serious Game for Second Language Acquisition in a Virtual Environment.  In: Journal on Systemics, Cybernetics and Informatics 1,  24-34.

Arranz, V.; Broeder, D.; Gaiffe, B.; Gavrilidou, M.; Monachini, M.; Trippel, T. (2012a): Describing Language Resources with Metadata: Towards Flexibility and Interoperability in the Documentation of Language Resources. In: LREC 2012.

Arranz, V.; Broeder, D.; Gaiffe, B.; Gavrilidou, M.; Monachini, M.; Trippel, T. (2012b): Proceedings of the Workshop Describing Language Resources with Metadata: Towards Flexibility and Interoperability in the Documentation of Language Resources. LREC 2012, Istanbul, Turkey. Verfügbar unter http://lrec-conf.org/proceedings/lrec2012/workshops/11.LREC2012 Metadata Proceedings.pdf.

Bański, P.; Fischer, P. M.; Frick, E.; Ketzan, E.; Kupietz, M.; Schnober, C.; Schonefeld, O.; Witt, A. (2012): The New IDS Corpus Analysis Platform: Challenges and Prospects. In: N. Calzolari et al. (Eds.), Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12). 2905-2911. European Language Resources Association (ELRA), Istanbul, Turkey.

Bański, P.; Majewski, S.; Stührenberg, M.; Werthmann, A. (2012): Building and Maintaining the TEI LingSIG Bibliography. Using Open Source Tools for an Open Content Initiative.  In: TEI and Linguistics. Journal of the Text Encoding Initiative, 3.

Bański, P.; Modignani Picozzi, E. L.; Witt, A. (2012): TEI and Linguistics. In: Journal of the Text Encoding Initiative 3.

Bański, P.; Modignani Picozzi, E. L.; Witt, A. (2012): Editorial Introduction to the Third Issue.  In: TEI and Linguistics. Journal of the Text Encoding Initiative, 3.

Beißwenger, M.; Ermakova, M.; Geyken, A.; Lemnitzer, L.; Storrer, A. (2012a): DeRiK. A German Reference Corpus of Computer-Mediated Communication. In: Digital Humanities Conference, Hamburg. Verfügbar unter http://www.dh2012.uni-hamburg.de/conference/programme/abstracts/derik-a-german-reference-corpus-of-computer-mediated-communication.1.html.

Beißwenger, M.; Ermakova, M.; Geyken, A.; Lemnitzer, L.; Storrer, A. (2012b): A TEI schema for the Representation of the Computer-mediated Communication.  In: TEI (Text Encoding Initiative) Journal, Vol 3. Verfügbar unter http://jtei.revues.org/476.

Beißwenger, M.; Lemnitzer, L. (2012): Aufbau eines Referenzkorpus zur deutschsprachigen internetbasierten Kommunikation: Fragen der Annotation und Verarbeitung. In: Talk, Webkorpora in Linguistik und Sprachforschung (Workshop), Mannheim.

Blessing, A.; Eckart, K.; Kuhn, J.; Stegmann, J.; Zinsmeister, H. (2012): Interaktive Textanalyse und das CLARIN-D Infrastrukturprojekt. In: Ringvorlesung Digital Humanities: Projekte und Perspektiven.

Boehlke, V.; Compart, T.; Eckart, T. (2012): Building up a CLARIN resource center – Step 1: Providing metadata. Workshop on Describing Language Resources with Metadata. In: LREC 2012, Istanbul, Turkey.

Broeder, D.; Uytvanck, D. v.; Gavrilidou, M.; Trippel, T. (2012): Standardizing a Component Metadata Infrastructure. In: N. Calzolari et al. (Eds.): International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey. Verfügbar unter http://pubman.mpdl.mpg.de/pubman/item/escidoc:1451190:4/component/escidoc:1478385/Broeder_LREC_2012.pdf.

Broeder, D.; Windhouwer, M.; van Uytvanck, D.; Trippel, T.; Goosen, T. (2012): CMDI: a Component Metadata Infrastructure. In: V. Arranz et al. (Eds.): Proceedings of the LREC 2012 Workshop Describing Language Resources with Metadata: Towards Flexibility and Interoperability in the Documentation of Language Resources, Istanbul, Turkey, 1-4. Verfügbar unter http://lrec-conf.org/proceedings/lrec2012/workshops/11.LREC2012 Metadata Proceedings.pdf.

Delz, M. (2012): WebLicht - a web based linguistic chaining tool for incremental text annotation. In: Tutorium auf der 52. Studentischen Tagung Sprachwissenschaften (StuTS 52), Berlin.

Dima, E.; Henrich, V.; Hinrichs, E.; Hinrichs, M.; Hoppermann, C.; Trippel, T.; Zastrow, T.; Zinn, C. (2012): A Repository for the Sustainable Management of Research Data. In: 8th International Conference on Language Resources and Evaluation. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2012/pdf/470_Paper.pdf.

Dima, E.; Hinrichs, E.; Hinrichs, M.; Kislev, A.; Trippel, T.; Zastrow, T. (2012): Integration of WebLicht into the CLARIN Infrastructure. In: CLARIN-D/DARIAH Workshop „Service-oriented Architectures (SOAs) for the Humanities: Solutions and Impacts“ at Digital Humanities Conference, Hamburg, Germany, 17-23. Verfügbar unter http://clarin-d.de/images/workshops/proceedingssoasforthehumanities.pdf.

Dima, E.; Hoppermann, C.; Trippel, T.; Zinn, C. (2012): A Metadata Editor to Support the Description of Linguistic Resources. In: LREC 2012, Istanbul, Turkey. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2012/pdf/468_Paper.pdf.

Dittmann, H.; Ďurčo, M.; Geyken, A.; Roth, T.; Zimmer, K. (2012): A distributed corpus of German varieties.  In: Multilingual Corpora and Multilingual Corpus Analysis, 14,  339–346.

Draxler, C.; Kisler, T. (2012): Tutorien zu SpeechRecorder, WebMAUS und percy. In: 52. Studentische Tagung Sprachwissenschaft (StuTS), Berlin.

Drude, S.; Trilsbeek, P.; Broeder, D. (2012): Language Documentation and Digital Humanities: The (DoBeS) Language Archive. In: J. C. Meister (Ed.): Digital Humanities 2012, Hamburg. Verfügbar unter http://www.dh2012.uni-hamburg.de/conference/programme/abstracts/language-documentation-and-digital-humanities-the-dobes-language-archive.1.html.

Eckart, K. (2012): A standardized general framework for encoding and exchange of corpus annotations: The Linguistic Annotation Framework, LAF. In: KONVENS 2012, SFLR  Workshop, Vienna, Austria, 506-515. Verfügbar unter http://www.oegai.at/konvens2012/proceedings/76_eckart12w/76_eckart12w.pdf.

Eckart, K.; Riester, A.; Schweitzer, K. (2012): A Discourse Information Radio News Database for Linguistic Analysis. In: C. Chiarcos et al. (Eds.), Linked Data in Linguistics. Representing and Connecting Language Data and Language Metadata. 65-75. Springer, Heidelberg.

Fischer, P. M.; Witt, A. (2012): Developing Solutions for Long-Term Archiving of Spoken Language Data at the Institut für Deutsche Sprache. In: LREC 2012 Workshop “Best Practices for Speech Corpora in Linguistic Research”, European Language Resources Association (ELRA), Istanbul, Turkey, 47-50. Verfügbar unter http://www.lrec-conf.org/lrec2012/IMG/pdf/LREC2012.Workshop Abstracts.pdf.

Frick, E.; Schnober, C.; Bański, P. (2012): Evaluating Query Languages for a Corpus Processing System. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC-2012), Istanbul, Turkey, 2286-2294.

Geyken, A.; Gloning, T.; Stäcker, T. (2012): CLARIN-D-Panel bei der DH 2012 in Hamburg. Ein Panel zu „Large Historical Reference Corpora of German“ an der Uni Hamburg.  In: CLARIN-D-Newsletter, 2,  13/14. Verfügbar unter https://www.clarin-d.de/images/newsletter/clarin-d-newsletter_2012_02.pdf.

Geyken, A.; Haaf, S.; Jurish, B.; Schulz, M.; Thomas, C.; Wiegand, F. (2012): TEI und Textkorpora. Fehlerklassifikation und Qualitätskontrolle vor, während und nach der Texterfassung im Deutschen Textarchiv. In: Jahrbuch für Computerphilologie online. Verfügbar unter http://www.computerphilologie.de/jg09/geykenetal.html.

Geyken, A.; Haaf, S.; Kupietz, M.; Lüngen, H.; Witt, A. (2012): CLARIN-D User Guide, Chapter 6.2: Text Corpora. In: CLARIN-D AP5: CLARIN-D User Guide. Version: 1.0.1, Berlin, 2012. Verfügbar unter https://www.clarin-d.net/de/hilfe/benutzerhandbuch.

Geyken, A.; Haaf, S.; Thomas, C.; Wiegand, F.; Stäcker, T.; Gloning, T. (2012): Compiling large historical reference corpora of German: Quality Assurance, Interoperability and Collaboration in the Process of Publication of Digitized Historical Prints. In: Digital Humanities, Hamburg, Germany.

Geyken, A.; Haaf, S.; Wiegand, F. (2012): The DTA ‘base format’: A TEI-Subset for the Compilation of Interoperable Corpora. In: J. Jancsary (Ed.): 11th Conference on Natural Language Processing (KONVENS) - Empirical Methods in Natural Language Processing (LThist 2012 Workshop), Vienna, Austria, 383-391. Verfügbar unter http://www.oegai.at/konvens2012/proceedings.pdf.

Hedeland, H.; Schmidt, T. (2012): Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German.  In: Multilingual Corpora and Multilingual Corpus Analysis, Hamburg Studies in Multilingualism, 14,  25-46. Verfügbar unter http://exmaralda.org/gscl2011/downloads/AZM96.pdf.

Hedeland, H.; Wörner, K. (2012): Experiences and Problems creating a CMDI profile from an existing Metadata Schema. In: LREC-Workshop Describing LRs with Metadata: Towards Flexibility and Interoperability in the Documentation of LR, Istanbul, Turkey. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2012/workshops/11.LREC2012 Metadata Proceedings.pdf.

Herold, A.; Lemnitzer, L. (2012a): CLARIN-D User Guide. CLARIN-D AP5: CLARIN-D User Guide. Version: 1.0.1. 128. BBAW, Berlin. Verfügbar unter http://media.dwds.de/clarin/userguide/userguide-1.0.1.pdf.

Herold, A.; Lemnitzer, L. (2012b): CLARIN-D User Guide: Introduction and Background. In: CLARIN-D AP5: CLARIN-D User Guide. Version: 1.0.1, Berlin, 2012. Verfügbar unter https://www.clarin-d.net/de/hilfe/benutzerhandbuch.

Herold, A.; Lemnitzer, L.; Geyken, A. (2012): Integrating lexical resources through an aligned lemma list. In: C. Chiarcos et al. (Eds.), Linked Data in Linguistics. 35-44. Springer, Berlin, Heidelberg. Verfügbar unter http://svn.aksw.org/papers/2013/Thesis_Sebastian/submission/public_version.pdf.

Heyer, G. (2012): Ansätze und Perspektiven zum Einsatz von Text Mining in den Geisteswissenschaften. In: Eingeladener Vortrag an der Goethe Universität Frankfurt/Main.

Hinrichs, E.; Beck, K. (2012): Web-Based Linguistic Annotation: Current Practice and Future Directions. In: Mapping the Landscape of eResearch, Digitization Lifecycle, MPG, Berlin, Germany. Verfügbar unter http://www.h-net.org/reviews/showpdf.php?id=35883.

Jurish, B. (2012): Finite-state Canonicalization Techniques for Historical German.

Kasper, W.; Vela, M. (2012): Monitoring and Summarization of Hotel Reviews. In: M. Fuchs et al. (Eds.): Information and Communication Technologies in Tourism 2012.

Kermes, H. (2012a): Formulaic expressions: in this paper but where? In: Proceedings of ICAME 33, Leuven, Belgium.

Kermes, H. (2012b): A methodology for the extraction of information about the usage of formulaic expressions in scientific texts. In: Proceedings of LREC 2012, Istanbul, Turkey.

Kermes, H.; Teich, E. (2012): Formulaic expressions in scientific texts: Corpus design, extraction and exploration.  In: Lexicographica, 28  (1),  99-120.

Kisler, T.; Schiel, F.; Sloetjes, H. (2012): Signal processing via web services: the use case WebMAUS. In: Digital Humanities 2012, Hamburg, Germany, 30-34.

Klosa, A.; Kupietz, M.; Lüngen, H. (2012): Zum Nutzen von Korpusauszeichnungen für die Lexikographie. In: Lexicographica 28. 71-97. de Gruyter 2012, Berlin/Boston.

Lapshinova-Koltunski, E.; Teich, E.; Degaetano-Ortlieb, S. (2012): Terminology now and then: changes across periods in academic writing. In: ICAME 33, Leuven, Belgium. Verfügbar unter http://wwwling.arts.kuleuven.be/icame33/.

Lapshinova-Koltunski, E.; Kunz, K.; Amoia, M. (2012): Compiling a Multilingual Spoken Corpus. In: GSCP 2012 International Conference: SPEECH and CORPORA, Universidade Federal de Minas Gerais, Brazil.

Lapshinova-Koltunski, E.; Teich, E.; Degaetano-Ortlieb, S. (2012): Tracing 'hybridity' in academic discourse: a corpus-based approach. In: ESFLCW 2012, Bertinoro, Italy.

Lüngen, H. (2012): DeReKo-Archiv jetzt mit fünf Milliarden Textwörtern. Zum größten digitalen Textarchiv für deutsche Texte der Gegenwart. In: Sprachreport 1/2012. 26. Institut für Deutsche Sprache 2012, Mannheim.

Lüngen, H.; Sperberg-McQueen, C. M. (2012): A TEI P5 Document Grammar for the IDS Text Model. In: TEI and Linguistics. Journal of the Text Encoding Initiative 3.

Martínez Martínez, J. M.; Serrat Roozen, I. (2012): ECPC: el discurso parlamentario europeo desde la perspectiva de los estudios traductológicos de corpus.  In: Linguamática, 4  (2),  65-73. Verfügbar unter http://linguamatica.com/index.php/linguamatica/article/view/131.

Neuber, F.; Thomas, C. (2012): Vorstellung des Kurationsprojekts 1 der Clarin-D-FAG 1 „Deutsche Philologie“.  In: CLARIN-D-Newsletter, 3,  11-13. Verfügbar unter http://www.clarin-d.de/images/newsletter/CLARIN-D-Newsletter-2012-3.pdf.

Perkuhn, R.; Keibel, H.; Kupietz, M. (2012): Korpuslinguistik. Fink, 2012, Paderborn.

Reichel, U. D. (2012): PermA and Balloon: Tools for string alignment and text processing. In: Proc. Interspeech, Portland, Oregon, US. Verfügbar unter http://www.phonetik.uni-muenchen.de/forschung/publikationen//ReichelIS2012.pdf.

Schmidt, T. (2012): EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language. In: N. Calzolari (Ed.): The Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, 236-240.

Schmidt, T.; Wörner, K. (2012): EXMARaLDA. In: J. Durand et al. (Eds.), Handbook on Corpus Phonology. Oxford University Press.

Schuurman, I.; Windhouwer, M. (2012): Tutorial: ISOcat in daily life - possible uses of a large repository of widely used linguistic concepts. In: Workshop at the LREC 2012, Istanbul.

Stehouwer, H.; Ďurčo, M.; Auer, E.; Broeder, D. (2012): Federated Search: Towards a Common Search Infrastructure. In: N. Calzolari et al. (Eds.): Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey, 3255-3259. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2012/pdf/524_Paper.pdf.

Teich, E.; Degaetano-Ortlieb, S.; Kermes, H.; Lapshinova-Koltunski, E. (2012): Register contact: an exploration of recent linguistic trends in the scientific domain. In: Historical Corpora 2012, Frankfurt, Germany.

Telljohann, H.; Hinrichs, E.; Kübler, S.; Zinsmeister, H.; Beck, K. (2012): Stylebook for the Tübingen Treebank of Written German (TüBa-D/Z), Seminar für Sprachwissenschaft Tübingen.

Thomas, C.; Wiegand, F. (2012): Making great work even better. Appraisal and Digital Curation of widely dispersed Electronic Textual Resources (c. 15th–19th cent.). In: CLARIN-D (Ed.): International Conference “Historical Corpora 2012”, Goethe University, Frankfurt, Germany. Verfügbar unter urn:nbn:de:kobv:b4-opus-23081.

Trippel, T.; Hoppermann, C.; Depoorter, G. (2012): The Component Metadata Infrastructure (CMDI) in a Project on Sustainable Linguistic Resources. In: Proceedings of the LREC 2012 Workshop Describing Language Resources with Metadata: Towards Flexibility and Interoperability in the Documentation of Language Resources, Istanbul, Turkey. Verfügbar unter http://lrec-conf.org/proceedings/lrec2012/workshops/11.LREC2012 Metadata Proceedings.pdf.

Van Uytvanck, D.; Stehouwer, H.; Lampen, L. (2012): Semantic metadata mapping in practice: The Virtual Language Observatory. In: N. Calzolari et al. (Eds.), Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12). ELRA, Istanbul, Turkey. Verfügbar unter http://www.lrec-conf.org/proceedings/lrec2012/pdf/437_Paper.pdf.

Vela, M. (2012): Extraction of ontology schema components from financial news. Postfach 151141, 66041 Saarbrücken. Verfügbar unter http://scidok.sulb.uni-saarland.de/volltexte/2012/4706.

Windhouwer, M.; Wright, S. E. (2012): Linking to linguistic data categories in ISOcat. In: C. Chiarcos et al. (Eds.), Linked Data in Linguistics. Representing and Connecting Language Data and Language Metadata. 99-107. Springer, Berlin.

Wittenburg, P.; van Uytvanck, D. (2012): CLARIN-D User Guide, Chapter 2: Metadata. CLARIN-D. Verfügbar unter http://media.dwds.de/clarin/userguide/text/index.xhtml.

Wörner, K. (2012): Finding the balance between strict defaults and total openness: Collecting and managing metadata for spoken language corpora with the EXMARaLDA Corpus Manager. In: T. Schmidt & K. Wörner (Eds.), Multilingual Corpora and Multilingual Corpus Analysis, Hamburg Studies in Multilingualism 14. 383-400. John Benjamins Publishing Company.

Zastrow, T.; Dima, E. (2012): Workflow Engines in Digital Humanities. In: Joint CLARIN-D/DARIAH Workshop „Service-oriented Architectures (SOAs) for the Humanities: Solutions and Impacts" - Digital Humanities Conference, Hamburg, Germany, 56-61. Verfügbar unter http://clarin-d.de/images/workshops/proceedingssoasforthehumanities.pdf.

Zinn, C.; Hoppermann, C.; Trippel, T. (2012): The ISOcat Registry Reloaded: a Re-engineering Proposal Following schema.org. In: E. Simperl et al. (Eds.): 9th Extended Semantic Web Conference (ESWC 2012), 285-299. https://doi.org/10.1007/978-3-642-30284-8_26.

Zinsmeister, H.; Breckle, M. (2012): The ALeSKo learner corpus: design – annotation – quantitative analyses. In: T. Schmidt & K. Wörner (Eds.), Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism. 71–96. John Benjamins Publishing Company.

2011

Amoia, M. (2011): I-FLEG: A 3D-Game for Learning French. In: The 9th International Conference on Education and Information Systems, Technologies and Applications (EISTA 2011).

Amoia, M.; Gardent, C.; Perez-Beltrachini, L. (2011): Learning a Second Language with a Videogame. In: The International Conference on Information and Communication Technologies (ITS 2011) for Language Learning.

Barkey, R.; Hinrichs, E.; Hoppermann, C.; Trippel, T.; Zinn, C. (2011a): Komponenten-basierte Metadatenschemata und Facetten-basierte Suche: Ein flexibler und universeller Ansatz. In: J. Griesbaum et al. (Eds.): 12. Internationales Symposium für Informationswissenschaft (ISI 2011) (58), Hildesheim, 62-73.

Barkey, R.; Hinrichs, E.; Hoppermann, C.; Trippel, T.; Zinn, C. (2011b): Trailblazing through Forests of Resources in Linguistics. In: Digital Humanities (DH), Stanford.

Belica, C.; Kupietz, M.; Witt, A.; Lüngen, H. (2011): The Morphosyntactic Annotation of DeReKo: Interpretation, Opportunities, and Pitfalls. In: M. Konopka et al. (Eds.): Grammatik und Korpora. Dritte Internationale Konferenz, Mannheim, Germany, 451-469.

Broeder, D.; Schonefeld, O.; Trippel, T.; Van Uytvanck, D.; Witt, A. (2011): A pragmatic approach to XML interoperability – the Component Metadata Infrastructure (CMDI). In: Presented at Balisage: The Markup Conference 2011, Montréal, Canada. https://doi.org/10.4242/BalisageVol7.Broeder01.

Degaetano, S.; Teich, E. (2011): Exploring a semi-automatic approach for the analysis of interpersonal meaning in large corpora. In: International Systemic Functional Congress (ISFC 38), Lisbon, Portugal.

Geyken, A. (2011a): Die dynamische Verknüpfung von Kollokationen mit Korpusbelegen und deren Repräsentation im DWDS-Wörterbuch.  In: Datenmodellierung für Internetwörterbücher 2/2011,  9-22.

Geyken, A. (2011b): Statistische Wortprofile zur schnellen Analyse der Syntagmatik in Textkorpora. In: A. Abel & R. Zanin (Eds.), Korpora in Lehre und Forschung. 115-137, Bozen/Bolzano.

Geyken, A. (2011c): Wege zu einem historischen Referenzkorpus des Deutschen: das Projekt Deutsches Textarchiv. In: I. Hafemann (Ed.): Perspektiven einer corpusbasierten historischen Linguistik. Internationale Tagung des Akademienvorhabens „Altägyptisches Wörterbuch“, Berlin, Germany, 221-234.

Geyken, A.; Haaf, S.; Jurish, B.; Schulz, M.; Steinmann, J.; Thomas, C.; Wiegand, F. (2011): Das Deutsche Textarchiv: Vom historischen Korpus zum aktiven Archiv. In: S. Schomburg et al. (Eds.), Digitale Wissenschaft. Stand und Entwicklung digital vernetzter Forschung in Deutschland. 157-161, Köln. Verfügbar unter http://www.hbz-nrw.de/dokumentencenter/veroeffentlichungen/Tagung_Digitale_Wissenschaft.pdf#page=159.

Hedeland, H.; Lehmberg, T.; Schmidt, T.; Wörner, K. (2011): Multilingual Corpora at the Hamburg Centre for Language Corpora. In: T. Schmidt & K. Wörner (Eds.): GSCL Conference, Hamburg, Germany.

Heyer, G.; Büchler, M.; Boehlke, V. (2011): Aspects of an Infrastructure for eHumanities. In: Supporting Digital Humanities, Copenhagen, Denmark.

Hoppermann, C.; Trippel, T.; Zinn, C. (2011): Devil’s Advocate on Metadata in Science. In: H. Hedeland et al. (Eds.): Conference of the German Society for Computational Linguistics and Language Technology (GSCL), Hamburg, 105-109.

Jurish, B. (2011a): Comparing canonicalizations of historical German text. In: 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology (SIGMORPHON) 72-77. Verfügbar unter http://www.aclweb.org/anthology/W10-2209.

Jurish, B. (2011b): More than Words. Using Token Context to Improve Canonicalization of Historical German. In: Journal for language technology and computational linguistics (JLCL). Verfügbar unter http://www.jlcl.org/2010_Heft1/bryan_jurish.pdf.

Kasper, W.; Vela, M. (2011): Sentiment Analysis for Hotel Reviews. In: Computational Linguistics-Applications Conference. Verfügbar unter http://www.proceedings2011.cla-conf.org/pliks/25.pdf.

Kermes, H. (2011): Usage and function of formulaic expressions in scientific texts. In: Corpus Linguistics 2011, Birmingham, UK, 86-87.

Gavrilidou, M.; Labropoulou, P.; Piperidis, S.; Monachini, M.; Frontini, F.; Francopoulo, G.; Arranz, V.; Mapelli, V. (2011): A metadata schema for the description of language resources (LRs). In: 5th International Joint Conference on Natural Language Processing (IJCNLP 2011).

Schmidt, T.; Wörner, K.; Hedeland, H.; Lehmberg, T. (2011): New and future developments in EXMARaLDA. In: T. Schmidt & K. Wörner (Eds.): GSCL Conference, Hamburg, Germany.

Schuurman, I.; Windhouwer, M. (2011): Explicit semantics for enriched documents. What do ISOcat, RELcat and SCHEMAcat have to offer? In: 2nd Supporting Digital Humanities conference (SDH 2011), Copenhagen.

Teich, E.; Degaetano, S. (2011): SciTex - A Diachronic Corpus for Analyzing the Development of Scientific Registers. In: Conference on New Methods in Historical Corpora, University of Manchester, UK.

Teich, E.; Lapshinova, E.; Degaetano, S. (2011): Interdisciplinarity in academic discourse - A corpus-based analysis. In: Interdisciplinary Linguistics, Belfast, UK.

Teich, E.; Lapshinova, E.; Kermes, H.; Degaetano, S. (2011): Linguistic evolution of emerging scientific registers. In: Clavier 2011, Modena, Italy.

Vela, M.; Declerck, T. (2011): A Multi-Layer Approach on the Extraction of Schema Components of Ontologies from Text. In: SEMAPRO 2011.

Zinn, C. (2011): Building a Faceted Browser in CouchDB Using Views on Views and Erlang Metaprogramming. In: Functional and Constraint Logic Programming: 20th International Workshop, WFLP 2011, Odense, Denmark, 104-122.

Older

Belica, C.; Keibel, H.; Kupietz, M.; Perkuhn, R.; Vachková, M. (2010): Putting corpora into perspective. Rethinking synchronicity in corpus linguistics. In: M. Mahlberg, V. González-Díaz & C. Smith (Eds.), Proceedings of the Corpus Linguistics Conference 2009. 22. Liverpool: University of Liverpool 2010, Liverpool, UK.

Geyken, A.; Haaf, S.; Jurish, B.; Schulz, M.; Steinmann, J.; Thomas, C.; Wiegand, F. (2010): Das Deutsche Textarchiv: Vom historischen Korpus zum aktiven Archiv. In: S. Schomburg et al. (Eds.): Digitale Wissenschaft. Stand und Entwicklung digital vernetzter Forschung in Deutschland, Köln, 157-161. Available at http://fiz1.fh-potsdam.de/volltext/DigiWis/13472.pdf.

Henrich, V.; Hinrichs, E. (2010a): GernEdiT - The GermaNet Editing Tool. In: Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC 2010), 2228-2235, Valletta, Malta.

Henrich, V.; Hinrichs, E. (2010b): Standardizing Wordnets in the ISO Standard LMF: Wordnet-LMF for GermaNet. In: 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, 456-464.

Hinrichs, E.; Henrich, V.; Zastrow, T. (2010): Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment. In: Seventh Conference on International Language Resources and Evaluation (LREC 2010), Valletta, Malta.

Hinrichs, E.; Hinrichs, M.; Zastrow, T. (2010): WebLicht: Web-Based LRT Services for German. In: Proceedings of the Systems Demonstrations at the 48th Annual Meeting of the Association for Computational Linguistics (ACL-2010), Uppsala, Sweden, 25-29. Available at http://aclweb.org/anthology/P10-4005.

Hinrichs, M.; Zastrow, T.; Hinrichs, E. (2010): WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure. In: N. Calzolari et al. (Eds.): Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), Valletta, Malta, 489-493. Available at http://www.lrec-conf.org/proceedings/lrec2010/pdf/270_Paper.pdf.

Jurish, B. (2010): Efficient online k-best lookup in weighted finite-state cascades. In: T. Hanneforth & G. Fanselow (Eds.), Language and Logos. Studies in Theoretical and Computational Linguistics. 313-327, Berlin, 2010.

Klein, W.; Geyken, A. (2010): Das Digitale Wörterbuch der Deutschen Sprache (DWDS). In: U. Heid et al. (Eds.), Lexicographica. 79-93, Berlin/New York.

Kunze, C.; Lemnitzer, L. (2010): Lexical-semantic and conceptual relations in GermaNet. In: P. Storjohann (Ed.), Lexical-semantic relations. Theoretical and practical perspectives. 163-184. Benjamins, Amsterdam.

Kupietz, M.; Belica, C.; Keibel, H.; Witt, A. (2010): The German Reference Corpus DeReKo: A primordial sample for linguistic research. In: N. Calzolari et al. (Eds.), Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC 2010). 1848-1854. European Language Resources Association (ELRA) 2010.

Kupietz, M.; Keibel, H. (2009): The Mannheim German Reference Corpus (DeReKo) as a basis for empirical linguistic research. In: Working Papers in Corpus-based Linguistics and Language Education, 3, 53-59.

Kupietz, M.; Schonefeld, O.; Witt, A. (2010): The German Reference Corpus: New developments building on almost 50 years of experience. In: E. Arranz (Ed.): Language Resources: From Storyboard to Sustainability and LR Lifecycle Management, Workshop held at the Seventh Conference on International Language Resources and Evaluation (LREC 2010), Malta, 39-43.

Rehm, G.; Schonefeld, O.; Trippel, T.; Witt, A. (2010): Sustainability of Linguistic Resources Revisited. In: International Symposium on XML for the Long Haul: Issues in the Long-term Preservation of XML, Balisage. DOI: 10.4242/BalisageVol6.Witt01.

Schulz, A.; Teich, E. (2009): The secret life of the negative: An investigation of polarity and modality in a corpus of newsgroup texts. In: English Corpus Linguistics: Looking back, Moving forward, 74, 191-204.

Sloetjes, H.; Wittenburg, P. (2008): Annotation by Category: ELAN and ISO DCR. In: N. Calzolari et al. (Eds.): Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. Available at http://www.lrec-conf.org/proceedings/lrec2008/pdf/208_paper.pdf.

Váradi, T.; Wittenburg, P.; Krauwer, S.; Wynne, M.; Koskenniemi, K. (2008): Clarin: Common language resources and technology infrastructure. In: 6th International Conference on Language Resources and Evaluation (LREC).