CLARIN-D Blog

WebLicht Tutorial

https://youtu.be/3RgRCEa6Smo

This video tutorial shows one of several ways to use WebLicht. WebLicht is a web application provided by CLARIN-D that allows you to build toolchains for linguistic annotation on different layers, such as morphology, syntax or named entity recognition.

To get started, you have to log in with your CLARIN account or your university account. After clicking the Start button and selecting Easy Mode, which supplies you with a pre-defined toolchain, you can analyze text that you type or paste directly into the corresponding window, use a sample text provided by WebLicht, or upload a text file. Now you can select your preferred layer of annotation and hit Run to get a detailed analysis for the selection you have made. You can then download the complete file or parts of it as .csv or .xml.

If the pre-defined toolchain does not satisfy your needs, you can switch to Advanced Mode, where you can build your own customized toolchain. You can always contact the Helpdesk if you have questions, suggestions, or problems that you want to report.
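
To make the idea of a toolchain more concrete, here is a minimal Python sketch. It is not WebLicht code and does not use any WebLicht API; the functions tokenizer, capital_tagger and run_chain are hypothetical stand-ins that only illustrate how each tool in a chain adds one annotation layer on top of the layers produced before it.

```python
from typing import Callable, Dict, List

# A document is modelled as a mapping from layer name to annotations.
# This representation is an assumption made for illustration only.
Document = Dict[str, list]
Tool = Callable[[Document], Document]


def tokenizer(doc: Document) -> Document:
    """Toy tool: adds a 'tokens' layer by splitting the text on whitespace."""
    out = dict(doc)
    out["tokens"] = doc["text"][0].split()
    return out


def capital_tagger(doc: Document) -> Document:
    """Toy stand-in for an entity layer: keeps tokens starting with a capital."""
    out = dict(doc)
    out["entities"] = [tok for tok in doc["tokens"] if tok[:1].isupper()]
    return out


def run_chain(text: str, tools: List[Tool]) -> Document:
    """Runs the tools in order; each one sees the layers added before it."""
    doc: Document = {"text": [text]}
    for tool in tools:
        doc = tool(doc)
    return doc


result = run_chain("Anna visited Tübingen yesterday", [tokenizer, capital_tagger])
print(result["tokens"])
print(result["entities"])
```

The order of the tools matters in this sketch: capital_tagger needs the 'tokens' layer, so it can only run after tokenizer.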


How to use WebMAUS

https://youtu.be/G-TVDx5KQBs

This video tutorial about WebMAUS, the Munich AUtomatic Segmentation, explains how you can easily generate a TextGrid file that aligns an audio signal with its transcription. If you want to learn more about WebMAUS in general, click here. The procedure for obtaining the TextGrid is quite simple. You just need a text file containing the transcription and the corresponding audio file with the spoken language, and you feed both into the application via drag-and-drop. (Careful: the two files need to have the same name.)

After this step, a menu drops down where you can select your preferences and hit the 'run' button. After a few seconds, WebMAUS has created a TextGrid for you, which you can download and open in Praat along with your audio file to check where WebMAUS has segmented your file and to process it further.
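
Because the audio file and the transcription need to share the same name, it can help to check your files before uploading them. The following Python sketch pairs files by their base name and reports those without a partner. The function pair_files and the extensions .wav and .txt are assumptions for illustration; they are not taken from the WebMAUS documentation.

```python
from pathlib import Path

# Assumed extensions; adjust them to the file types you actually upload.
AUDIO_EXTENSIONS = {".wav"}
TEXT_EXTENSIONS = {".txt"}


def pair_files(paths):
    """Pairs audio and transcript files that share the same base name.

    Returns (pairs, unmatched): pairs maps a base name to
    (audio_path, text_path); unmatched lists base names missing a partner.
    """
    audio, text = {}, {}
    for path in map(Path, paths):
        suffix = path.suffix.lower()
        if suffix in AUDIO_EXTENSIONS:
            audio[path.stem] = path
        elif suffix in TEXT_EXTENSIONS:
            text[path.stem] = path
    shared = audio.keys() & text.keys()
    pairs = {stem: (audio[stem], text[stem]) for stem in sorted(shared)}
    unmatched = sorted(audio.keys() ^ text.keys())
    return pairs, unmatched


pairs, unmatched = pair_files(["interview1.wav", "interview1.txt", "interview2.wav"])
print(pairs)
print(unmatched)
```

In this example, interview2 is reported as unmatched because it has an audio file but no transcription.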


WebMAUS Introduction

https://youtu.be/7lI-gOShtFA

This video tutorial gives a brief introduction to the Munich AUtomatic Segmentation, or WebMAUS. It is a tool for aligning speech signals with linguistic categories, which makes it possible, among other things, to align the audio signal of a video with its transcript. As input, WebMAUS needs an audio signal and some form of transcription of the spoken text.

To produce the output, the input text first needs to be normalized. The Balloon tool then creates the expected pronunciation in SAMPA (a phonetic alphabet). In the next step, all other possible pronunciation variants are generated along with their probabilities. These variants are represented in a probabilistic graph, in which WebMAUS finally searches for the path of phonetic units that were actually spoken. The outcome is a transcript of the real pronunciation along with its segmentation.
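
The search for a path through a probabilistic graph can be illustrated with a small Python sketch. This is a toy example, not the MAUS implementation: the graph, its node names and its probabilities are invented, and each edge probability is assumed to already combine how likely a variant is with how well it matches the audio. The function best_path is a hypothetical helper that returns the most probable sequence of phonetic units.

```python
import math

# Invented toy graph: node -> list of (next_node, phone, probability).
# The phones are SAMPA-like symbols chosen for illustration only.
GRAPH = {
    "start": [("n1", "h", 1.0)],
    "n1": [("n2", "a:", 0.7), ("n2", "a", 0.3)],
    "n2": [("n3", "b", 0.6), ("end", "m", 0.4)],
    "n3": [("end", "n", 0.8), ("end", "m", 0.2)],
}


def best_path(graph, start, end):
    """Returns (phones, probability) of the most probable path in a DAG."""
    best = {end: (0.0, [])}  # node -> (log probability, phones up to end)

    def visit(node):
        if node in best:
            return best[node]
        candidates = []
        for next_node, phone, prob in graph[node]:
            score, phones = visit(next_node)
            # Log probabilities are summed to avoid multiplying tiny numbers.
            candidates.append((math.log(prob) + score, [phone] + phones))
        best[node] = max(candidates, key=lambda cand: cand[0])
        return best[node]

    log_score, phones = visit(start)
    return phones, math.exp(log_score)


phones, probability = best_path(GRAPH, "start", "end")
print(phones, round(probability, 3))
```

Memoizing the best result per node means each node is evaluated only once, which is what keeps this kind of search feasible when the graph contains many variants.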

WebMAUS is available as an open-source download and as a web application. Usage is free for all academic users in Europe.
