databases

advertisement
Chapter 2. Some tools for building and querying annotated speech
databases
2.1 Getting started with existing speech databases
Install databases
This tutorial shows how to make the database first accessable to the Emu Database Tool. It shows
how to download and install a database by the use of the Database Installer.
Open an utterance of the database in Emu
This tutorial shows how an utterance of a database is opened using the labelling tool. In particular, we
show how to load a database to see its utterances and how to open the utterances in the Labeller.
There are also some explanations about the type of annotation tiers and associated information.
Finally, useful actions in the labelling tool are presented like zooming in/ out/ all /into marks, resizing
of signal panes and adjusting the signal parameters.
2.2 Interface between Praat and Emu
Open and annotate an utterance of an Emu database in Praat
This tutorial shows how an utterance of an Emu speech database can be send to Praat. The utterance
gam002 of the first database is opened and annotated with Praat and the annotations are saved in
the Emu file format from within Praat. Finally, the same utterance is opened in the Emu labelling tool.
2.3 Interface to R
2.3.2 Reading Emu segment lists into R
Query an Emu database in R
This tutorial shows how the Emu-R library is loaded in R and the commands needed to query an Emu
speech database to create a segment list of the query results. We also show how durations of segments
are calculated from the segment list.
2.4 Creating a new speech database: from Praat to Emu to R
Praat annotation
This tutorial shows how a wav file is annotated for words on a Word tier using Praat thereby creating a
TextGrid.
Convert TextGrid file into Emu format
This tutorial shows how to convert a Praat TextGrid into annotation file(s) that Emu can read using
Convert Labels.
2.5 A first look at the template file
A first look at the template file
This tutorial intoduces the Graphical Template Editor. It explains the steps needed to edit
p2epreparedtpl.tpl so that it can access a complete database. Following a short description of each pane
of the editor, changes to the template file's Track and Variables panes are carried out and
described.
Analyse utterances of Praat derived database in R
This tutorial shows how words on a Word tier that were annotated in Praat are queried and further
analysed in R.
2.7 Questions
The following tutorials show all the steps for creating an Emu speech database so that it is
readable in Praat.
(a) – (g) Create new template file
This tutorial shows the creation of a new template file.
(h) – (i) Enter the location of the template into Emu
This tutorial shows the configuration of the Emu system to find the new database.
(j) Segmenting and labelling in Emu similar to Praat
This tutorial is an introduction to a way of segmenting and labelling in Emu which is quite similar to
Praat.
(k) – (l) Convert Emu annotations into a Praat TextGrid
This tutorial shows the conversion of Emu annotation files into Praat TextGrid files that can then be
opened in Praat together with its signal
Chapter 3 Applying routines for speech signal processing
3.1 Calculating, displaying, and correcting formants
Install database second
This tutorial shows how the database second is installed and finally an utterance of the database is
opened in the labelling tool.
Calculate formant frequencies
This tutorial shows how formant frequencies are calculated for one of two speakers of the database
second using tkassp.
Add formant track to a database
This tutorial shows how formant data are defined in a template file. More specifically, a track fm is
added to the template file and the signal display is extended by the fm track. Finally, an
utterance is opened that shows the displayed formant data.
Manual correction of formant tracking error
This tutorial shows how a formant tracking error is manually corrected by redrawing the data point.
3.4 Questions
1.(a) Calculate formant frequencies for a female speaker
This tutorial shows how an utterance list is restricted to one speaker and how formants are calculated
for a female speaker using the forest analyses of tkassp.
1.(b) - (d) Find and correct outlier data
This tutorial shows how the utterance containing outliers can be found using R and how to correct the
data manually.
2.(a) – (d) Intensity contour in database aetobi
This tutorial demonstrates: the installation of the aetobi database; the calculation of intensity
contours for all utterances of the database aetobi using rmsana of tkassp; the necessary
modifications to the database template to make the intensity data available for the database. Finally, it
shows an aetobi utterance opened in the labelling tool with spectrogram and intensity contours and
how to change the display to show only the Word tier and intensity contour.
2.(e) – (f) Change parameter of intensity contour
This tutorial shows how intensity contours are calculated for the database aetobi with different
window sizes and window shifts. It illustrates how two different intensity contours are defined in the
database template and displayed in the labelling tool.
3. Calculate zero-crossing-rate
This tutorial shows how zero-crossing-rate data are calculated for the database using zcrana in
tkassp. The required modifications to the template file to display the zero-crossing-rate data and the
Word tier in the labelling tool are also demonstrated.
4. Calculate fundamental and formant frequencies using f0ana and forest in tkassp
This tutorial shows how fundamental and formant frequencies are calculated for the database using
using f0ana and forest in tkassp in a single step..
5. Obtain trackdata for a segment list
This tutorial shows how a segmnent list is created in R and written out to a plain text file. This text file
is loaded into tkassp, fundamental frequencies are calculated for the segment list and stored in
another plain txt file. The video also shows how the fundamental frequency data from the text file is
loaded into R and stored in R as an object .
Chapter 4 Querying annotation structures
4.4 Entering structured annotations with Emu
Install database gt
This tutorial shows how database gt is installed using the Database Installer.
Entering structured annotations
This tutorial shows how you get to the Hierarchy View and how to add labels and links between
labels there.
Display segment numbers
This tutorial shows how the segment numbers are displayed in the Signal View, the Hierarchy
View and the EMU2 View(b)
4.6 Graphical user interface to the Emu query language
Graphical User Interface to the query language
This tutorial shows how the database gt is searched for a complex query using the graphical user
interface to the Emu query language from within the Emu Query Tool. It is shown how each
simple query is entered in the interface so that the complex query can be calculated from the system.
Subsequently, the database is searched and the result is shown. A segment is opened with the Emu
labelling tool and all parts of the query are explained to show that the result matches the query.
4.8 Building annotation structures semi-automatically with Emu-Tcl
Build annotation structures semi-automatically
This tutorial shows how to write an Emu-Tcl script that links automatically labels on different tiers in a
parent-child relationship depending on their time stamps. For this purpose, a script is written and
explained, the child-parent relationship and the path to the Emu-Tcl script are added to the template.
Finally an utterance is opened and the script is run with the Build Hierarchy button.
Build annotation over an entire database
This tutorial shows how an Emu-Tcl script is run over the entire database using the Emu AutoBuild
Tool. The wizard steps that are provided by the tool are presented. Finally utterances are opened that
shows the result of the procedure.
Convert flat annotations into hierarchical form
This tutorial shows how to convert the flat annotations of the aetobi database into a hierarchical
form using the Emu AutoBuild Tool with the prepared tobi2hier.txt Emu-tcl script.
Finally an utterance is opened to see what the script does.
Download