Consonant Vowel Tokens

An extensive database of 1,728 isolated CVs is available through this website. 8 tokens for each of 18 consonants in 3 vowel contexts were digitally recorded from 4 talkers (2 males and 2 females).

Directories and filenames describe the CV and the talker. The three vowel contexts are labeled in ascii as:

The 18 consonant contexts are labeled in ascii as: The CV directory includes 46 subdirectories containing each CV combination from the 18 consonants and 3 vowels. The name of the subdirectory is the concatonation of the CV (ba, da, SHee, etc.). Within each subdirectory there are 8 exemplars from each of the 4 talkers. These files are named CV.n.SU, where the CV matches the name of the subdirectory, n is the token number, and SU is the identification for each subject. Subjects "peg" and "jb" are female, and "bh" and "bps" are male.

The data files contain raw big-endian short integer data, with no header. The sampling rate is 16 kHz.

To download tar files containing 32 exemplars for each one of the 46 CVs, choose the CV from the following gzipped tar files:

CV tar files

To download a single exemplar, select the CV you would like from the list below, and then shift-click on the desired token to save it to your computer.
a ee oo
b ba tokens bee tokens boo tokens
d da tokens dee tokens doo tokens
g ga tokens gee tokens goo tokens
p pa tokens pee tokens poo tokens
t ta tokens tee tokens too tokens
k ka tokens kee tokens koo tokens
m ma tokens mee tokens moo tokens
n na tokens nee tokens noo tokens
s sa tokens see tokens soo tokens
Z Za tokens Zee tokens Zoo tokens
sh sha tokens shee tokens shoo tokens
SH SHa tokens SHee tokens SHoo tokens
f fa tokens fee tokens foo tokens
V Va tokens Vee tokens Voo tokens
th tha tokens thee tokens thoo tokens
TH THa tokens THee tokens THoo tokens
j ja tokens jee tokens joo tokens
ch cha tokens chee tokens choo tokens

Back to the UCLA SPAPL home page