my papers
now total is 4
NO.1
A New Markov Model for Speech Recognition
Zhu Qifeng , 14th Lab of the Institute of Acoustic, 11.11,1995 [
Abstract ] In this paper we give out and emphasize the concept of the nonlinear
property of the information distribution of speech signal in the time domain.
Some exist solutions are reviewed. With two pragmatic principles we give
out a new method to modify the traditional HMM to fit this property. This
method is tested in non- continuous speech recognition. The result is also
reported in this paper and discussed. Key Word: HMM, speech recognition,
nonlinear Topic area: (B) Speech Recognition Mail address: 14 Lab , mailbox
2712, Beijing, 100080 Email: zqf@speech1.ioa.ac.cn Telephone: 2553842
NO.2.
AMM algorism for speech recognition
Zhu Qifeng, 14th Lab of the Institute of Acoustic
Abstract
In this paper I first discuss the basis of the HMM
model. In HMM model the training and the matching algorithm
are with the goal to maximize the probability. From the view of
pattern recognition and robust training, the goal should be
maximize the distance among samples in different group and
to minimize the distance among samples in a same group. In
this paper we will find the basis of HMM is not the best to fit
the goal of pattern recognition, and maybe is not the most
robust one in some occurrences.
In the second half of this paper, I will discuss a
straitforward way to determine the states in the HMM model,
using the concept of "sound stimuli" initiated by Prof. Yu.
The training process is greatly simplified and no recursive process
is needed. I discuss the potential of this method to fit the goal
of pattern recognition.
Key word: HMM, speech recognition
NO.3
The Gain Factor In LPC and CEP coefficients In the Speech Recognition
Zhu Qifeng , 14th Lab of the Institute of Acoustic, 4,15,1996 [
Abstract ] In the LPC analysis in speech recognition, the gain factor,
G, and the C0 in CEP coresponds with the simulation of vocal cords, it
is firmly related with the voice amplitude. In this paper, we will illustrate
the meaning of the gain factor by means of many figures, including the
figures of LPC spectrum, CEP spectrum, frequency spectrum derived from
LPC, each of which we give a figure with and without the gain factor. And
we also give out the origin frequency spectrum of the voice. After this,
we also calculate the standard deviation of the CEP coefficients of different
orders. In this way, the weighting value can be obtained. With the direct
means and theoretical means above mentioned, we suggest that the gain factor
can be omitted.
NO.4
The coordinated hearing phenomena and its influnce in speech recognition
Zhu Qifeng , 14th Lab of the Institute of Acoustic, 5,15,1996 [
Abstract ] In this paper, we would discuss the phenomena of coodinated
hearing, i.e. the sense of a voice at one time is not only depended on
the frequency spectrum at the current time, but also depended on the frequency
spectrum of past of the future sense. After the introduction, we would
give out some examples, by which the coordinated phenomena we have studied
can be clearly seen. Then with the method of blackbox, we would give preliminary
modelized explanation for the phenomena. At the end, we would discuss the
effect of these phenomena on the speech recognition, and give out some
discuss on the possible improvement on the algorithm on speech recognition
and the HMM model. Key Word speech recognition, hearing£¬ coordinated
hearing
ok?
back
home
For
the whole paper, please contact me with zqf@ucla.edu
This page was created using WEB
Wizard Version 1.2
Copyright © 1995 ARTA Software Group and David P. Geller