CA2216224A1 - Block algorithm for pattern recognition

Info

Publication number: CA2216224A1
Application number: CA002216224A
Authority: CA
Inventors: Peter R. Stubley; Andre Gillet; Vishwa N. Gupta; Christopher K. Toulson; David B. Peters
Original assignee: Northern Telecom Ltd
Current assignee: Nortel Networks Ltd
Priority date: 1997-09-19
Filing date: 1997-09-19
Publication date: 1999-03-19
Also published as: US6092045A; EP0903728A2; EP0903728A3

Abstract

A series of observations representing unknown speech is compared to stored models representing known speech. The series of observations is divided into at least two blocks, each comprising two or more of the observations, and the comparison is carried out in an order which makes better use of memory.
First, the observations in one of the blocks are compared (31) to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models. This step is repeated (33) for models other than those in the subset, and the whole process is repeated (34) for each block.
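
The loop ordering described in the abstract can be sketched as follows. This is a minimal illustration only, not the implementation disclosed in the patent: the names `block_compare`, `toy_score`, `block_size` and `subset_size` are hypothetical, each "model" is reduced to a single scalar mean, and the score is a simple additive distance so that no state needs to be carried from one block to the next.

```python
from typing import Callable, Dict, List, Sequence, Tuple


def block_compare(
    observations: Sequence[float],
    models: Dict[str, float],
    block_size: int,
    subset_size: int,
    score_block: Callable[[float, Sequence[float]], float],
) -> Tuple[Dict[str, float], List[Tuple[int, str]]]:
    """Score every model against every block, block by block.

    Outer loop: blocks of observations.
    Middle loop: subsets of the models.
    Inner loop: the models within the current subset.
    """
    names = list(models)
    totals = {name: 0.0 for name in names}
    visit_order: List[Tuple[int, str]] = []
    starts = range(0, len(observations), block_size)
    for block_index, start in enumerate(starts):
        # The final block may be shorter if the lengths do not divide evenly.
        block = observations[start:start + block_size]
        for s in range(0, len(names), subset_size):
            for name in names[s:s + subset_size]:
                # Several observations are consumed while this model is in use.
                totals[name] += score_block(models[name], block)
                visit_order.append((block_index, name))
    return totals, visit_order


def toy_score(mean: float, block: Sequence[float]) -> float:
    """Hypothetical stand-in for a likelihood: negative squared distance."""
    return -sum((x - mean) ** 2 for x in block)
```

With this ordering each model is visited once per block rather than once per observation, which is the property the abstract relies on for better memory use.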

Claims

1. A method of comparing a series of observations representing unknown speech, to stored models representing known speech, the series of observations being divided into at least two blocks each comprising two or more of the observations, the method comprising the steps of:
a) comparing two or more of the observations in one of the blocks of observations, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models;
b) repeating step a) for models other than those in the subset; and
c) repeating steps a) and b) for a different one of the blocks.

2. The method of claim 1 wherein the observations are represented as multidimensional vectors, for the comparison at step a).

3. The method of claim 1 wherein the comparison at step a) uses a Viterbi algorithm.

4. The method of claim 1 wherein the models are represented as finite state machines with probability distribution functions attached.

5. The method of claim 1 wherein the models comprise groups of representations of phonemes.

6. The method of claim 1 wherein the models comprise representations of elements of speech, and step a) comprises the step of:
comparing the block of observations to a predetermined sequence of the models in the subset.

7. The method of claim 1 wherein step a) comprises the steps of:
comparing the block of observations to a predetermined sequence of the models in the subset;
determining for each of the models in the sequence, a score which represents the likelihood of a match with the observations compared so far;
storing the score in a score buffer for use in determining scores of subsequent models in the sequence; and
determining when the score is no longer needed, then re-using the score buffer to store a subsequent score.
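
The score-buffer re-use of claim 7 can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: `BufferPool` and `score_sequence` are hypothetical names, every model is reduced to a single state with a self-loop, transition penalties are omitted, and only one block is processed. Each buffer holds, per frame of the block, the score available for entering the next model in the sequence.

```python
import math
from typing import List

NEG_INF = -math.inf


class BufferPool:
    """Hands out fixed-size score buffers and recycles released ones."""

    def __init__(self, size: int) -> None:
        self.size = size
        self.free: List[List[float]] = []
        self.allocated = 0  # number of buffers ever created

    def acquire(self) -> List[float]:
        if self.free:
            return self.free.pop()
        self.allocated += 1
        return [NEG_INF] * self.size

    def release(self, buf: List[float]) -> None:
        self.free.append(buf)


def score_sequence(block_loglik: List[List[float]], pool: BufferPool) -> float:
    """Best log score of aligning one block to a linear sequence of models.

    block_loglik[k][t] is the log-likelihood of frame t under model k.
    """
    in_buf = pool.acquire()
    in_buf[:] = [0.0] + [NEG_INF] * (pool.size - 1)  # enter model 0 at frame 0
    state = NEG_INF
    for loglik in block_loglik:
        out_buf = pool.acquire()
        state = NEG_INF
        for t, ll in enumerate(loglik):
            out_buf[t] = state  # exit score offered to the next model at frame t
            state = max(state, in_buf[t]) + ll
        pool.release(in_buf)  # entry scores are no longer needed: recycle
        in_buf = out_buf
    pool.release(in_buf)
    return state
```

For a linear sequence only two buffers are ever allocated, however many models the sequence contains, because each input buffer is released as soon as the model that reads it has been scored.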

8. The method of claim 1 wherein step a) comprises the step of:
comparing the block of observations to a lexical graph comprising a predetermined sequence of the models in the subset, wherein the sequence comprises different types of models, and the comparison is dependent on the type;
and the method comprises the step of:
determining the types of the models before the block is compared.

9. The method of claim 1, the models comprising finite state machines, having multiple state sequences, wherein step a) comprises the steps of:
determining state scores for the matches between each respective observation and state sequences of the respective model; and
making an approximation of the state scores, for the observation, for storing to use in matching subsequent observations, the approximation comprising fewer state scores than were determined for the respective observation.

10. A method of recognising patterns in a series of observations, by comparing the observations to stored models, using a processing means having a main memory for storing the models and a cache memory, the cache memory being too small to contain all the models and observations, the series of observations being divided into blocks of at least two observations, the method comprising the steps of:
a) using the processor to compare a subset of the models to the observations in one of the blocks of observations, to recognise the patterns, the subset of the models being small enough to fit in the cache memory;
b) repeating step a) for a different subset of the models; and
c) repeating steps a) and b) for a different one of the blocks.
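
One way to form subsets that are "small enough to fit in the cache memory" is a greedy partition under a byte budget. The sketch below is an assumption for illustration, not a procedure stated in the claims: `partition_by_budget`, the model sizes and the budget are all hypothetical, and in practice the budget would also need to leave room for the current block of observations and any score buffers.

```python
from typing import Dict, List


def partition_by_budget(model_sizes: Dict[str, int], budget: int) -> List[List[str]]:
    """Group models, in order, into subsets whose total size fits the budget."""
    subsets: List[List[str]] = []
    current: List[str] = []
    used = 0
    for name, size in model_sizes.items():
        if size > budget:
            raise ValueError(f"model {name!r} alone exceeds the budget")
        if current and used + size > budget:
            # The current subset is full: close it and start a new one.
            subsets.append(current)
            current, used = [], 0
        current.append(name)
        used += size
    if current:
        subsets.append(current)
    return subsets
```

Each returned subset can then be compared against a whole block before the next subset is loaded, so a model is fetched into the cache once per block instead of once per observation.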

11. A method of recognising patterns in a series of observations by comparing the observations to stored models, the series of observations being divided into at least two blocks each comprising two or more of the observations, the models comprising finite state machines, having multiple state sequences, the method comprising the steps of:
a) comparing two or more of the observations in one of the blocks of observations, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models, by determining which of the state sequences of the respective model is the closest match, and how close is the match;
b) repeating step a) for models other than those in the subset; and
c) repeating steps a) and b) for a different one of the blocks.
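
Determining "which of the state sequences of the respective model is the closest match, and how close is the match" is commonly done with a Viterbi recursion, which claim 13 names as one option. The following is a minimal log-domain sketch for a single model, written so that it can be applied one block at a time with the state scores carried between blocks. The function name `viterbi_block`, the two-state model and the squared-distance emission score are hypothetical, and storing a full path per state is an illustrative shortcut rather than an efficient traceback.

```python
import math
from typing import Callable, List, Sequence, Tuple

NEG_INF = -math.inf
Path = Tuple[int, ...]


def viterbi_block(
    log_trans: List[List[float]],
    log_emit: Callable[[int, float], float],
    scores: List[float],
    paths: List[Path],
    block: Sequence[float],
) -> Tuple[List[float], List[Path]]:
    """Advance per-state best scores and best state sequences over one block."""
    n = len(scores)
    for obs in block:
        new_scores: List[float] = []
        new_paths: List[Path] = []
        for j in range(n):
            # Best predecessor state for arriving in state j at this frame.
            i = max(range(n), key=lambda k: scores[k] + log_trans[k][j])
            new_scores.append(scores[i] + log_trans[i][j] + log_emit(j, obs))
            new_paths.append(paths[i] + (j,))
        scores, paths = new_scores, new_paths
    return scores, paths


# Hypothetical two-state left-to-right model with scalar observations.
HALF = math.log(0.5)
LOG_TRANS = [[HALF, HALF], [NEG_INF, 0.0]]
MEANS = [0.0, 1.0]


def toy_emit(state: int, obs: float) -> float:
    return -(obs - MEANS[state]) ** 2
```

Calling `viterbi_block` once on a whole utterance, or repeatedly on consecutive blocks while passing the returned scores and paths back in, performs the same arithmetic in the same order, so the final score and state sequence do not depend on how the observations are blocked.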

12. The method of claim 11 wherein the observations are speech signals, and the models are representations of elements of speech.

13. The method of claim 11 wherein the comparison at step a) uses the Viterbi algorithm.

14. The method of claim 11 wherein the models are represented as finite state machines with probability distribution functions attached.

15. A method of comparing a series of observations representing unknown speech, to stored models representing known speech, by comparing the observations to stored models, the series of observations being grouped into one or more blocks each comprising two or more of the observations, the models comprising finite state machines, having multiple state sequences, the method comprising, for each of the one or more blocks, the steps of:
a) comparing two or more of the observations in the respective block, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models, by determining which of the state sequences of the respective model is the closest match, and how close is the match; and
b) repeating step a) for models other than those in the subset.

16. Software stored on a computer readable medium for comparing a series of observations representing unknown speech, to stored models representing known speech, the series of observations being divided into at least two blocks each comprising two or more of the observations, the software being arranged for carrying out the steps of:
a) comparing two or more of the observations in one of the blocks of observations, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models;
b) repeating step a) for models other than those in the subset; and
c) repeating steps a) and b) for a different one of the blocks.

17. Software stored on a computer readable medium for recognising patterns in a series of observations by comparing the observations to stored models, the series of observations being divided into at least two blocks each comprising two or more of the observations, the models comprising finite state machines, having multiple state sequences, the software being arranged to carry out the steps of:
a) comparing two or more of the observations in one of the blocks of observations, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models, by determining which of the state sequences of the respective model is the closest match, and how close is the match;
b) repeating step a) for models other than those in the subset; and
c) repeating steps a) and b) for a different one of the blocks.

18. Software stored on a computer readable medium for comparing a series of observations representing unknown speech, to stored models representing known speech, by comparing the observations to stored models, the series of observations being grouped into one or more blocks each comprising two or more of the observations, the models comprising finite state machines, having multiple state sequences, the software being arranged to carry out for each of the one or more blocks, the steps of:
a) comparing two or more of the observations in the respective block, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models, by determining which of the state sequences of the respective model is the closest match, and how close is the match; and
b) repeating step a) for models other than those in the subset.

19. A speech recognition processor for comparing a series of observations representing unknown speech, to stored models representing known speech, the series of observations being divided into at least two blocks each comprising two or more of the observations, the processor being arranged to carry out the steps of:
a) comparing two or more of the observations in one of the blocks of observations, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models;
b) repeating step a) for models other than those in the subset; and
c) repeating steps a) and b) for a different one of the blocks.

20. A speech recognition processor for recognising patterns in a series of observations by comparing the observations to stored models, the series of observations being divided into at least two blocks each comprising two or more of the observations, the models comprising finite state machines, having multiple state sequences, the processor being arranged to carry out the steps of:
a) comparing two or more of the observations in one of the blocks of observations, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models, by determining which of the state sequences of the respective model is the closest match, and how close is the match;
b) repeating step a) for models other than those in the subset; and
c) repeating steps a) and b) for a different one of the blocks.

21. A speech recognition processor for comparing a series of observations representing unknown speech, to stored models representing known speech, by comparing the observations to stored models, the series of observations being grouped into one or more blocks each comprising two or more of the observations, the models comprising finite state machines, having multiple state sequences, the processor being arranged to carry out, for each of the one or more blocks, the steps of:
a) comparing two or more of the observations in the respective block, to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models, by determining which of the state sequences of the respective model is the closest match, and how close is the match; and
b) repeating step a) for models other than those in the subset.