SPEECH RECOGNITION PROJECT

 

Logical Designs developed a phoneme-based speech recognition system with customized preprocessing.  Running in real time, the system preprocessed the digitized audio signal, using an FFT and customized adaptive gain control technology, into inputs for a set of neural network phoneme detectors.  Output from the phoneme detectors was then combined with a phoneme dictionary to produce the text output.
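The preprocessing stage described above can be sketched roughly as follows. This is only an illustration and not the Logical Designs implementation: the frame length, the simple running-level gain control, the Hann window, and the names preprocess and dft_magnitudes are all assumptions made for the example. A naive DFT stands in for a real FFT so that the sketch needs nothing beyond the Python standard library.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Magnitude spectrum of one real-valued frame, bins 0..n/2.

    A naive O(n^2) DFT is used here in place of a real FFT (assumption).
    """
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        mags.append(abs(acc))
    return mags


def preprocess(samples, frame_len=64, target_rms=0.1, smoothing=0.9):
    """Turn a sample stream into one spectral feature vector per frame.

    The gain control is a hypothetical stand-in: it tracks a smoothed
    RMS level and scales each frame toward target_rms.
    """
    features = []
    level = target_rms  # running estimate of the signal level
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        level = smoothing * level + (1.0 - smoothing) * rms
        gain = target_rms / max(level, 1e-9)
        # Apply the gain and a Hann window before the transform.
        windowed = [
            gain * x * (0.5 - 0.5 * math.cos(2 * math.pi * i / (frame_len - 1)))
            for i, x in enumerate(frame)
        ]
        features.append(dft_magnitudes(windowed))
    return features


# Example input: a 1 kHz tone sampled at 8 kHz (illustrative values only).
rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 1000 * t / rate) for t in range(256)]
feats = preprocess(tone)
```

Each entry of feats is the kind of vector that would be handed to the phoneme detectors as input. In a real-time system the frames would arrive from an audio device rather than from a precomputed list.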

What makes this methodology interesting is its ability to avoid the most common problem facing speech recognition: segmentation vs. recognition.  Most recognizers must segment the audio stream into words or phonemes prior to recognition; the segmented data is then further processed and passed to word or phrase recognizers.  Logical Designs' phoneme detectors instead run continuously, and their output is used in combination with the dictionaries to form the output text.  Removing the segmentation step eliminates the need for unnatural gaps between words in dictation and ultimately yields higher recognition rates.
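One way to picture continuously running detectors feeding a dictionary lookup is the sketch below. The text does not say how Logical Designs combined detector output with the dictionaries, so everything here is an assumption: the threshold, the collapsing of repeated frames, the greedy longest-match lookup, and the names frames_to_phonemes and phonemes_to_text are hypothetical, and the detector scores are made up for the example.

```python
def frames_to_phonemes(frame_scores, threshold=0.5):
    """Collapse per-frame detector outputs into a phoneme sequence.

    frame_scores is a list of dicts mapping phoneme label to detector
    activation for one frame. No word boundaries are required.
    """
    phonemes = []
    previous = None
    for scores in frame_scores:
        best = max(scores, key=scores.get)
        current = best if scores[best] >= threshold else None
        # Emit a phoneme only when the winning detector changes.
        if current is not None and current != previous:
            phonemes.append(current)
        previous = current
    return phonemes


def phonemes_to_text(phonemes, dictionary):
    """Greedy longest-match lookup in a phoneme dictionary (assumed scheme)."""
    longest = max(len(key) for key in dictionary)
    words = []
    i = 0
    while i < len(phonemes):
        for size in range(min(longest, len(phonemes) - i), 0, -1):
            key = tuple(phonemes[i:i + size])
            if key in dictionary:
                words.append(dictionary[key])
                i += size
                break
        else:
            i += 1  # no dictionary entry starts here; skip this phoneme
    return " ".join(words)


# Hypothetical dictionary and detector outputs for two words spoken
# without any pause between them.
dictionary = {("HH", "AY"): "hi", ("DH", "EH", "R"): "there"}
frames = [
    {"HH": 0.9, "AY": 0.1},
    {"HH": 0.8, "AY": 0.2},
    {"HH": 0.2, "AY": 0.9},
    {"DH": 0.7, "AY": 0.3},
    {"EH": 0.8, "DH": 0.1},
    {"R": 0.9, "EH": 0.2},
]
sequence = frames_to_phonemes(frames)
text = phonemes_to_text(sequence, dictionary)
```

Because the word boundary falls out of the dictionary match rather than out of a silence in the audio, the speaker does not need to pause between "hi" and "there" in this toy example.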

The Logical Designs speech techniques were developed using the TIMIT speech database to build the training sets.

Logical Designs has developed neural algorithms not available from any other source.  We can supply a custom software and hardware solution to your problem or work with the tools you have in-house.  Logical Designs has the ability and experience to make your project and application a success.