ISD-T360SB Datasheet, PDF (50/109 Pages) List of Unclassified Manufacturers – VoiceDSP Digital Speech Processor with Master/Slave, Full-Duplex Speakerphone, Multiple Flash and ARAM/DRAM Support

ISD-T360SB

2-SOFTWARE

2.3.1 VCD (VOICE COMPRESSION AND DECOMPRESSION)

The VoiceDSP processor implements a state-of-the-art VCD algorithm of the CELP family. The algorithm provides three compression rates that can be selected dynamically (the underlying algorithm supports additional rates). PCM recording (no compression) is also provided.

The lowest compression rate, 5.3 Kbit/s, allows about 30 minutes of recording on an 8-Mbit device (depending on the relative silence period). The mid-quality compression rate, 9.9 Kbit/s, provides about 16 minutes of voice recording time. The highest compression rate, 16.8 Kbit/s, gives the highest recording quality and stores up to 10 minutes on an 8-Mbit device. For detailed information about recording times, refer to Table 1-5.
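The quoted recording times can be sanity-checked with simple arithmetic. The sketch below is only a rough estimate, not the method behind Table 1-5: it assumes that "8-Mbit" means 8 × 2^20 bits, that 1 Kbit/s is 1000 bit/s, that silence is stored at the roughly 1.0 Kbit/s mentioned under Silence Compression, and that no storage is lost to headers or directory data. The function name and the `silence_fraction` parameter are illustrative.

```python
# Rough recording-time estimate; storage overhead (headers, directory) is ignored.
DEVICE_BITS = 8 * 1024 * 1024   # assumption: "8-Mbit" means 8 * 2**20 bits
SILENCE_KBITS = 1.0             # approximate silence bit rate quoted in the text


def recording_minutes(rate_kbits, silence_fraction=0.0, device_bits=DEVICE_BITS):
    """Estimate the minutes of audio that fit on the device.

    rate_kbits        -- speech bit rate in Kbit/s (5.3, 9.9 or 16.8)
    silence_fraction  -- share of the recording detected as silence (0.0 to 1.0)
    """
    if not 0.0 <= silence_fraction <= 1.0:
        raise ValueError("silence_fraction must be between 0 and 1")
    # Average bit rate is a weighted mix of the speech rate and the silence rate.
    avg_kbits = (1.0 - silence_fraction) * rate_kbits + silence_fraction * SILENCE_KBITS
    return device_bits / (avg_kbits * 1000.0) / 60.0


for rate in (5.3, 9.9, 16.8):
    print(rate, round(recording_minutes(rate), 1), round(recording_minutes(rate, 0.15), 1))
```

Under these assumptions, the 5.3 Kbit/s estimate with no silence comes out a little above 26 minutes, and a silence share of around 15% brings it close to the quoted 30 minutes. This is consistent with the remark that the figure depends on the relative silence period.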

Before recording each message, the microcontroller selects one of the three compression rates, or PCM recording, with the compression_rate parameter of the R (Record) command. During message playback, the VoiceDSP processor reads this one-byte parameter and selects the appropriate speech decompression algorithm.
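The per-message selection described above can be pictured as a one-byte tag that travels with each message. The following is a conceptual model only. The numeric codes, the decoder labels, the message layout (tag as the first byte), and the function names are all hypothetical and are not taken from the R command definition.

```python
from enum import IntEnum


class CompressionRate(IntEnum):
    # Hypothetical one-byte codes; the real values belong to the R command description.
    PCM = 0
    RATE_5_3 = 1
    RATE_9_9 = 2
    RATE_16_8 = 3


# Illustrative labels standing in for the actual decompression routines.
DECODER_NAMES = {
    CompressionRate.PCM: "PCM (no decompression)",
    CompressionRate.RATE_5_3: "CELP 5.3 Kbit/s",
    CompressionRate.RATE_9_9: "CELP 9.9 Kbit/s",
    CompressionRate.RATE_16_8: "CELP 16.8 Kbit/s",
}


def store_message(rate, payload):
    """Model of recording: keep the one-byte rate tag in front of the data."""
    return bytes([CompressionRate(rate)]) + bytes(payload)


def select_decoder(message):
    """Model of playback: read the tag and pick the matching decoder."""
    rate = CompressionRate(message[0])  # raises ValueError for an unknown code
    return DECODER_NAMES[rate]


msg = store_message(CompressionRate.RATE_9_9, b"\x12\x34")
print(select_decoder(msg))
```

Because the tag is stored per message, the host does not need to tell the processor which algorithm to use at playback time, which matches the behaviour described in the text.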

IVS vocabularies can be prepared in any of the three compression rates, or in PCM format, using the IVS tool. All messages in a single vocabulary must be recorded using the same algorithm (see the IVS User's Guide for more details). During speech synthesis, the VoiceDSP processor automatically selects the appropriate speech decompression algorithm.

SILENCE COMPRESSION

A Voice Activity Detector (VAD) detects periods of silence while a recorded message is being compressed. The compression algorithm treats silence differently from normal voice: silence is compressed to about 1.0 Kbit/s. The compressed silence contains data that allows comfort noise to be generated during message playback. Comfort noise generation is important because the human ear is not used to "real" silence while listening to messages.
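The datasheet does not describe how the VAD works internally. As a generic illustration of the idea, the sketch below classifies fixed-length frames by their mean energy and keeps a short "hangover" after speech so that word endings are not clipped. The frame length, threshold, and hangover values are arbitrary assumptions and are not the device's tunable parameters.

```python
def frame_energy(frame):
    """Mean squared amplitude of one frame of PCM samples."""
    return sum(s * s for s in frame) / len(frame)


def classify_frames(samples, frame_len=160, threshold=1.0e4, hangover=2):
    """Return one boolean per complete frame: True = voice, False = silence.

    A frame counts as voice when its energy reaches the threshold, or when it
    falls within `hangover` frames after the last frame that did.
    """
    flags = []
    quiet_run = hangover  # start in the silence state
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        if frame_energy(frame) >= threshold:
            quiet_run = 0
            flags.append(True)
        else:
            quiet_run += 1
            flags.append(quiet_run <= hangover)
    return flags


# Two silent frames, one loud frame, then four silent frames.
signal = [0] * 320 + [1000] * 160 + [0] * 640
print(classify_frames(signal))
```

Frames flagged False are the ones a codec could store at the low silence rate, together with whatever parameters it needs to regenerate comfort noise.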

Various tunable parameters are available for tuning the VAD. Silence compression may be turned Off, though it is intended to remain On continuously. For more details, refer to Table 2-4 of the Command Description section.

NOTE

Silence compression should be turned Off when ARAM devices are used for voice storage. Otherwise, unpredictable results can be expected during message playback.

SW AGC

A Software Automatic Gain Control (SW AGC) algorithm is activated with the compression module to regulate the input signal to a dynamic range that provides higher compression quality. The algorithm senses the energy level and updates the signal gain to amplify low-energy signals and to avoid signal saturation. The SW AGC feature eliminates the need for an external HW AGC, reducing hardware cost and complexity. Hardware gain control may still be used to avoid signal saturation before the signal is sampled.
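The sense-energy-then-update-gain loop described above can be sketched as a simple frame-based AGC. This is a generic textbook-style illustration, not the algorithm implemented in the VoiceDSP firmware. The 16-bit sample range, the target level, the smoothing factor, and the default maximum gain are all assumptions chosen for readability.

```python
import math

FULL_SCALE = 32767  # assumption: 16-bit signed PCM samples


def agc(samples, frame_len=160, target_rms=4000.0, max_gain=8.0, smoothing=0.5):
    """Apply a slowly adapting gain so each frame moves toward target_rms.

    max_gain caps the amplification of quiet input; the gain may drop below
    1.0 for loud input, which keeps the signal away from saturation.
    """
    gain = 1.0
    out = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(s * s for s in frame) / len(frame))
        if rms > 0.0:
            desired = min(target_rms / rms, max_gain)
            gain += smoothing * (desired - gain)  # move part of the way each frame
        for s in frame:
            v = int(round(s * gain))
            out.append(max(-FULL_SCALE - 1, min(FULL_SCALE, v)))  # hard limit
    return out


quiet = [100, -100] * 800
boosted = agc(quiet)
print(abs(boosted[0]), abs(boosted[-1]))
```

Smoothing the gain across frames avoids audible jumps in level, and the cap on the gain keeps background noise from being amplified without limit during quiet passages.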

A tunable parameter determines the maximum gain allowed for the SW AGC algorithm. The SW AGC may be turned Off, though it is intended to remain On continuously. For more details, refer to Table 2-4 of the Command Description section.

VARIABLE SPEED PLAYBACK

This feature increases or decreases the speed of recorded and synthesized messages during playback. Use the SPS (Set Playback Speed) command to set the speed of message playback. The new speed applies to all recorded messages and to synthesized messages (only if synthesized using IVS) until it is changed by another SPS command. If this command is issued while the VoiceDSP processor is in the PLAY state, the speed also changes for the message currently being played.

The speedup/slowdown algorithm is designed to maintain the pitch of the original speech, so the speech tone stays the same while the playback speed varies.
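The datasheet does not name the speed-change algorithm. One common family of pitch-preserving methods is overlap-add (OLA) time-scale modification: short windowed frames are read from the input at one hop size and written to the output at another, so the duration changes while each frame keeps its original waveform and therefore its pitch. The sketch below is a plain OLA illustration under that assumption. Practical systems usually add waveform alignment (for example SOLA or WSOLA) to reduce artifacts, and the frame and hop sizes here are arbitrary.

```python
import math


def time_stretch(samples, speed, frame_len=240, synth_hop=120):
    """Plain overlap-add time-scale modification.

    speed > 1.0 shortens the signal (faster playback), speed < 1.0 lengthens it.
    Frames are copied without resampling, so the local pitch is unchanged.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    analysis_hop = max(1, round(synth_hop * speed))  # input step per frame
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * n / frame_len)
              for n in range(frame_len)]             # periodic Hann window
    if len(samples) < frame_len:
        return []
    n_frames = (len(samples) - frame_len) // analysis_hop + 1
    out_len = (n_frames - 1) * synth_hop + frame_len
    out = [0.0] * out_len
    norm = [0.0] * out_len
    for k in range(n_frames):
        src = k * analysis_hop   # where the frame is read from
        dst = k * synth_hop      # where the frame is written to
        for n in range(frame_len):
            out[dst + n] += window[n] * samples[src + n]
            norm[dst + n] += window[n]
    # Divide by the summed window weights so overlapping frames keep unit gain.
    return [o / w if w > 1e-9 else 0.0 for o, w in zip(out, norm)]


tone = [math.sin(2.0 * math.pi * 200.0 * n / 8000.0) for n in range(8000)]
print(len(time_stretch(tone, 2.0)), len(time_stretch(tone, 0.5)))
```

Simply resampling the audio would also change its duration, but it would shift the pitch by the same factor. Copying frames at their original sample spacing is what keeps the speech tone the same.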


Voice Solutions in Silicon™
