Embedded Digital System
|
|||||||||
Speech in Toyland nonetheless represents a paradox, according to contributor Larry Gaddy, marketing manager for Winbond Electronics Corp. (Hsinchu, Taiwan). The big issue in a command-and-control applications environment-even a "you-talk/it-talks-back" environment-is not the processing power of the microprocessor or DSP used to decode speech, but rather the amount of memory. More memory means more realistic speech, but it also elevates the cost of the electronics. "For speech in toys, the cost issue is always brutal," says Gaddy. Winbond, whose customers include automotive console makers like Blaupunkt in Germany, has developed industrial-strength speech recognizers-hands-free phone dialers for cars, and voice-activated speaker phones. The key to conserving expensive memory usage, Gaddy believes, is to build recognizers that rely on acoustic models-not word models. Word-based recognition, under the best of circumstances, is only about 95 percent accurate. Acoustic models, particularly for digits, are 99 percent accurate, he says. The problems of recognizing speech in young children, whose pronunciations can be challenging as well as cute, may affect the accuracy of these smart toys. "In toys, cost is king," according to contributors Jeff Rogers and Erik Soule, marketing analysts at Sensory Inc. (Sunnyvale, Calif.). "Typically, there is a four- to five-time mark-up in the toy industry, which means if technologies initially cost $5, this quickly translates to $20 to $25 retail for just the electronics. Once the plastic, plush, manufacturing and shipping are added, the product now retails at $50 to $100, or more." Sensory conserves costs by using an 8-bit processor and customizing it with digital filters and special-purpose cells. Its products include speaker-dependent and -independent recognizers, speaker verification (voice password) systems, word-spotters, speech synthesizer, voice record and playback, and four-voice music generators. Target markets, they say, are Tomogochi products, voice-controlled robots, interactive pets, dolls, password-protected products and AIBO-type dogs. The issue of adding radio-control modules to toys is addressed by distributor Insight Electronics in (San Diego, Calif.). Schlacht describes some of the transmitter-receiver modules readily available from companies like Infineon Technologies AG and RF Micro Devices. These enable serial data communications between controller and remote modules, but, admittedly, may be a bit pricey for use in toys. Until then, a common approach to a low-cost RF transmit/receive link is to use a completely discrete solution. On the transmit side, the simplest way to start is to use a known oscillator circuit and vary one or more quiescent parameters that will vary the transmitted power level. The amount of amplitude variation depends on the application, the environment, the required range and noise level tolerated. But 6 to 10 dBm is usually sufficient for controlling toys across a room or playground, Schlacht says. Infrared (IR) is useful as a very low-cost solution, Schlacht acknowledges. IR's drawback is that it depends on an unimpeded line of sight (LOS) between the controller and the remote. "Any application involving an 8-year-old that requires him to maintain an LOS between his toy and his controller is doomed to frustration and an eventual loss of interest." Iguana Robotics Inc. (Mahomet, Ill.) president Anthony Lewis and consulting scientist Ralph Etienne-Cummings, an assistant professor at Johns Hopkins University (Baltimore), have a particular slant on the kind of computations required to power a robot, whether it is used as an extension of a NASA space shuttle arm, or kid's toy. A moving device must effectively "see" where it's going, they argue, and this process of relating what they see to how they move is effectively a biological relationship. The devices they've created for commercial exploitation and for university research make a point of integrated "brain-style" object recognition. The chips use what Lewis and Etienne-Cummings call embedded biomorphic technology (EBT)-algorithms derived from human biology but used to control robotic devices. Embedding these algorithms on-chip provides a more streamlined method of control than the lookup-table sequences used by microprocessors in their instruction-fetch/instruction-execute mode of operation. A recent chip built at Johns Hopkins, for example, can process the equivalent of 12 Gflops/milliWatt with roughly 5 bits of resolution. Two chips serve as test pieces for EBT. One is an intelligent visual sensor that incorporates mixed-signal, "computation on read-out" (COR) that analyzes pixel data from a 128 x 64 array. The color processing circuits construct a histogram-effectively sizing these images-at a very high frame rate. It effectively performs an intelligent object recognition (albeit fuzzy) at a very high rate of speed. This was developed exclusively at Iguana Robotics. A second chip, developed by Iguana with Johns Hopkins University, is a locomotion controller chip, capable of controlling the movement of a four-legged robotic toy animal. The current chip can generate the basic movement pattern for a small running biped robot. "While these techniques may seem exotic," write Lewis and Etienne-Cummings, "they may be the most effective route to building low-cost, sophisticated robotlike toys." A number of the breakthroughs in motion control, image processing and speech recognition really depend on digital signal processing, points out senior fellow and resident visionary Gene Frantz of Texas Instruments Inc. (Dallas). The mass-storage industry depends on DSPs to control hard-disk drive read-head positioners, and thus increase disk drive density. DSPs also decode multimedia streams for digital versatile disk (DVD) players. But they also powered kids' toys like Speak-'N-Spell and the Julie doll. The TI fellow details future improvements in both architecture and manufacturing processes: He predicts a $5, 5,000-Mips processor that consumes 0.1 mW/Mips by 2002; and a 50,000-Mips device that will cost just 15 cents and run on 1 microWatt/Mips by 2012, ensuring an increase in smart toys and entertainment appliances. Meanwhile, the biggest job of the set-top box is MPEG-2 digital video decoding, points out Jeff Haight, product manager for high-end processors at MIPS Technologies Inc. (Mountain View, Calif.). But typically the set-top box (STB) and the television reside in a home-entertainment environment that includes advanced audio processors, digital camera hookups and multimedia Internet downloads. It will only be a matter of time before a clever marketing executive insists on providing these services from the same box-if not on the same signal-processing device.
Embedded Digital System Co.,Ltd. CANADA 嵌入数码系统公司 Email: embedigital@yahoo.com copy right © 2002 All Rights Reserved |
|
|
|