Gesture Interfaces Via Sound: Clever Ideas Abound
May 10, 2012
File this one under "cool concept; unclear implementation fit." Microsoft Research and the University of Washington have partnered to develop SoundWave, a proof-of-concept system that leverages a computer's microphone and speaker combo to implement rudimentary gesture control. Doppler shifts, akin to those harnessed by sonar and astronomers, are at the root of the scheme. The speaker emits tones in the 18-22 kHz frequency range, which are spectrally affected by hand movement. A microphone picks up the audio, and software subsequently compares the source to the altered output, thereby discerning hand distance, location and speed and direction of gesture movement.
One issue I see upfront with the scheme is that it seemingly requires a fairly high quality transducer set to both generate and capture such high frequency sounds; the cheap speakers and microphones used with most computers probably aren't up to the task. Also, given the prevalence of webcams built not only into laptop bezels but also within standalone computer displays, an embedded vision-based gesture scheme would seem to be a more robust alternative solution, even more so if a depth-discerning image sensor were included in the mix.
Nonetheless, SoundWave is a nifty hack and might be valuable as an embedded vision gesture interface algorithm supplement even if it's insufficiently robust in its own right. For more, check out the video below:
If the video player doesn't appear (it works for me in Safari but not Firefox, for example), download the Quicktime MP4 source.
- ??
- ADAS
- aerospace
- Analog Devices DSP
- analytics
- Android
- Apple
- Apple iPad
- Apple iPhone 4S
- Apple iPhone iPod touch
- Aptina
- Augmented Reality
- Automotive
- Automotive vision
- Autonomous drone
- Autonomous Vehicle
- Autostereoscopic displays
- Azumio
- Barcode
- Baseball
- Biometrics
- Blackfin Embedded Vision Starter Kit Hands-on Workshop
- blur
- Boston Image Processing and Computer Vision Group
- Camera design
- cameraphone
- Carnegie Mellon
- CCD
- CES
- CEVA
- CMOS
- CogniMem
- Comic
- Computational Photography
- computer vision
- Contamination detection
- De-warping
- Design News
- DESIGN West
- Disney
- Driver assistance
- driver information
- embedded vision
- embedded vision alliance
- Embedded Vision Summit
- Embedded vision training
- Embedded Vision Tutorial
- emotion
- emotion detection
- eva
- Evaluation modules
- Eye tracking
- eyeSight
- Face detection
- face recognition
- Facial detection
- Facial recognition
- FiRe
- flying
- Focus
- Foxconn
- games
- gesture
- gesture interface
- Gesture interfaces
- gesture recognition
- GestureTek
- HDR
- health
- High-speed camera
- High-speed capture
- High-speed video camera
- IBM
- IEEE
- IEEE Embedded Vision Workshop
- Image analysis
- Image compression
- Image recognition
- Image sensor
- image sensors
- Image Sensors 2013
- IMS Research
- Industrial vision
- Intel Gesture Interface Facial Recognition
- investment
- iOS
- iPad
- Jitendra Malik
- Kinect
- Kinect Optical Scanner Robotics
- Kodak
- Light intensity detection
- Linley Group
- Lytro
- Mac OS X
- medical
- Medical imaging
- microsoft
- Microsoft Kinect
- military
- mobile
- Motion
- Motion Capture
- Move
- National Instruments
- Neural networks
- New members
- Newsletter
- Nokia
- Nvidia
- NVIDIA Android
- nViso
- object tracking
- object video
- open source
- OpenCV SimpleCV Python C C++
- OpenNI
- optical character recognition
- Optical flow
- Organic Motion
- panel
- Panorama mode
- Parking analytics
- patent
- pc
- PlayStation
- PlayStation Move
- PointGrab
- presentation
- Processors
- pulse
- pulse rate measurement
- Qualcomm
- Raspberry Pi
- Remote control
- Robotics
- robots
- rolling shutter
- Samsung smartphone
- Satellites
- SDK
- search
- security
- slow motion
- Smart TV
- Smartphone
- Soccer
- Sony
- Sports
- Still image analytics
- Surface visualization
- surgery
- Surveillance
- Synopsys
- Tegra 3
- tennis
- Tensilica
- Texas Instruments
- Thermal imaging
- TI
- traffic control
- traffic lights
- user interface
- VanGogh Imaging
- videantis
- video analytics
- Video editing
- Video surveillance
- Videoconferencing
- VideoSurf
- Virtual shopping
- Vision
- Vision Research Phantom
- Volvo
- Webcast
- Website
- x86
- Xbox 360
- xkcd








