Communication: Humans Talk. Computers Talk - Smart Home Automation with Linux and Raspberry Pi

Hardware Reference

In-Depth Information

else if ($cmd == "lightsoff") {

system("heyu turn bedroom_light off");

}

?>

You can then work on abstracting and extending this at will. In Chapter 7 you'll integrate this into a

general-purpose message system.

N Before investing heavily in voice recognition software, ensure that it can distinguish between whatever

different voices can control the system, because a lot of software can listen only to a single, preselected voice as its

primary purpose is dictation and not voice recognition.

Note

Note that most software of this type doesn't provide access to the words you've actually spoken; the computer just

thinks that there's a higher probability of it being this one than that one. Although this gives you fewer opportunities

for error, it also prevents the use of any analog or scaled commands, such as “dim to 72%.”

From iOS

Something that has partially solved the problem of voice recognition is the cloud. By moving the processing into a

remote device means that you can perform recognition by using more powerful algorithms than would traditionally

be available on a hand-held machine, albeit at the expense of requiring yourself to be online, and finding yourself at

the mercy of a (traditionally) nonopen technology. The most famous example of this has been Siri.

Released in 2010 as a standalone app, Siri has since been incorporated into the iPhone 4s and has been a staple

feature of versions since iOS 5 on both iPhone and iPad. In its traditional mode of operation what happens is:

u

Siri records your voice from the microphone

guzzoni.apple.com

u

Your iOS device uploads a rendering of it to their servers at

u

The guzzoni server processes the voice, and works out the request

u

The guzzoni server returns a command packet to your iOS device

As you can see, two parts of the chain require an iOS device. However, with tinkering (and software such as

SiriProxy) you can circumvent half of this process. The operation then becomes:

u

Siri records your voice from the microphone

u

Your iOS device uploads a rendering of it to your server

guzzoni.apple.com

u

Your server passes the request to

u

The guzzoni server processes the voice, and works out the request

u

The guzzoni server return a command packet to your server

u

Your server can then process any arbitrary command, using the text string that was recognized

u

Your server passes a result string (with optional image) to the iOS device

As you can see, there is a fairly traditional proxy metaphor taking place here, with your server acting as a go

between for the iOS device the guzzoni server so that it is able to step into the middle of the process, to perform

custom commands.

Smart Home Automation with Linux and Raspberry Pi

Search WWH ::

Custom Search

Home