This is a simple one.
When you’re in Windows Explorer and want to make a new folder, say
Under f w f (or Under Foxtrot Whiskey Foxtrot)
This will leave you all set to name the folder.
This is a simple one.
When you’re in Windows Explorer and want to make a new folder, say
Under f w f (or Under Foxtrot Whiskey Foxtrot)
This will leave you all set to name the folder.
I’ve been getting a lot of questions lately about filling out forms using speech commands.
Here’s how it works:
The UC Keyword List allows you to say “Find” followed by any word or phrase to select that word or phrase in any program that has a Find dialog box. This includes Firefox and Internet Explorer. You can combine the word-finding ability with the Tab key to go straight to any field. For example, “Find First Tab” puts your cursor in the “First Name” field.
To see the forms facility in action, take this self-guided tour:
“Demo 2 File”
“Find Address Tab”
44 State St.
“Find Phone Tab”
555-555-5555
The Keyword List comes loaded with common key words and phrases. Say “Add Keyword” to add your own (say “UC Lesson 10.1” for detailed instructions).
4/15/09 Note: The release version of Utter Command also includes the UC Tab List, which allows you to mix tabs and phrases in order to fill out two fields in a form at once. For instance, you can say “2 Tab John 1 Tab Smith”, to tab 2 fields forward, then fill in “John” and “Smith” in successive fields, or “3 Tab Address” to tab 3 fields forward and fill in your address, for instance 29 Downing St. See the UC List explanation in Overview for details, including pictures.
It’s easy to remove formatting from selected text in a Word document — “Control Space”, but what do you do in programs that don’t have this function?
There’s not a keyboard shortcut for removing formatting in Google documents, but here’s a two-step way to get the job done. You can copy the text to one of the UC Clipboard files, which will save it as .txt, then paste the plain text from that file.
Here’s the sequence:
Removing formatting from selected text in a Google document
Copy to 1 File
1 File Paste
You can speed things up by selecting text and copying to a UC clipboard file in one step. Here are a couple of examples:
Selecting text and removing its formatting in a Google document
Line Copy to 1 File
1 File Paste
or
2 Graphs · Copy to 1 File
1 File Paste
It’s easy to remove formatting from selected text in a Word document — “Control Space”, but what do you do in programs that don’t have this function?
There’s not a keyboard shortcut for removing formatting in Google documents, but here’s a two-step way to get the job done. You can copy the text to one of the UC Clipboard files, which will save it as .txt, then paste the plain text from that file.
Here’s the sequence:
Removing formatting from selected text in a Google document
Copy to 1 File
1 File Paste
You can speed things up by selecting text and copying to a UC clipboard file in one step. Here are a couple of examples:
Selecting text and removing its formatting in a Google document
Line Copy to 1 File
1 File Paste
or
2 Graphs · Copy to 1 File
1 File Paste
It’s important to save documents you’re working on (say “Control s”), but it’s all too easy to get absorbed in what you are writing and forget to save.
Here are a couple of commands that make this easier:
Making a new line and saving in one step
Enter · Control s
Making a paragraph and saving in one step
2 Enter · Control s
Keyboard shortcuts are powerful tools for the speech interface because they work across all programs and they can be combined — you can say several keyboard shortcuts in one phrase to speed things up.
This is why we encourage all software makers to make all features available via keyboard shortcuts.
Google is experimenting with adding keyboard shortcuts to search results. Here are the experimental keyboard shortcuts:
Command | Action |
Letter J | Selects next result |
Letter K | Selects previous result |
Enter (or Letter O) | Opens selected result |
Slash | Moves cursor to search box |
Escape | Moves cursor to results |
And here’s how to speed things up further with Utter Command combinations:
Command | Action |
Letter J · Enter | Opens next result |
Letter K · Enter | Opens previous result |
J Times 1-100 | Moves down 1-100 and selects result |
K Times 1-100 | Moves up 1-100 and selects result |
J Times 1-100 · Enter | Moves down 1-100 and opens result |
K Times 1-100 · Enter | Moves up 1-100 and opens result |
Escape · Enter | Moves cursor to results and opens |
To try these out
1. Go to the Google experimental page www.google.com/experimental/1
2. Under the Keyboard Shortcuts heading click “Join Experiment”
3. Go to regular Google search www.google.com2 or Advanced Google search www.google.com/advanced_search?hl=en3, type a query, then try the shortcuts on the results.
As long as you’re logged in you’ll be able to use these shortcuts in the regular and advanced Google search pages.
Note: the Join Experiment button uses cookies. If your browser is set to remove all cookies at the end of a session and you want to retain this setting add www.google.com to your exceptions list (Firefox: Tools/Options/Privacy/Exceptions; Internet Explorer: Tools/Options/Privacy/Sites).
The SpeechTEK speech conference has a lot to say about the state of the desktop speech interface. The exhibits in and 2006 and 2007 were largely about where all the speech interface action is these days — not on the desktop, but over the telephone with interactive voice response (IVR) systems.
I went to several sessions aimed at the voice user interface designers (Vuids) who construct telephone speech command interfaces (even though I’m something of an imposter as a desktop voice user interface designer — I guess Dvuid would be the appropriate term).
We’re dealing with a lot of the same issues, though often with different twists:
And we’re looking at similar answers:
I’ve also been thinking about the differences between IVR and the desktop speech interface — these differences make the challenges more difficult or easier for each of the systems.
Here’s an attempt to explain the potential of the speech interface.
Controlling a computer using a mouse and keyboard is a very specific type of control, and for many years it was all we knew. This type of control still defines how we think about communicating with computers.
While it’s good to tap existing knowledge, it’s important not to let experience confine new methods of communication.
The way today’s speech interfaces work, speech commands often follow in the footsteps of the keyboard and mouse (“File”, “Open”, “Budget”, “Enter”) rather than tapping the full potential of speech (“Budget Folder”).
Think about the differences between road travel and air travel.
A plane goes faster than a car, so following a road from the air is faster than driving, and following roads might not be a bad idea at first to get your bearings. But the real power of air travel is the ability to travel any route, including over areas inaccessible by car like large bodies of water, mountain ranges and polar regions.
The Human-Machine Grammar that underpins Utter Command is aimed at mapping the best way to use speech to control the computer. The real power of speech is the ability to command the computer in ways not possible using the keyboard and mouse.
Here’s another metaphor:
In the days when cars that went 15 miles an hour were cutting-edge, this seemed fast — four times faster than walking and you didn’t have to expend energy. It may seem like working on a computer is fast today. It’s not. Speech has the potential to take us into another realm in terms of productivity.
Redstart Systems was born of user frustration and disappointment with speech software that seemed to fall far short of its potential.
In 1993 the heavy computer use of a deadline journalist caught up with me in the form of severe repetitive stress injuries in both hands. The good news was desktop speech recognition had just arrived. The bad news was it wasn’t really practical to use. Recognition wasn’t great, commands were difficult to remember and commanding a computer using speech was just plain slow.
More than a decade later recognition has improved dramatically — the NaturallySpeaking speech engine makes dictating to a computer quite accurate. Commands, however, are still difficult to remember and often slower than the keyboard and mouse, making things like bringing up files, editing and formatting, cutting and pasting, setting up email messages, and Web research more than a little tedious.
Utter Command is the product of a decade of frustration and experimentation with a component of speech-recognition that has lagged behind efforts to improve dictation accuracy — the speech user interface, or words you use to control the computer.
Utter Command works the way your brain does and makes controlling your computer using speech commands cognitively easy and blazingly fast. Really. Commands are underpinned by an organized grammar system informed by cutting edge research in cognition, linguistics, networking and human behavior. This makes commands easy to remember and, more importantly, gives you the ability to combine commands, which not only speeds everything up but enables more than is possible using just the keyboard and mouse.
Instead of following in the footsteps of the keyboard and mouse, Utter Command allows you to fly along by carrying out many computer steps at once. Take a look at someone humming along on the keyboard and mouse and notice how many steps everything takes. Most of these steps are only necessary because keys and screen space are limited. If you don’t have to think between steps, there’s no need for separate steps other than to accommodate the computer.
Here’s a quiz for you:
How many steps should you have to go through to
a. Navigate to a folder you already know the name of
b. Navigate to a file you already know the name of
c. Set up an email message to a couple of friends and CC a couple more
d. Search for the definition of “prosody” on a particular Web site
(Our answers are at the bottom of this post.)
It’s high time we stopped accommodating the computer.
We’re getting ready — interface-wise — to cross over to a world where speech commands will untether you from the keyboard and kick your computer use into high gear.
In this world you’ll have choices — for everything you do on the computer you can use speech, the keyboard, or the mouse. And if you need to use speech all the time, Utter Command allows you to do everything by speech that you can using the mouse and keyboard.
a. 1 step b. 1 step c. 1 step d. 1 step
Redstart Systems makes speech interface software that speeds computer use. We’ve just launched a pre-release version of Utter Command for NaturallySpeaking Professional.
Utter Command is the culmination of more than a decade of using and thinking about the speech interface. Utter Command is based on Human-Machine Grammar, a system of words and rules that follows the way the brain works. UC commands are concise, consistent and combinable, which makes for powerful, easy-to-use speech software.
There’s lots more to think about, as technological improvements to speech engine software and microphones, faster computers, smaller computers, and new technologies like portable projectors and electronic paper make it more and more practical to use speech to control machines.
In this blog I’ll explore all aspects of using speech to control a computer.