The Design of Virtual and Augmented Reality

Voice and Dictation


Last updated 2 years ago

Voice and dictation are a trickier form of input and are normally employed only at the operating system level. They let you command digital content directly, and speaking can be a natural way to communicate intent. Unfortunately, voice has no visual affordances, so users who do not know which phrase the system expects can easily become confused.

Advantages of Voice Input

Voice input is a natural way to communicate intent. It is also convenient when our arms are full or we are multitasking. On devices where typing on a keyboard is difficult, dictation can be an efficient alternative for entering text.

Challenges for Voice Input and Dictation

While voice input suits many different applications, it also faces several challenges. Voice is only as powerful as the system behind it: a user may say "louder," but the system cannot tell how much louder is enough. The same ambiguity routinely gets in the way of object manipulation. Voice and dictation systems also often mishear or misinterpret commands and speech.
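One way to cope with a relative command such as "louder" is to map it to a fixed step and to ask for confirmation when the recognizer is unsure. The following is a minimal sketch, not a real speech API: the `Recognition` type, the `VOLUME_STEP` and `MIN_CONFIDENCE` values, and the function name are all assumptions made for illustration.

```python
from dataclasses import dataclass

VOLUME_STEP = 10      # assumed step, in percent, for "louder" / "quieter"
MIN_CONFIDENCE = 0.6  # assumed cutoff below which we ask instead of acting


@dataclass
class Recognition:
    """Hypothetical recognizer result: heard phrase plus a 0..1 confidence."""
    text: str
    confidence: float


def apply_volume_command(volume: int, result: Recognition) -> tuple[int, str]:
    """Return the new volume (0-100) and a short message to show the user."""
    phrase = result.text.strip().lower()
    if result.confidence < MIN_CONFIDENCE:
        # Low confidence: change nothing and ask, rather than act on a guess.
        return volume, f'Did you say "{phrase}"?'
    if phrase == "louder":
        new_volume = min(100, volume + VOLUME_STEP)
    elif phrase == "quieter":
        new_volume = max(0, volume - VOLUME_STEP)
    else:
        return volume, "Command not recognized"
    # Echo the result so the user can see how much "louder" actually was.
    return new_volume, f"Volume {new_volume}%"
```

Echoing the resulting value gives the user the visual feedback that the spoken command itself lacks, and repeating "louder" becomes a predictable way to keep adjusting.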

Voice input in shared spaces: Voice may not be socially acceptable in spaces that you share with others. Here are a few examples:

  • The user may not want to disturb others (for example, in a quiet library or shared office)

  • Users may feel awkward being seen talking to themselves in public

  • A user may feel uncomfortable dictating a personal or confidential message (including passwords) while others are listening

Voice input of unique or unknown words: Dictation also struggles when users say words that the system may not know, such as nicknames, certain slang words, or abbreviations.
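A common mitigation is to let the app supply its own vocabulary (contact nicknames, product abbreviations) and to snap near-miss words in the transcript onto it. The sketch below uses Python's standard `difflib` for the fuzzy match; the function name, the `cutoff` value, and the sample vocabulary are assumptions for illustration, and a production recognizer would more likely accept such hints directly.

```python
import difflib


def correct_with_vocabulary(transcript, vocabulary, cutoff=0.75):
    """Replace words that closely resemble a known custom term with that term."""
    # Map lowercase forms back to the preferred spelling, e.g. "asap" -> "ASAP".
    known = {word.lower(): word for word in vocabulary}
    corrected = []
    for token in transcript.split():
        matches = difflib.get_close_matches(
            token.lower(), list(known), n=1, cutoff=cutoff
        )
        # Keep the original token when nothing in the vocabulary is close enough.
        corrected.append(known[matches[0]] if matches else token)
    return " ".join(corrected)


if __name__ == "__main__":
    print(correct_with_vocabulary("call jaz about the asap update", ["Jazz", "ASAP"]))
```

A cutoff that is too low will rewrite ordinary words into custom terms, so the threshold is worth tuning against real transcripts.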

Learning voice commands: While the ultimate goal is to converse naturally with the system, apps often still rely on specific predefined voice commands. With a large command set, the challenge is how to teach the commands without overloading the user and how to help the user remember them.
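One way to ease the learning burden is to keep the command set small, accept a few synonyms for each command, and always offer a spoken "what can I say" prompt. The sketch below is illustrative only: the `COMMANDS` table, its phrases, and both function names are hypothetical.

```python
# Hypothetical command table: canonical command -> phrases that trigger it.
COMMANDS = {
    "select": ["select", "pick", "choose"],
    "go back": ["go back", "back", "previous"],
    "open menu": ["open menu", "menu", "show menu"],
}

HELP_PHRASES = ("help", "what can i say")


def normalize(utterance):
    """Lowercase the utterance and collapse repeated whitespace."""
    return " ".join(utterance.lower().split())


def resolve_command(utterance):
    """Return the canonical command for an utterance, or None if unknown."""
    phrase = normalize(utterance)
    for command, phrases in COMMANDS.items():
        if phrase in phrases:
            return command
    return None


def respond(utterance):
    """Return the text the system would show or speak in reply."""
    if normalize(utterance) in HELP_PHRASES:
        # Teach the canonical phrases only, so the list stays short.
        return "You can say: " + ", ".join(COMMANDS)
    command = resolve_command(utterance)
    if command is None:
        return 'Sorry, I did not catch that. Say "what can I say" for options.'
    return f"OK: {command}"
```

Accepting synonyms means users do not need to recall the exact wording, while the help prompt teaches only one phrase per command.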

Link:
https://docs.microsoft.com/en-us/windows/mixed-reality/design/voice-input
Link:
https://developer.magicleap.com/en-us/learn/guides/design-voice