SlideShare a Scribd company logo
1 of 12
Multi-modal Interaction
Multimodal HCI System
› Combination of multiple modalities, or usage of more than
one independent channel signals for the interaction between
a user and a machine is termed as multimodal human
computer interaction system (MMHCI).
› A multimodal interface acts as a facilitator of human-
computer interaction via two or more modes of input.
– Input: voice, pen, gesture, face expression, etc.
– Output: voice, graphical output, etc.
A classic example of a multimodal system is the “Put That
There” system.(Bolt,1980)
18/03/2013 4
5
Multimodal Systems – Why?
› Provide transparent, flexible, and powerfully expressive means
of HCI.
› Easier to learn and use.
› Robustness and Stability.
› If used as front-ends to sophisticated application systems,
conducting HCI in modes all users are familiar with, then the
cost of training users would be reduced.
› Potentially user, task and environment adaptive.
6
• Inputs: An interactive multimodal implementation will use multiple input
modes such as audio, speech, handwriting, and keyboarding, and other
input modes.
• Outputs: An interactive multimodal implementation will use one or more
modes of output, such as speech, text, graphics, audio files, and
animation.
• Interaction manager: The interaction manager is the logical component
that coordinates data and manages execution flow from various input
and output modality component interface objects. The interaction
manager maintains the interaction state and context of the application
and responds to inputs from component interface objects and changes
in the system and environment. In some architectures the interaction
manager may be implemented as one single component. In other
architectures the interaction manager may be treated as a composition
of lesser components. Composition may be distributed across process
and device boundaries.
Multimodal interfaces process two or more combined user
input modes— such as speech, pen, touch, manual gestures,
gaze, and head and body movements— in a coordinated
manner with multimedia system output. They are a new class
of interfaces that aim to recognize naturally occurring forms of
human language and behavior, and which incorporate one or
more recognition-based technologies (e.g., speech, pen, vision).
Challenges for multimodal interface design
› More than 2 modes –e.g. spoken, gestural, facial
expression, gaze; various sensors
› Inputs are uncertain –vs. Keyboard/mouse
– Corrupted by noise
– Multiple people
› Recognition is probabilistic
› Meaning is ambiguous
Approach
Gain robustness via
– Fusion of inputs from multiple modalities
– Using strengths of one mode to compensate for
weaknesses of others—design time and run time
– Avoiding/correcting errors
– Statistical architecture
– Confirmation
– Dialogue context
– Simplification of language in a multimodal context
– Output affecting/channeling input
Differences Between Multimodal Interfaces
and GUIs
GUI
1. Assume that there is a single event
stream that controls event loop with
processing being sequential.
2. Assume interface actions (e.g. selection
of items) are atomic and unambiguous.
3. Built to be separable from application
software and reside centrally on one
machine.
4. Do not require temporal constraints.
Architecture not time sensitive.
MULTI-MODAL
1. Typically process continuous and
simultaneous input from parallel incoming
streams.
2. Process input modes using recognition
based technology, good at handling
uncertainty.
3. These have large computational and
memory requirements and are typically
distributed over the network.
4. Require time stamping of input and
development of temporal constraints on
mode fusion operations.
Application Areas
› Driver Monitoring
› Architecture and Design
› Geographical Information Systems
› Emergency Operations
› Field-based Operations
› Mobile Computing and Telecommunications
› Virtual Reality
› Pervasive/Ubiquitous Computing
› Computer-Supported Collaborative Work
› Education and Entertainment
› Intelligent Homes/Offices
› Intelligent Games
› Helping People with Disabilities

More Related Content

What's hot

What's hot (20)

HCI
HCIHCI
HCI
 
Human computer interaction-Memory, Reasoning and Problem solving
Human computer interaction-Memory, Reasoning and Problem solvingHuman computer interaction-Memory, Reasoning and Problem solving
Human computer interaction-Memory, Reasoning and Problem solving
 
HCI 3e - Ch 13: Socio-organizational issues and stakeholder requirements
HCI 3e - Ch 13:  Socio-organizational issues and stakeholder requirementsHCI 3e - Ch 13:  Socio-organizational issues and stakeholder requirements
HCI 3e - Ch 13: Socio-organizational issues and stakeholder requirements
 
interaction norman model in Human Computer Interaction(HCI)
interaction  norman model in Human Computer Interaction(HCI)interaction  norman model in Human Computer Interaction(HCI)
interaction norman model in Human Computer Interaction(HCI)
 
Human Computer Interaction (HCI)
Human Computer Interaction (HCI)Human Computer Interaction (HCI)
Human Computer Interaction (HCI)
 
Human Computer Interaction - Interaction Design
Human Computer Interaction - Interaction DesignHuman Computer Interaction - Interaction Design
Human Computer Interaction - Interaction Design
 
HCI 3e - Ch 8: Implementation support
HCI 3e - Ch 8:  Implementation supportHCI 3e - Ch 8:  Implementation support
HCI 3e - Ch 8: Implementation support
 
Screen based controls in HCI
Screen based controls in HCIScreen based controls in HCI
Screen based controls in HCI
 
HCI - Chapter 3
HCI - Chapter 3HCI - Chapter 3
HCI - Chapter 3
 
HCI 3e - Ch 5: Interaction design basics
HCI 3e - Ch 5:  Interaction design basicsHCI 3e - Ch 5:  Interaction design basics
HCI 3e - Ch 5: Interaction design basics
 
HCI 3e - Ch 16: Dialogue notations and design
HCI 3e - Ch 16:  Dialogue notations and designHCI 3e - Ch 16:  Dialogue notations and design
HCI 3e - Ch 16: Dialogue notations and design
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
HCI 3e - Ch 14: Communication and collaboration models
HCI 3e - Ch 14:  Communication and collaboration modelsHCI 3e - Ch 14:  Communication and collaboration models
HCI 3e - Ch 14: Communication and collaboration models
 
Chapter1(hci)
Chapter1(hci)Chapter1(hci)
Chapter1(hci)
 
HCI 3e - Ch 10: Universal design
HCI 3e - Ch 10:  Universal designHCI 3e - Ch 10:  Universal design
HCI 3e - Ch 10: Universal design
 
evaluation techniques in HCI
evaluation techniques in HCIevaluation techniques in HCI
evaluation techniques in HCI
 
hci in software development process
hci in software development processhci in software development process
hci in software development process
 
SELECT THE PROPER DEVICE BASED CONTROLS
SELECT THE PROPER DEVICE BASED CONTROLSSELECT THE PROPER DEVICE BASED CONTROLS
SELECT THE PROPER DEVICE BASED CONTROLS
 
golden rules of user interface design
golden rules of user interface designgolden rules of user interface design
golden rules of user interface design
 
Hci in software process
Hci in software processHci in software process
Hci in software process
 

Viewers also liked

MULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHY
MULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHYMULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHY
MULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHY
chelsiageorge20
 
Multimodal Semiotics
Multimodal SemioticsMultimodal Semiotics
Multimodal Semiotics
svngl
 
Introduction to HCI
Introduction to HCI Introduction to HCI
Introduction to HCI
Deskala
 
human computer interface
human computer interfacehuman computer interface
human computer interface
Santosh Kumar
 

Viewers also liked (20)

Multimodal Interaction: An Introduction
Multimodal Interaction: An IntroductionMultimodal Interaction: An Introduction
Multimodal Interaction: An Introduction
 
MULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHY
MULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHYMULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHY
MULTIMODAL INTERFACE OF BRAQIN COMPUTER INTERFACE AND ELECTOOCULOGRAPHY
 
Multimodality
MultimodalityMultimodality
Multimodality
 
Multimodality
MultimodalityMultimodality
Multimodality
 
MobileHCI 2016 - Technology Literacy in Poor Infrastructure Environments: Cha...
MobileHCI 2016 - Technology Literacy in Poor Infrastructure Environments: Cha...MobileHCI 2016 - Technology Literacy in Poor Infrastructure Environments: Cha...
MobileHCI 2016 - Technology Literacy in Poor Infrastructure Environments: Cha...
 
MobiMed: Comparing Object Identification Techniques on Smartphones
MobiMed: Comparing Object Identification Techniques on SmartphonesMobiMed: Comparing Object Identification Techniques on Smartphones
MobiMed: Comparing Object Identification Techniques on Smartphones
 
Type of interfaces Touch and air based-gesture
Type of interfaces Touch and air based-gestureType of interfaces Touch and air based-gesture
Type of interfaces Touch and air based-gesture
 
Sensor based interaction
Sensor based interaction Sensor based interaction
Sensor based interaction
 
Multimodality
MultimodalityMultimodality
Multimodality
 
Multimodal man machine interaction
Multimodal man machine interactionMultimodal man machine interaction
Multimodal man machine interaction
 
Multimodal Semiotics
Multimodal SemioticsMultimodal Semiotics
Multimodal Semiotics
 
Human-Computer Interaction
Human-Computer InteractionHuman-Computer Interaction
Human-Computer Interaction
 
Introduction to HCI
Introduction to HCI Introduction to HCI
Introduction to HCI
 
Lecture 1: Human-Computer Interaction Introduction (2014)
Lecture 1: Human-Computer Interaction Introduction (2014)Lecture 1: Human-Computer Interaction Introduction (2014)
Lecture 1: Human-Computer Interaction Introduction (2014)
 
Human-Computer Interaction: An Overview
Human-Computer Interaction: An OverviewHuman-Computer Interaction: An Overview
Human-Computer Interaction: An Overview
 
human computer interface
human computer interfacehuman computer interface
human computer interface
 
Human computer interaction
Human  computer interactionHuman  computer interaction
Human computer interaction
 
Gesture recognition
Gesture recognitionGesture recognition
Gesture recognition
 
Research methodology
Research methodologyResearch methodology
Research methodology
 
Research method versus research methodology
Research method versus research methodologyResearch method versus research methodology
Research method versus research methodology
 

Similar to Multimodal Interaction

Chapter 4 universal design
Chapter 4 universal designChapter 4 universal design
Chapter 4 universal design
Pado Pado
 
Multimodal and Affective Human Computer Interaction - Abhinav Sharma
Multimodal and Affective Human Computer Interaction - Abhinav SharmaMultimodal and Affective Human Computer Interaction - Abhinav Sharma
Multimodal and Affective Human Computer Interaction - Abhinav Sharma
Abhinav Sharma
 
UXD Multimedia output
UXD Multimedia outputUXD Multimedia output
UXD Multimedia output
rolfHRO
 
Introduction and Human Information Processing lecture.ppt
Introduction and Human Information Processing lecture.pptIntroduction and Human Information Processing lecture.ppt
Introduction and Human Information Processing lecture.ppt
PradeepJoshi88
 
User interface webpage
User interface webpageUser interface webpage
User interface webpage
dhaval_tare
 

Similar to Multimodal Interaction (20)

ICS3211 lecture 07
ICS3211 lecture 07ICS3211 lecture 07
ICS3211 lecture 07
 
ICS3211 Lecture 07
ICS3211 Lecture 07 ICS3211 Lecture 07
ICS3211 Lecture 07
 
Chapter 4 universal design
Chapter 4 universal designChapter 4 universal design
Chapter 4 universal design
 
ICS3211_lecture 08_2023.pdf
ICS3211_lecture 08_2023.pdfICS3211_lecture 08_2023.pdf
ICS3211_lecture 08_2023.pdf
 
Ubitous computing ppt
Ubitous computing pptUbitous computing ppt
Ubitous computing ppt
 
Multimodal Interaction - Lecture 05 - Next Generation User Interfaces (401816...
Multimodal Interaction - Lecture 05 - Next Generation User Interfaces (401816...Multimodal Interaction - Lecture 05 - Next Generation User Interfaces (401816...
Multimodal Interaction - Lecture 05 - Next Generation User Interfaces (401816...
 
2 4-10
2 4-102 4-10
2 4-10
 
Model-driven engineering of multimodal user interfaces
Model-driven engineering of multimodal user interfacesModel-driven engineering of multimodal user interfaces
Model-driven engineering of multimodal user interfaces
 
Mobile Computing
Mobile ComputingMobile Computing
Mobile Computing
 
HCI Presentation
HCI PresentationHCI Presentation
HCI Presentation
 
Multimodal and Affective Human Computer Interaction - Abhinav Sharma
Multimodal and Affective Human Computer Interaction - Abhinav SharmaMultimodal and Affective Human Computer Interaction - Abhinav Sharma
Multimodal and Affective Human Computer Interaction - Abhinav Sharma
 
Chapter 4
Chapter 4 Chapter 4
Chapter 4
 
UXD Multimedia output
UXD Multimedia outputUXD Multimedia output
UXD Multimedia output
 
Human Computer Interaction
Human Computer InteractionHuman Computer Interaction
Human Computer Interaction
 
Introduction and Human Information Processing lecture.ppt
Introduction and Human Information Processing lecture.pptIntroduction and Human Information Processing lecture.ppt
Introduction and Human Information Processing lecture.ppt
 
User interface webpage
User interface webpageUser interface webpage
User interface webpage
 
universaldesign
 universaldesign universaldesign
universaldesign
 
Human Computer interaction -Interaction design basics
Human Computer interaction -Interaction design basicsHuman Computer interaction -Interaction design basics
Human Computer interaction -Interaction design basics
 
Human computer Interaction
Human computer InteractionHuman computer Interaction
Human computer Interaction
 
Inredis And Machine Learning Nips
Inredis And Machine Learning NipsInredis And Machine Learning Nips
Inredis And Machine Learning Nips
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Recently uploaded (20)

Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 

Multimodal Interaction

  • 2. Multimodal HCI System › Combination of multiple modalities, or usage of more than one independent channel signals for the interaction between a user and a machine is termed as multimodal human computer interaction system (MMHCI). › A multimodal interface acts as a facilitator of human- computer interaction via two or more modes of input. – Input: voice, pen, gesture, face expression, etc. – Output: voice, graphical output, etc.
  • 3. A classic example of a multimodal system is the “Put That There” system.(Bolt,1980)
  • 5. 5 Multimodal Systems – Why? › Provide transparent, flexible, and powerfully expressive means of HCI. › Easier to learn and use. › Robustness and Stability. › If used as front-ends to sophisticated application systems, conducting HCI in modes all users are familiar with, then the cost of training users would be reduced. › Potentially user, task and environment adaptive.
  • 6. 6
  • 7. • Inputs: An interactive multimodal implementation will use multiple input modes such as audio, speech, handwriting, and keyboarding, and other input modes. • Outputs: An interactive multimodal implementation will use one or more modes of output, such as speech, text, graphics, audio files, and animation. • Interaction manager: The interaction manager is the logical component that coordinates data and manages execution flow from various input and output modality component interface objects. The interaction manager maintains the interaction state and context of the application and responds to inputs from component interface objects and changes in the system and environment. In some architectures the interaction manager may be implemented as one single component. In other architectures the interaction manager may be treated as a composition of lesser components. Composition may be distributed across process and device boundaries.
  • 8. Multimodal interfaces process two or more combined user input modes— such as speech, pen, touch, manual gestures, gaze, and head and body movements— in a coordinated manner with multimedia system output. They are a new class of interfaces that aim to recognize naturally occurring forms of human language and behavior, and which incorporate one or more recognition-based technologies (e.g., speech, pen, vision).
  • 9. Challenges for multimodal interface design › More than 2 modes –e.g. spoken, gestural, facial expression, gaze; various sensors › Inputs are uncertain –vs. Keyboard/mouse – Corrupted by noise – Multiple people › Recognition is probabilistic › Meaning is ambiguous
  • 10. Approach Gain robustness via – Fusion of inputs from multiple modalities – Using strengths of one mode to compensate for weaknesses of others—design time and run time – Avoiding/correcting errors – Statistical architecture – Confirmation – Dialogue context – Simplification of language in a multimodal context – Output affecting/channeling input
  • 11. Differences Between Multimodal Interfaces and GUIs GUI 1. Assume that there is a single event stream that controls event loop with processing being sequential. 2. Assume interface actions (e.g. selection of items) are atomic and unambiguous. 3. Built to be separable from application software and reside centrally on one machine. 4. Do not require temporal constraints. Architecture not time sensitive. MULTI-MODAL 1. Typically process continuous and simultaneous input from parallel incoming streams. 2. Process input modes using recognition based technology, good at handling uncertainty. 3. These have large computational and memory requirements and are typically distributed over the network. 4. Require time stamping of input and development of temporal constraints on mode fusion operations.
  • 12. Application Areas › Driver Monitoring › Architecture and Design › Geographical Information Systems › Emergency Operations › Field-based Operations › Mobile Computing and Telecommunications › Virtual Reality › Pervasive/Ubiquitous Computing › Computer-Supported Collaborative Work › Education and Entertainment › Intelligent Homes/Offices › Intelligent Games › Helping People with Disabilities

Editor's Notes

  1. It is easy to use by disabled, illiterate people
  2. Interaction manager — The interaction manager is the logical component that coordinates data and manages execution flow from various input and output modality component interface objects. The interaction manager maintains the interaction state and context of the application and responds to inputs from component interface objects and changes in the system and environment. The interaction manager then manages these changes and coordinates input and output across component interface objects. The Interaction manager is discussed insection 6. In some architectures the interaction manager may be implemented as one single component. In other architectures the interaction manager may be treated as a composition of lesser components. Composition may be distributed across process and device boundaries.
  3. CSCW [is] a generic term, which combines the understanding of the way people work in groups with the enabling technologies of computer networking, and associated hardware, software, services and techniques. Pervasive computing (also called ubiquitous computing) is the growing trend towards embedding microprocessors in everyday objects so they can communicate information.  The words pervasive and ubiquitous mean "existing everywhere." Pervasive computing devices are completely connected and constantly available.