Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Related Flashcards

Flashcards
»
Chapter 8. Multimodal Dialog Systems

Chapter 8. Multimodal Dialog Systems

by Katja-Malkova, Feb. 2015

Subjects: Speech Communication

Favorite

Add to folder

Flag

Related Essays

Influences The Comunicative Modes Of Multimodality In Communication
1. INTRODUCTION 2 2. WHAT IS MULTIMODALITY 2 3. HOW MULTIMODALITY INFLUENCES THE COMMUNICATORS CHOICE OF WORDS AND SELECTION 3 4. CONCLUSION 4 5.
Gestures In Conversational Storytelling Research Paper
In the video-recorded interaction, four people engaged in communicating through gaze, gesture, and speech within an interactional environment. In this paper,...
Non Verbal Communication
Communication is one way to connect people all around the world. It is rather complex way of relaying information, because it is verbal or non-verbal, and w...
Multimodal Discourse Analysis
Multimodality is a theory which looks at the many different modes that society uses for meaning making and interpretation. According to Royce and Bowcher (20...
Comparing The Four Basics Of Gestures
2.4.1 Voice gesture recognition Gestures can be recognized in different ways, voice or speech gestures if those ways. Speech and gesture recognition systems...
The Importance Of Human-Computer Interaction
Some more work is needed on improving the systems’ automatic interpretation of the collected data and adapting the system to the person involved in the Human...
Verbal Communication Importance
Gestures are the tools of non-verbal communication with movement body and arms. Facial expression Eye
Nonverbal Communication In Different Cultures
Bodytalk: The Meaning of Human Gestures. New York: The Crown Publishing
Importance Of Nonverbal Communication In Visual Chat Rooms
In this paper I will be talking about how nonverbal communication plays a role in visual chat rooms. First of all, which of the three functions (expressin...
Facial Emotion Recognition
In recent years, researchers are trying to improve all aspects of interaction between humans and computers especially in the area of human emotion recognitio...

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/12

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

12 Cards in this Set

Front
Back

	What are advantages and disadvantages of multimodal compared to purely speech-based dialog systems?	+ more natural + more robustness - more difficult
	What is the difference between a multimedia and a multimodal system?	Multimedia system doen't summarize the information from user to make the dessicion.
	Name some input and output modalities	For the classification of output modalities, Bernsen (1999) employs the 5 properties - Linguistic vs. non-linguistic: Linguistic modalities are based on a syntactic-semantic-pragmatic system of meaning. Examples for this are e.g. written text, or spoken language - Analogous vs. non-analogous: Analog modalities rest on a similarity between the designator and the designated; therefore, they are alternatively called iconic modalities. Examples for analog modalities are pictures and diagrams. - arbitrary vs. non-arbitrary: Non-arbitrary modalities are based upon an established system of meaning, arbitrary ones don’t. - static vs. dynamic: Static modalities may perceived by a user in principle in any order, and by any duration, while dynamic modalities may not. - Class of media: graphical (visually perceivable), acoustic (to be perceived auditorily) or haptical. Input modalities: As above, but without static/dynamic
	According to which "rules“ can you select appropriate input and output modalities?
	Explain the set-up and the functions of a multimodal dialog system!
	How can a face be recognized automatically?	1. Rule-based: Description of the facial area (relative position of features like mouth, eyes, nose) via a set of rules 2. Determination of invariant features and statistical classification 3. Pattern comparison (of single parts of the entire facial area) 4. Color and combinations
	How can you detect and track gazes?	1. Cornea Reflex Method: Reflextion of a beam of light at the cornea surface 2. Electro Oculograms (EOG): Measurement of the electrical potential between cornea and retina 3. Preparated contact lenses
	What is the advantage of audio-visual over purely audio speech recognition?	Fusion of acoustic and visual information - on a feature level (feature fusion) [problem: asynchronies] - on the level of probabilities (decision fusion)
	What are the classes of Text Recognition?	- Offline recognition of fixed shapes: Static Optical Character Recognition (OCR) of typed or handwritten characters - On-line recognition of shapes in progress: Dynamic handwriting recognition (during the writing process)
	Explain the terms fusion and fission.	The basic idea is that the system makes decisions/reacts based on all the input modalities. Thus, it has to "fuse" the data in order to make a decision. When the decision about the action has been made, it then has to respond using output modalities. What kind of output modalities / a mixture of them to use corresponding to that action is called "fission".
	Which types of gestures do you know, and how can you recognize them automatically?	Classification of Gestures: - Symbolic: Use of symbols which transmit meanings - Deictic: Pointing gestures - Iconic: Visual descriptions of objects, positions and actions - Metaphoric: Description of abstract ideas - Beating or rythmical gestures Gesture recognition: - Intrusive: Direct input via mouse, stylus, data glove - Non-intrusive: Recognition of parts of the body via a camera
	What is an Embodied Conversational Agent (ECA), and how does it work?	- Output of speech, mimics and gestures Advantages: - Virtual interlocutor for the user - Attracting the attention of the user - Feedback about the state of the system - Transport of feedback and emotions - Potentially increasing speech intelligibility Approaches: - Animation of previously taken pictures - Parametrically-steered model

Share This Flashcard Set