Search results

    Search results

    Show all results for ""
    Can not find any results or suggestions for "."

    Search tips

    • Make sure there are no spelling errors
    • Try different search terms or synonyms
    • Narrow your search for more hits

    How can we help?

    Contact Us

    Find Employees

    University of Skövde, link to startpage

    Search results

      Search results

      Show all results for ""
      Can not find any results or suggestions for "."

      Search tips

      • Make sure there are no spelling errors
      • Try different search terms or synonyms
      • Narrow your search for more hits

      How can we help?

      Contact Us

      Find Employees

      University of Skövde, link to startpage

      PhD thesis defence: Vipul Nair

      Date 24 May
      Time 13:15 - 17:00
      Location G Building, Room G110
      Extra information PhD thesis defence

      Vipul Nair defends his thesis "The Observer Lens: Characterizing Visuospatial Features in Multimodal Interactions".

      Join online

      The dissertation will be held in room G110 at the University of Skövde, and will also be streamed online. Join the livestream.

      Abstract

      Understanding the intricate nature of human interactions relies heavily on the ability to discern and interpret inherent information. Central to this interpretation are the sensory features—visual, spatial, and auditory—collectively referred to as visuospatial in this thesis.

      The low-level (e.g., motion kinematics) and high-level (e.g., gestures and speech) visuospatial features significantly influence human perception, aiding in the deduction of intent, goals, emotions, and more.

      From a computational viewpoint, these features are crucial for interpreting events, from discerning body poses to evaluating action similarity, particularly for computational systems designed to interact closely with humans.

      This thesis examines the impact of visuospatial features on human event observation within an informatics context, concentrating on (1) Investigating the effect of visuospatial features on the observers’ perception; and (2) Aligning this investigation towards outcomes applicable to informatics.

      Taking a human-centric perspective, the thesis methodically probes the role of visuospatial features, drawing from prior cognitive research, underscoring the significance of features like action kinematics, gaze, turn-taking, and gestures in event comprehension. Balancing both reductionist and naturalistic perspectives, the research examines specific visuospatial features and their impact on the human's visual processing and attention mechanisms:

      - Visual Processing: It highlights the visual processing effects of action features, including kinematics, local-motion, and global form, as well as the role of factors like semantics and familiarity. These are demonstrated using human performance metrics in perceptual tasks and comparative analyses with selected computational models employing basic kinematic representations, revealing the adaptive nature of the visual system and enhancing Human Action Recognition models.

      - Visual Attention: Also highlights the attentional effects of interaction cues, such as speech, hand action, body pose, motion, and gaze, using the developed `Visuospatial Model'. This model presents a systematic approach for characterizing visuospatial features in everyday events, exemplified using a curated movie dataset and a newly developed comprehensive dataset of naturalistic-multimodal events.

      The findings emphasize the integration of behavioral and perceptual parameters with computationally aligned strategies, such as benchmarking perceptual tasks against human behavioral and psychophysical metrics, thereby providing a richer context for developing systematic tools and methodologies for multimodal event characterization.

      At its core, the thesis characterizes the role of visuospatial features in shaping human perception and its implications for the development of cognitive technologies equipped with autonomous perception and interaction capabilities -- essential for domains like social robotics, autonomous driving, media studies, traffic safety, and virtual characters.

      Opponent

      Keith L. Downing, Professor, Norwegian University of Science and Technology

      Supervisors

      Paul Hemeren, Associate Professor, University of Skövde
      Mehul Bhatt, Professor, Örebro University
      Erik Billing, Associate Professor, University of Skövde

      Committee

      Ibrahim A. Hameed, Professor, Norwegian University of Science and Technology
      Magnus Johnsson, Professor, Kristianstad University
      Ingar Brinck, Professor, Lund University, Lund
      Manfred Jeusfeld, Professor, University of Skövde

      Chair of the Defense

      Jörgen Hansson, Professor, University of Skövde

      Contact

      PhD Student Informatics

      Published: 4/25/2024
      Edited: 4/25/2024
      Responsible: webmaster@his.se