Personalized machine learning for robot perception of affect and engagement in autism therapy by Ognjen Rudovic, Jaeryoung Lee, Miles Dai, Björn Schuller,

Slides:



Advertisements
Similar presentations
High-dimensional analysis of lymphoid CD4+ T cells identified distinct TFH cell subsets in HIV+ patients and HCs. High-dimensional analysis of lymphoid.
Advertisements

Demonstrations I, II, and III.
Clonal bias of CD56−CD16+ NK cell subpopulations.
Basic design concept of human mimetic humanoid.
Comparing previous simulation work with current study.
The functional and structural basis of phospholipid recognition by the PG90 TCR. The functional and structural basis of phospholipid recognition by the.
Three different types of transfer functions with a codomain of [0,1].
TPAD training protocol.
Self-sensing of actuator position.
Examples of AEGIS autonomous target selection.
Workspace comparison of Delta robots.
T2-weighted cross-sectional MR imaging of MSP swarms inside SD rats.
Distribution of the number of collisions and the average closest-neighbor distance as a function of communication range and delay. Distribution of the.
Visual explanation of the interaction terms.
Visual explanation of the interaction terms.
A shared-control method for effective bimanual robot manipulation.
Tactile features for prosthesis perception.
Fig. 1 Movement percepts for all participants.
Comparison of ternary TCR-lipid-CD1 complex structures.
Quasi-static and dynamic trajectories.
Cell viability tests. Cell viability tests. SEM images of (A) MC3T3-E1 cells and (B) MSCs on days 1, 3, and 5 of culture. (C) Survival rates of MC3T3-E1.
Overview of the key stages (sensing, perception, and interaction) during robot-assisted autism therapy. Overview of the key stages (sensing, perception,
PPA-net. PPA-net. The feature layer performs feature fusion using (supervised) auto-encoders designed to reduce noise and handle missing features. The.
T2-weighted cross-sectional MR imaging of MSP swarms inside SD rats.
Source-based CP BCI robotic arm control.
Effect of slenderness on optimal shapes.
Fig. 3 Performance of HoloConvNet.
Self-sensing of actuator position.
Fig. 3 Near-field maps of a multiply cracked α-(BEDT-TTF)2I3 crystal.
Variability in sensation intensity due to changes in impedance during electrotactile stimulation. Variability in sensation intensity due to changes in.
Experimental setup for workspace, bandwidth, and force characterization of the milliDelta. Experimental setup for workspace, bandwidth, and force characterization.
Ligands eluted from hpMR1+EC and hpMR1+MS contain both shared and unique ions. Ligands eluted from hpMR1+EC and hpMR1+MS contain both shared and unique.
GNPS assists in the identification of novel hpMR1 eluted ions.
Fig. 2 Reference-fixing experiment, results.
Fig. 3 Scan rate effects on the layer edge current.
Coadaptive movements of fully soft robotic design.
Fig. 3 Phase-contrast imaging.
Histograms showing the distribution of people with respect to their performances for the single task and multitask conditions. Histograms showing the distribution.
Fig. 1 Touchless manipulation of objects based on the interaction with magnetic fields. Touchless manipulation of objects based on the interaction with.
Feature computation and classification of grating pitch.
Damage resilience of ACES architecture compared with a conventional row-column multiplexed array. Damage resilience of ACES architecture compared with.
Fig. 5 Multivariate fMRI analysis.
Fig. 3 Characteristics of UV and temperature sensors.
Fig. 3 Photon number statistics resulting from Fock state |l, S − l〉 interference. Photon number statistics resulting from Fock state |l, S − l〉 interference.
In situ concurrent removal of MRSA bacteria and MRSA-secreting PFTs by RBC-PL-robots. In situ concurrent removal of MRSA bacteria and MRSA-secreting PFTs.
The prototype and working mechanism.
Galloping-like gait with the design of a two-legged robot.
AEGIS autonomous targeting process.
Examples of AEGIS autonomous target selection.
Overview of the experimental setup and visual stimulus.
Fig. 3 ET dynamics on the control and treatment watersheds during the pretreatment and treatment periods. ET dynamics on the control and treatment watersheds.
Examples of organic sensing in robotics.
Fig. 6 Global sensitivity analysis illustrates the key model parameters that determine the sickling of RBCs. Global sensitivity analysis illustrates the.
by Hiro Nimiya, Tatsunori Ikeda, and Takeshi Tsuji
Fig. 1 Outline of bioacoustic methodology.
CD4-CTL effectors share TCR clonotypes with CD4-CTL precursors.
Relationships between species richness and temperature or latitude
Schematic of the proposed brain-controlled assistive hearing device
Fig. 2 Behavioral modulation of brain change.
Jetting phase analysis.
The biomimetic pressure sensing ability.
Fig. 2 Comparison of the observed DRs and the estimates by the VR model and FL. Comparison of the observed DRs and the estimates by the VR model and FL.
Fig. 2 Bimanual motor task performance scores.
Robot-assisted intervention system.
Robot-initiated joint attention.
Fig. 1 Ecological and evolutionary change of coevolving host and virus populations. Ecological and evolutionary change of coevolving host and virus populations.
Skin-like soft driving system for wirelessly activated fully soft robots. Skin-like soft driving system for wirelessly activated fully soft robots. (A)
Evaluating the performance of the robotic system through comparison with human trackers. Evaluating the performance of the robotic system through comparison.
Comparison of children’s behavior between the three conditions.
Setup used in the study. Setup used in the study. A child interacts with the robot tutor with a large touchscreen sitting between them, displaying the.
Presentation transcript:

Personalized machine learning for robot perception of affect and engagement in autism therapy by Ognjen Rudovic, Jaeryoung Lee, Miles Dai, Björn Schuller, and Rosalind W. Picard Sci. Robotics Volume 3(19):eaao6760 June 27, 2018 Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works

Overview of the key stages (sensing, perception, and interaction) during robot-assisted autism therapy. Overview of the key stages (sensing, perception, and interaction) during robot-assisted autism therapy. Data from three modalities (audio, visual, and autonomic physiology) were recorded using unobtrusive audiovisual sensors and sensors worn on the child’s wrist, providing the child’s heart-rate, skin-conductance (EDA), body temperature, and accelerometer data. The focus of this work is the robot perception, for which we designed the personalized deep learning framework that can automatically estimate levels of the child’s affective states and engagement. These can then be used to optimize the child-robot interaction and monitor the therapy progress (see Interpretability and utility). The images were obtained by using Softbank Robotics software for the NAO robot. Ognjen Rudovic et al. Sci. Robotics 2018;3:eaao6760 Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works

PPA-net. PPA-net. The feature layer performs feature fusion using (supervised) auto-encoders designed to reduce noise and handle missing features. The inward and outward arrows depict the encoding (in orange)/decoding (in green) of the observed input features (in blue). At the context layer, behavioral scores of the child’s mental, motor, and verbal ability are used to augment the input features using the expert knowledge [quantified by CARS (34)]. Personalization of the network is achieved using the demographic information (culture and gender), followed by individual network layers for each child. The inference layer performs the child-specific estimation of valence, arousal, and engagement levels. The activations of the hidden nodes (in orange) are learned during the network training (see Results). Ognjen Rudovic et al. Sci. Robotics 2018;3:eaao6760 Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works

Data analysis and results. Data analysis and results. (A) The fraction of data present across the different modalities both individually and concurrently. (B) The heat maps of the joint distributions for the valence, arousal, and engagement levels, coded by human experts. Large differences in these patterns are present at the culture and individual levels. (C) Clustering of the children from C1 and C2 using the t-SNE, an unsupervised dimensionality reduction technique, applied to the auto-encoded features (see Effects of model personalization). (D) ICC scores per child: C1 (n = 17) and C2 (n = 18) for valence (V), arousal (A), and engagement (E) estimation. Compared with the GPA-net (in gray), the performance of the PPA-net (in black) improved at all three levels (culture, gender, and individual) in the model hierarchy. Bottom: Sorted improvements in the ICC performance (ΔICC) between PPA-net and GPA-net for each child. Ognjen Rudovic et al. Sci. Robotics 2018;3:eaao6760 Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works

Interpretability and utility. Interpretability and utility. (A) Interpretability can be enhanced by looking at the influence of the input features on the output target. Here, the output is estimated engagement level. The relative importance scores (y axis) are shown for input features from each behavioral modality (x axis). These are obtained from the DeepLift (38) tool, which provides negative/positive values when the input feature drives the output toward −1/+1. (B) Estimated engagement, arousal, and valence levels of the child as the therapy progresses. These were obtained by applying the learned PPA-net to the held-out data of the target child. We also plotted the corresponding signals measured from the child’s wrist: the movement intensity derived from accelerometer readings (ACC), blood-volume pulse (BVP), and EDA. (C) Summary of the therapy in terms of the average ± SD levels of affect and engagement within each phase of the therapy: (1) pairing, (2) recognition, and (3) imitation. Ognjen Rudovic et al. Sci. Robotics 2018;3:eaao6760 Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works

The learning of the PPA-net. The learning of the PPA-net. (A) The supervised AE performs the feature smoothing by dealing with missing values and noise in the input while preserving the discriminative information in the subspace h0—constrained by the CoF0. The learning operators in the PPA-net—(B) learn, (C) nest, and (D) clone—are used for the layer-wise supervised learning, learning of the subsequent vertical layers, and horizontal expansion of the network, respectively. (E) The group-level GPA-net is first learned by sequentially increasing the network depth using learn and nest and then used to initialize the personalized PPA-net weights at the culture, gender, and individual level (using clone). (F) The network personalization is then accomplished via the fine-tuning steps I and II (Materials and Methods). Ognjen Rudovic et al. Sci. Robotics 2018;3:eaao6760 Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works