
VEATIC: Video-based Emotion and Affect Tracking in Context Dataset: Subject Agreement Across Videos

Too Long; Didn't Read

In this paper, researchers introduce the VEATIC dataset for human affect recognition, addressing limitations in existing datasets and enabling context-based inference.

This paper is available on arXiv under a CC 4.0 license.

Authors:

(1) Zhihang Ren, University of California, Berkeley; these authors contributed equally to this work (Email: [email protected]);

(2) Jefferson Ortega, University of California, Berkeley; these authors contributed equally to this work (Email: [email protected]);

(3) Yifan Wang, University of California, Berkeley; these authors contributed equally to this work (Email: [email protected]);

(4) Zhimin Chen, University of California, Berkeley (Email: [email protected]);

(5) Yunhui Guo, University of Texas at Dallas (Email: [email protected]);

(6) Stella X. Yu, University of California, Berkeley and University of Michigan, Ann Arbor (Email: [email protected]);

(7) David Whitney, University of California, Berkeley (Email: [email protected]).

10. Subject Agreement Across Videos

A benefit of the VEATIC dataset is that it has multiple annotators for each video, with the minimum number of annotators for any given video being 25 and the maximum being 73. Emotion perception is subjective, and observers' judgments can vary across individuals. Many previously published emotion datasets have very few annotators, often only a single-digit number (n < 10). Having so few annotators is problematic because of the increased variance across observers. To show this, we calculated how the average rating for each video in our dataset varied if we randomly sampled, with replacement, five versus all annotators. We repeated this process 1000 times for each video and calculated the standard deviation of the recalculated average rating. Figure 12a shows how the standard deviation of the consensus rating across videos varies when we use either five or all annotators for each video. This analysis shows that having more annotators leads to much smaller standard deviations in the consensus rating, which yields more accurate representations of the ground-truth emotion in the videos.
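The resampling procedure described above is straightforward to sketch in code. The snippet below is a minimal illustration of that bootstrap, not the authors' implementation; the `ratings` mapping, array shapes, and function name are assumptions made for the example.

```python
# Minimal sketch of the bootstrap analysis described above (not the authors' code).
# Assumes `ratings[video_id]` is a 2D array of shape (n_annotators, n_frames)
# holding, e.g., valence ratings; names and shapes are illustrative assumptions.
import numpy as np

def consensus_std(annotator_ratings: np.ndarray, n_sampled: int,
                  n_boot: int = 1000, rng=None) -> float:
    """Std. dev. of a video's mean rating when `n_sampled` annotators are
    drawn with replacement, repeated `n_boot` times."""
    rng = np.random.default_rng(rng)
    n_annotators = annotator_ratings.shape[0]
    consensus_means = []
    for _ in range(n_boot):
        idx = rng.integers(0, n_annotators, size=n_sampled)  # sample with replacement
        consensus_means.append(annotator_ratings[idx].mean())  # mean rating of the resampled group
    return float(np.std(consensus_means))

# Compare five annotators vs. all annotators for one hypothetical video `vid`:
# std_five = consensus_std(ratings[vid], n_sampled=5)
# std_all  = consensus_std(ratings[vid], n_sampled=ratings[vid].shape[0])
```

With more annotators in each resample, the bootstrapped consensus means cluster more tightly, which is exactly the reduction in standard deviation shown in Figure 12a.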


Figure 9. More sample video frames in VEATIC. The video clips in VEATIC contain various backgrounds, lighting conditions, character interactions, etc., making it a comprehensive dataset for not only emotion recognition tasks but also other video understanding tasks.


Figure 10. Sample video frames of unselected characters and pure background in VEATIC. The first sample frame in each row shows the selected character. The remaining sample frames show either unselected characters or pure background.


Additionally, we investigated how observers' responses varied across videos by calculating the standard deviation across observers for each video. Figure 12b shows the standard deviations across videos. We find that the standard deviations for both the valence and arousal dimensions were small, with valence having an average standard deviation of µ = 0.248 (median = 0.222) and arousal having an average standard deviation of µ = 0.248 (median = 0.244). These values are comparable to the valence and arousal rating variances in EMOTIC [32].
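A minimal sketch of this per-video agreement measure follows, under the same assumed data layout as the earlier snippet (the `ratings` mapping and array shapes are hypothetical, not the authors' code):

```python
# Hedged sketch: standard deviation across observers at each frame, averaged
# over the video, then summarized (mean and median) across all videos.
import numpy as np

def per_video_observer_std(annotator_ratings: np.ndarray) -> float:
    # annotator_ratings: assumed shape (n_annotators, n_frames) for one dimension (valence or arousal).
    return float(np.std(annotator_ratings, axis=0).mean())

# video_stds = np.array([per_video_observer_std(r) for r in ratings.values()])
# print("mean =", video_stds.mean(), "median =", np.median(video_stds))
```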

