Abstract
This paper presents a novel method for estimating temporal offsets between unsynchronized multi-view videos. When synchronizing multiple cameras scattered over a large area with a wide baseline (e.g., a sports stadium or an event hall), conventional epipolar-based approaches sometimes fail because robust point correspondences are difficult to establish. In such cases, 2D projections of human joints can be robustly associated with each other even in wide-baseline videos and can serve as corresponding points. However, detected 2D poses generally contain detection errors that cause estimation failures. To address these problems, we introduce the motion rhythm of 2D human joints as a cue for synchronization. The proposed method detects motion rhythms in the videos and selects the temporal offsets under which those rhythms are best harmonized. Moreover, we propose a hybrid synchronization algorithm to achieve sub-frame precision. We demonstrate the method's performance on indoor and outdoor data.
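The abstract does not spell out how motion rhythms are detected or how the hybrid algorithm works, so the following is only a rough illustration of the general idea of aligning two videos through a joint-motion signal. It is not the paper's algorithm. It assumes that the mean 2D joint speed per frame can stand in for a "motion rhythm", that normalized cross-correlation can stand in for "harmonization", and that parabolic interpolation of the correlation peak can stand in for sub-frame refinement. The function names `motion_signal` and `estimate_offset` are hypothetical.

```python
import numpy as np


def motion_signal(joints):
    """Mean 2D joint speed per frame (an assumed proxy for motion rhythm).

    joints: array of shape (T, J, 2) with J joint positions over T frames.
    Returns an array of shape (T - 1,).
    """
    velocity = np.diff(np.asarray(joints, dtype=float), axis=0)  # (T-1, J, 2)
    return np.linalg.norm(velocity, axis=2).mean(axis=1)


def _ncc(a, b):
    """Normalized cross-correlation of two equal-length 1D signals."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def estimate_offset(sig_a, sig_b, max_lag):
    """Estimate d (in frames) such that sig_b[t] is close to sig_a[t - d].

    Searches integer lags in [-max_lag, max_lag], then refines the best lag
    with a parabola fitted through the neighbouring correlation scores.
    max_lag is assumed to be much smaller than the signal length.
    """
    n = min(len(sig_a), len(sig_b))
    a = np.asarray(sig_a[:n], dtype=float)
    b = np.asarray(sig_b[:n], dtype=float)
    lags = np.arange(-max_lag, max_lag + 1)
    scores = np.array([
        _ncc(a[:n - d], b[d:]) if d >= 0 else _ncc(a[-d:], b[:n + d])
        for d in lags
    ])
    k = int(np.argmax(scores))
    offset = float(lags[k])
    # Sub-frame refinement (assumption: the peak is locally parabolic).
    if 0 < k < len(lags) - 1:
        s0, s1, s2 = scores[k - 1], scores[k], scores[k + 1]
        curvature = s0 - 2.0 * s1 + s2
        if curvature < 0:
            offset += 0.5 * (s0 - s2) / curvature
    return offset
```

In this sketch, each camera's detected 2D poses would be passed through `motion_signal`, and the two resulting signals through `estimate_offset`. Averaging over joints is a simple way to dampen individual detection errors, but it is a design choice of this sketch rather than something the abstract states.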
Original language | English |
---|---|
Pages (from-to) | 100-110 |
Number of pages | 11 |
Journal | ITE Transactions on Media Technology and Applications |
Volume | 8 |
Issue number | 2 |
Publication status | Published - 2020 |
Externally published | Yes |
Keywords
- 2D human joints
- Motion rhythms
- Video synchronization
- Wide baseline
ASJC Scopus subject areas
- Signal Processing
- Media Technology
- Computer Graphics and Computer-Aided Design