CPC H04N 13/398 (2018.05) [H04N 21/43072 (2020.08); G01P 15/00 (2013.01)] | 17 Claims |
1. A system, comprising:
circuitry configured to:
control a plurality of image-capture devices to acquire a plurality of video clips of at least one object, wherein
each video clip of the plurality of video clips is acquired by a corresponding image-capture device of the plurality of image-capture devices in a moving state,
each image-capture device of the plurality of image-capture devices is at relative fixed distance from remaining image-capture devices of the plurality of image-capture devices in the moving state,
a first video clip of the plurality of video clips is acquired by a first image-capture device of the plurality of image-capture devices, and
a second video clip of the plurality of video clips is acquired by a second image-capture device of the plurality of image-capture devices;
acquire, from the plurality of image-capture devices, a set of sensor data which is associated with a movement of the plurality of image-capture devices, wherein
the set of sensor data includes first sensor data associated with the first video clip and second sensor data associated with the second video clip;
determine, by cross-correlation, a relative time offset between the first sensor data and the second sensor data, wherein
the relative time offset corresponds to a number of frames by which first positional identifiers of matching frames of the first video clip of the plurality of video clips is offset from second positional identifiers of the matching frames of the second video clip of the plurality of video clips;
determine the matching frames in the first video clip and the second video clip based on the relative time offset;
synchronize the first video clip and the second video clip based on the matching frames, wherein the synchronization includes a shift in the first positional identifiers of the matching frames in the first video clip with respect to the second positional identifiers of the matching frames in the second video clip; and
perform a three-dimensional reconstruction of the at least one object based on the synchronization the first video clip and the second video clip.
|