CPC H04N 21/8549 (2013.01) [G06V 10/764 (2022.01); G06V 10/82 (2022.01); G10L 15/26 (2013.01)] | 13 Claims |
1. A method for controlling an electronic apparatus, the method comprising:
obtaining a video including content that performs a task;
identifying, within a first portion of the video, an object and motion information corresponding to the object;
obtaining first text that describes the first portion of the video based on information corresponding to the object and the motion information;
obtaining second text based on voice information obtained from the first portion of the video; and
providing information for performing the task based on the first text and the second text,
wherein the obtaining of the second text comprises;
converting the voice information obtained from the first portion of the video into text; and
selecting text related to the first text among converted texts as the second text for describing the first portion of the video based on a degree of similarity between the first text and the converted text.
|