CPC H04N 21/478 (2013.01) [H04N 21/435 (2013.01); H04N 21/44008 (2013.01); H04N 21/442 (2013.01); H04N 21/4758 (2013.01)] | 18 Claims |
10. A system for scheduling a communication session, comprising:
a media device;
a user input device; and
control circuitry configured to:
receive, from a source, at the media device, a media asset comprising media data, metadata, and a video of a performance, wherein:
the media data includes at least one frame comprising action information that includes destination information and a communication format;
the metadata includes communication data corresponding to the action information; and
the communication data comprises the destination information, the communication format, and an identification of a trigger to engage in a communication relating to the action information;
determine that the performance has ended, comprising:
identifying a face of a performer in a frame of the media asset;
performing facial recognition on the frame containing the face of the performer to generate a first set of facial characteristics;
periodically detecting whether a subsequent frame of the media asset comprises a face;
performing facial recognition on the face to generate a second set of facial characteristics;
comparing the first set of facial characteristics with the second set of facial characteristics; and
based at least in part on determining that the second set of facial characteristics does not match the first set of facial characteristics, determining that the performance has ended;
based at least in part on the determining that the performance has ended, generate, at the media device, a signal;
generate for display the at least one frame comprising the action information;
receive, from the user input device, a communication command for acting on the action information while the at least one frame is being generated for display; and
based at least in part on receiving the communication command:
determine that the at least one frame comprising action information is currently being generated for display;
extract from the metadata the communication data;
generate a command to engage in a communication session with a server based on the destination information using the communication format;
transmit the extracted communication data and the command to a communication module;
store the extracted communication data and the command in a storage device associated with the communication module;
monitor the signal for the trigger to engage in the communication relating to the action information; and
based at least in part on detecting the trigger after the at least one frame is generated for display, instruct the communication module to execute the command to act on the action information.
|