US 12,170,094 B2
Media segment prediction for media generation
Stephane Villette, San Diego, CA (US); Sen Li, San Diego, CA (US); Pravin Kumar Ramadas, San Diego, CA (US); and Daniel Jared Sinder, San Diego, CA (US)
Assigned to QUALCOMM Incorporated, San Diego, CA (US)
Filed by QUALCOMM Incorporated, San Diego, CA (US)
Filed on Oct. 18, 2022, as Appl. No. 18/047,572.
Prior Publication US 2024/0127838 A1, Apr. 18, 2024
Int. Cl. G10L 13/06 (2013.01); G10L 17/02 (2013.01); G10L 21/01 (2013.01); G10L 25/54 (2013.01); G10L 15/26 (2006.01)
CPC G10L 21/01 (2013.01) [G10L 17/02 (2013.01); G10L 25/54 (2013.01); G10L 13/06 (2013.01); G10L 15/26 (2013.01)] 29 Claims
OG exemplary drawing
 
1. A device comprising:
one or more processors configured to:
input one or more segments of an input media stream into a feature extractor;
pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes; and
pass the output of the feature extractor and the at least one representation into a segment matcher to determine a media output segment identifier;
pass the media output segment identifier into one or more memory units, wherein each of the one or more memory units includes a set of weights representing a respective media segment.