US 12,170,875 B2
Managing playback of multiple streams of audio over multiple speakers
Alan J. Seefeldt, Alameda, CA (US); Joshua B. Lando, Mill Valley, CA (US); Daniel Arteaga, Barcelona (ES); Mark R. P. Thomas, Walnut Creek, CA (US); and Glenn N. Dickins, Como (AU)
Assigned to Dolby Laboratories Licensing Corporation, San Francisco, CA (US); and Dolby International AB, Amsterdam (NL)
Appl. No. 17/630,917
Filed by Dolby Laboratories Licensing Corporation, San Francisco, CA (US); and Dolby International AB, Amsterdam (NL)
PCT Filed Jul. 27, 2020, PCT No. PCT/US2020/043696
§ 371(c)(1), (2) Date Jan. 28, 2022,
PCT Pub. No. WO2021/021707, PCT Pub. Date Feb. 4, 2021.
Claims priority of provisional application 62/705,896, filed on Jul. 21, 2020.
Claims priority of provisional application 62/705,410, filed on Jun. 25, 2020.
Claims priority of provisional application 62/705,351, filed on Jun. 23, 2020.
Claims priority of provisional application 62/705,143, filed on Jun. 12, 2020.
Claims priority of provisional application 62/704,754, filed on May 27, 2020.
Claims priority of provisional application 62/992,068, filed on Mar. 19, 2020.
Claims priority of provisional application 62/971,421, filed on Feb. 7, 2020.
Claims priority of provisional application 62/949,998, filed on Dec. 18, 2019.
Claims priority of provisional application 62/880,111, filed on Jul. 30, 2019.
Claims priority of provisional application 62/880,114, filed on Jul. 30, 2019.
Claims priority of provisional application 62/880,115, filed on Jul. 30, 2019.
Claims priority of application No. P201930702 (ES), filed on Jul. 30, 2019; and application No. 19217580 (EP), filed on Dec. 18, 2019.
Prior Publication US 2022/0272454 A1, Aug. 25, 2022
Int. Cl. H04R 3/12 (2006.01); G10L 15/08 (2006.01); G10L 15/22 (2006.01); G10L 25/78 (2013.01); H04R 5/02 (2006.01); H04R 5/04 (2006.01); H04S 7/00 (2006.01)
CPC H04R 3/12 (2013.01) [G10L 15/08 (2013.01); G10L 15/22 (2013.01); G10L 25/78 (2013.01); H04R 5/02 (2013.01); H04R 5/04 (2013.01); H04S 7/30 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01); H04R 2430/01 (2013.01); H04S 2400/11 (2013.01); H04S 2400/13 (2013.01); H04S 2400/15 (2013.01)] 29 Claims
1. An audio processing system, comprising:
an interface system;
a microphone system including one or more microphones;
a control system comprising:
a first rendering module configured for:
receiving, via the interface system, a first audio program stream, the first audio program stream including first audio signals that are scheduled to be reproduced by at least some speakers of an environment, the first audio program stream comprising first spatial data, the first spatial data including at least one of channel data or spatial metadata; and
rendering the first audio signals for reproduction via the speakers of the environment, to produce first rendered audio signals;
a second rendering module configured for:
receiving, via the interface system, a second audio program stream, the second audio program stream including second audio signals that are scheduled to be reproduced by at least one speaker of the environment; and
rendering the second audio signals for reproduction via at least one speaker of the environment, to produce second rendered audio signals;
wherein the first rendering module is further configured for receiving first microphone signals from the microphone system and for modifying a rendering process for the first audio signals based at least in part on the first microphone signals and at least one of the second audio signals, the second rendered audio signals or characteristics thereof, to produce modified first rendered audio signals; and
wherein the second rendering module is further configured for receiving the first microphone signals and for modifying a rendering process for the second audio signals based at least in part on the first microphone signals and on at least one of the first audio signals, the first rendered audio signals or characteristics thereof, to produce modified second rendered audio signals; and
a mixing module configured for mixing the modified first rendered audio signals and the modified second rendered audio signals to produce mixed audio signals,
wherein the control system is further configured for:
determining whether the first microphone signals correspond to a human voice;
modifying the rendering process for at least one of the first audio signals or the second audio signals based at least in part on whether the first microphone signals correspond to the human voice, wherein modifying the rendering process for the first audio signals involves reducing the loudness of the first rendered audio signals reproduced by speakers near the first sound source position, as compared to the loudness of the first rendered audio signals reproduced by speakers farther from the first sound source position; and
providing the mixed audio signals to at least some speakers of the environment.
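
The signal flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the linear distance-to-gain curve, the `radius` and `min_gain` parameters, and the energy-threshold stand-in for voice detection are all assumptions made for illustration. The sketch modifies only the first stream's rendering, lowering its loudness on speakers near an assumed sound source position when the microphone signal is judged to be a voice, and then mixes both rendered streams per speaker.

```python
import math

def looks_like_voice(mic_signal, threshold=0.01):
    # Hypothetical stand-in for a real voice detector: mean signal energy
    # compared against a threshold. A real system would use a trained classifier.
    if not mic_signal:
        return False
    energy = sum(x * x for x in mic_signal) / len(mic_signal)
    return energy > threshold

def distance_gains(speaker_positions, source_position, min_gain=0.2, radius=3.0):
    # Assumed gain curve: min_gain at the source position, rising linearly
    # to 1.0 at `radius` and beyond, so nearer speakers play more quietly.
    gains = []
    for pos in speaker_positions:
        t = min(math.dist(pos, source_position) / radius, 1.0)
        gains.append(min_gain + (1.0 - min_gain) * t)
    return gains

def render(signal, gains):
    # Produce one feed per speaker by scaling the signal with that speaker's gain.
    return [[sample * g for sample in signal] for g in gains]

def mix(feeds_a, feeds_b):
    # Sum two sets of per-speaker feeds sample by sample.
    return [[a + b for a, b in zip(fa, fb)] for fa, fb in zip(feeds_a, feeds_b)]

def process(first_signal, second_signal, mic_signal, speaker_positions, source_position):
    n = len(speaker_positions)
    if looks_like_voice(mic_signal):
        first_gains = distance_gains(speaker_positions, source_position)
    else:
        first_gains = [1.0] * n
    first_rendered = render(first_signal, first_gains)
    second_rendered = render(second_signal, [1.0] * n)
    return mix(first_rendered, second_rendered)

speakers = [(0.0, 0.0), (3.0, 0.0)]
mixed = process([1.0, 1.0], [0.5, 0.5], [0.5, -0.5], speakers, (0.0, 0.0))
quiet = process([1.0, 1.0], [0.5, 0.5], [0.0, 0.0], speakers, (0.0, 0.0))
```

In this example the speaker located at the assumed source position receives the first stream at reduced gain while the distant speaker receives it at full gain; when the microphone signal falls below the threshold, both speakers receive the unmodified mix.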