US 12,170,098 B2
Sound detection method
Shiliang Zhang, Hangzhou (CN); Siqi Zheng, Hangzhou (CN); and Weilong Huang, Hangzhou (CN)
Assigned to Alibaba Damo (Hangzhou) Technology Co., Ltd., Hangzhou (CN)
Filed by Alibaba Damo (Hangzhou) Technology Co., Ltd., Zhejiang (CN)
Filed on Aug. 26, 2022, as Appl. No. 17/822,731.
Claims priority of application No. 202111029142.7 (CN), filed on Sep. 3, 2021.
Prior Publication US 2023/0074906 A1, Mar. 9, 2023
Int. Cl. G10L 25/87 (2013.01); G10L 15/02 (2006.01); G10L 15/06 (2013.01)
CPC G10L 25/87 (2013.01) [G10L 15/02 (2013.01); G10L 15/063 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A sound detection method, comprising:
obtaining an initial sound signal and a spatial distribution spectrum of the initial sound signal;
segmenting the initial sound signal to obtain a target sound segment;
obtaining a timestamp corresponding to the target sound segment, wherein the target sound segment comprises a speech of at least one object, and the timestamp is used for indicating a start time of the target sound segment and an end time of the target sound segment;
segmenting the spatial distribution spectrum by using the timestamp, to obtain a spatial distribution spectrum segment corresponding to the target sound segment; and
inputting the target sound segment and the spatial distribution spectrum segment into a sound detection model, to obtain a first sound detection result, wherein the first sound detection result is used for describing whether sound of a plurality of objects exists in the initial sound signal.