US 12,170,092 B2
Signal processing device, method, and program
Takao Fukui, Tokyo (JP)
Assigned to Sony Group Corporation, Tokyo (JP)
Appl. No. 17/434,696
Filed by Sony Group Corporation, Tokyo (JP)
PCT Filed Feb. 20, 2020, PCT No. PCT/JP2020/006789
§ 371(c)(1), (2) Date Apr. 14, 2022,
PCT Pub. No. WO2020/179472, PCT Pub. Date Sep. 10, 2020.
Claims priority of application No. 2019-039217 (JP), filed on Mar. 5, 2019.
Prior Publication US 2022/0262376 A1, Aug. 18, 2022
Int. Cl. G10L 19/06 (2013.01); G10L 19/02 (2013.01); G10L 21/02 (2013.01)
CPC G10L 19/06 (2013.01) [G10L 19/02 (2013.01); G10L 21/02 (2013.01)] 10 Claims
OG exemplary drawing
 
1. A signal processing device for generating an output sound signal from an original sound signal, the output sound signal having a higher quality than the original sound signal, comprising:
a sound quality improvement processing unit configured to receive an input compressed sound source signal, to compute an excitation signal from the input compressed sound source signal, and to output the excitation signal;
a switching unit configured to receive the input compressed sound source signal and to control a switch to connect to either the input compressed sound source signal or the excitation signal and to output either the input compressed sound source signal or the excitation signal;
a fast fourier transform processing unit electrically connected to the switch and configured to compute a fast fourier transform from either the input compressed sound source signal or the excitation signal depending on a configuration of the switch;
a calculation unit electrically connected to the fast fourier transform processing unit and configured to calculate a parameter for generating a difference signal corresponding to an input compressed sound source signal on a basis of a prediction coefficient and the fast fourier transform from the input compressed sound source signal, the prediction coefficient being obtained by learning using, as training data, a difference signal between an original sound signal and a learning compressed sound source signal obtained by compressing and coding the original sound signal;
a difference signal generation unit electrically connected to the fast fourier transform processing unit and the calculation unit, and configured to generate the difference signal on a basis of the parameter and the fast fourier transform;
an inverse fast fourier transform unit electrically connected to the difference signal generation unit and configured to compute an inverse fast fourier transform from the difference signal; and
a synthesis unit electrically connected to the inverse fast fourier transform unit and configured to receive the input compressed source signal and to synthesize the inverse fast fourier transform of the generated difference signal and the input compressed sound source signal to generate the output sound signal with the higher quality than the original sound signal.