close

Вход

Забыли?

вход по аккаунту

?

JP2017147636

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2017147636
Abstract: [Problem] To adjust to a sound signal with suppressed listening without giving a sense
of discomfort to surrounding listeners. A sound signal adjustment apparatus according to the
present invention comprises an audibility characteristic information estimating means for
estimating audibility characteristics of an input environmental sound signal, and a frequency
characteristic of the input sound signal based on the audibility characteristic of the
environmental sound. A sound signal characteristic adjusting means for adjusting the sound
quality, and an environmental characteristic adjusting means for giving a gain based on the
auditory characteristic of the environmental sound signal to the whole of the frequency
characteristic component of the sound signal adjusted by the sound signal characteristic
adjusting means. It features. [Selected figure] Figure 1
Sound signal adjusting device, sound signal adjusting program and acoustic device
[0001]
The present invention relates to a sound signal adjustment device, a sound signal adjustment
program, and an audio device, and can be applied to, for example, a sound signal adjustment
device, a sound signal adjustment program, and an audio device that adjust a sound signal in
which hearing is suppressed.
[0002]
Conventionally, as a method of not diffusing the audio output, there is a method of outputting the
masking sound described in Patent Document 1 to suppress the listening to the surrounding
07-05-2019
1
listeners.
According to the technology described in Patent Document 1, a masking sound is output around
an output sound source to be masked to prevent the surrounding listener from hearing the
output sound source.
[0003]
JP-A-6-175666
[0004]
However, in the prior art, a masking sound is generated as a method of suppressing listening to
the surrounding listeners.
In this case, it may cause the surrounding listeners to listen to the unnecessary sound, which may
cause discomfort for the surrounding listeners.
[0005]
Therefore, there is a need for a sound signal adjustment device, a sound signal adjustment
program, and a sound device that adjust the sound signal to a reduced listening level without
giving discomfort to the surrounding listeners.
[0006]
The sound signal adjustment apparatus according to the first aspect of the present invention
comprises: (1) auditory characteristic information estimation means for estimating the auditory
characteristic of the input environmental sound signal; and (2) input based on the auditory
characteristic of the environmental sound Sound signal characteristic adjusting means for
adjusting the frequency characteristic of the sound signal, and (3) a gain based on the auditory
characteristic of the environmental sound signal is given to the entire frequency characteristic
component of the sound signal adjusted by the sound signal characteristic adjusting means And
environmental characteristic adjusting means.
[0007]
07-05-2019
2
A sound signal adjustment program according to a second aspect of the present invention
comprises: a computer; (1) auditory characteristic information estimation means for estimating
auditory characteristic of an input environmental sound signal; and (2) auditory characteristic of
environmental sound. Sound signal characteristic adjusting means for adjusting the frequency
characteristic of the input sound signal, and (3) gain based on the auditory characteristic of the
environmental sound signal, all over the frequency characteristic component of the sound signal
adjusted by the sound signal characteristic adjusting means It is characterized in that it is made
to function as an environmental characteristic adjustment means to be given to it.
[0008]
An audio apparatus according to a third aspect of the present invention is the audio system
according to the first aspect of the present invention, wherein (1) a microphone for acquiring an
environmental sound signal, and (2) an environmental sound signal is acquired from the
microphone and the input sound signal is adjusted. And a speaker for outputting the sound signal
adjusted by the sound signal adjusting means.
[0009]
According to the present invention, listening can be suppressed without giving discomfort to
surrounding listeners.
[0010]
It is a block diagram which shows the structure of the audio | voice spreading | diffusion
suppression apparatus which concerns on embodiment.
It is a block diagram showing the internal configuration of the voice adjustment device
concerning an embodiment.
It is explanatory drawing explaining the method of calculating a loudness correction value from
the equal loudness curve which concerns on embodiment.
It is a flowchart which shows the estimation process of the loudness correction value in the
loudness estimation part which concerns on embodiment.
07-05-2019
3
It is explanatory drawing explaining the frequency characteristic of the signal input-output by the
feature-value adjustment part which concerns on embodiment.
It is an explanatory view explaining the frequency characteristic of the signal inputted and
outputted by the environmental characteristic adjustment part concerning an embodiment.
[0011]
(A) Main Embodiment Embodiments of the sound signal adjustment device, the sound signal
adjustment program, and the sound device according to the present invention will be described
in detail with reference to the drawings.
[0012]
Here, “sound signal” is a concept including an audio signal, an acoustic signal, and the like.
In this embodiment, the case where the sound signal is an audio signal will be described as an
example.
[0013]
Further, in this embodiment, a case where the present invention is applied and applied to an
audio adjusting device of an audio spread suppressing device will be exemplified and described.
[0014]
(A-1) Configuration of Embodiment FIG. 1 is a configuration diagram showing a configuration of
a speech diffusion suppressing apparatus 10 according to the embodiment.
[0015]
In FIG. 1, the voice diffusion device 10 includes a voice adjustment device 100, a speaker device
200, and a microphone device 201.
[0016]
07-05-2019
4
Some noise is generated in a general environment.
The present invention uses the characteristic that the output sound source is difficult to hear
when the output sound source is a signal having a feature similar to the ambient environmental
sound under the environment where such noise is generated.
[0017]
When outputting the input signal SI input as an output signal, the voice diffusion suppression
apparatus 10 acquires an ambient environment signal NI indicating the characteristics of the
ambient sound near the microphone device 201, and approximates the ambient environment
signal NI. Output the adjusted output signal SO.
[0018]
The speaker device 200 emits a sound based on the output signal SO adjusted to the
characteristic approximate to the environmental sound by the sound adjustment device 100.
For example, when the listener 20 is in the vicinity of the speaker device 200, the listener 20 can
listen to the sound output from the speaker device 200.
[0019]
The microphone device 201 acquires (captures) environmental sound around the microphone
device 201, converts the acquired environmental sound into an environmental signal NI
(electrical signal), and outputs it to the voice adjustment device 100.
In this embodiment, the microphone device 201 is disposed in the vicinity of a listener 30 who
does not want to listen to the output sound.
[0020]
07-05-2019
5
The voice adjustment device 100 receives the input signal SI and the ambient environment signal
NI, and outputs an output signal SO that is masked by the ambient sound of the listener 30 to the
speaker device 200 by processing described later.
[0021]
Here, the input signal SI input to the voice adjustment device 100 is a voice that the listener 20
wants to listen to but the listener 30 does not want to listen to.
The input signal SI is processed by the audio adjustment device 100 and generated as an output
signal SO, and then emitted from the speaker device 200.
Although the emitted sound may be transmitted to the listener 30, the processed sound is
processed by the sound adjustment apparatus 100 so that the propagated sound is masked by
the ambient sound of the listener 30, It is made difficult for the listener 30 to listen to the voice.
[0022]
FIG. 2 is a block diagram showing an internal configuration of the voice adjustment device 100
according to the embodiment.
[0023]
In FIG. 2, the voice adjustment device 100 includes a frequency conversion unit 101, a loudness
estimation unit 102 as an auditory characteristic information estimation unit, a feature quantity
adjustment unit 103 as a sound signal characteristic adjustment unit, an environmental
characteristic conversion unit 104, and an environmental characteristic adjustment unit. And an
inverse frequency conversion unit 106.
[0024]
The hardware configuration of the voice adjustment device 100 can be, for example, a device
having a CPU, a ROM, a RAM, an EEPROM, an input / output interface unit, and a dedicated chip
for controlling a sound signal.
07-05-2019
6
For example, the CPU executes the sound signal adjustment program stored in the ROM to
implement the sound signal adjustment process according to the embodiment.
Note that the sound signal adjustment program may be installed in hardware such as the audio
adjustment device 100 so that the system can be configured. Even in that case, the sound signal
adjustment program can be represented as functioning as the block shown in FIG.
[0025]
The frequency conversion unit 101 receives an input signal SI, and outputs an input frequency
signal SFx obtained by frequency conversion of the input signal SI to the feature amount
adjustment unit 103.
[0026]
The loudness estimation unit 102 estimates a loudness correction value as an auditory
characteristic of the environmental sound based on the ambient environment signal NI from the
microphone device 201, and adjusts the feature adjustment information LFx including the
estimated loudness correction value to the feature amount adjustment unit 103. Output to
[0027]
The feature amount adjustment unit 103 adjusts the frequency component of the input
frequency signal SFx based on the feature adjustment information LFx from the loudness
estimation unit 102, and outputs the loudness adjustment signal LSx to the environment
characteristic adjustment unit 105.
[0028]
The environmental characteristic conversion unit 104 receives the ambient environmental signal
NI from the microphone device 201, and outputs an environmental characteristic signal NFx
obtained by frequency-converting the ambient environmental signal NI to the environmental
characteristic adjustment unit 105.
[0029]
The environmental characteristic adjustment unit 105 adjusts the frequency component of the
loudness adjustment signal LSx based on the environmental characteristic signal NFx from the
environmental characteristic conversion unit 104, and outputs the environmental characteristic
07-05-2019
7
adjustment signal NSx to the frequency inverse conversion unit 106.
[0030]
The frequency inverse conversion unit 106 converts the environmental characteristic adjustment
signal NSx from the environmental characteristic adjustment unit 105 into a signal in the time
domain by inverse frequency conversion, and outputs the output signal SO to the speaker 200.
[0031]
(A-2) Operation of Embodiment Next, the operation of the sound signal adjustment processing in
the audio adjustment device 100 according to the embodiment will be described in detail with
reference to the drawings.
[0032]
The input signal SI is converted to the frequency domain by the frequency converter 101, and
the input frequency signal SFx converted to the frequency domain is output to the feature
amount adjuster 103.
In addition, the existing technology can be widely applied to the conversion method to the
frequency domain, and a known FFT method may be used.
[0033]
In addition, the ambient noise of the ambient environment signal NI is acquired by the
microphone device 201 as the ambient environment signal NI.
The ambient environment signal NI is output to the loudness estimation unit 102, and the
loudness estimation unit 102 estimates a loudness correction value.
Loudness is a measure of the auditory strength of a sound.
07-05-2019
8
[0034]
Next, estimation processing of the loudness correction value will be described with reference to
FIGS. 3 and 4.
[0035]
FIG. 3 is an explanatory view illustrating a method of calculating the loudness correction value
from the equal loudness curve according to the embodiment.
FIG. 4 is a flowchart showing estimation processing of the loudness correction value in the
loudness estimation unit 102 according to the embodiment.
[0036]
The loudness correction value is a frequency characteristic that sounds the same size as the
ambient sound around the microphone device 201, and is a relative value with the reference
frequency set to 0 dB.
[0037]
In FIG. 4, the loudness estimation unit 102 estimates the sound pressure level Pn of the ambient
environment signal NI input by the microphone device 201 (step S1).
Here, estimation of the sound pressure level Pn of the ambient environment signal NI is designed
in advance based on the characteristics of the microphone device 201.
[0038]
Next, the loudness estimation unit 102 selects an equal loudness curve in which the sound
pressure level Pn of the ambient environment signal NI matches the sound pressure level of the
equal loudness curve at the reference frequency Ff (step S2).
07-05-2019
9
The loudness estimation unit 103 calculates the difference between the sound pressure level Pn
at each frequency component and the sound pressure level at the reference frequency Ff in the
selected equal loudness curve (step S3).
The loudness estimation unit 103 outputs the difference value between the sound pressure level
Pn at each frequency component and the sound pressure level at the reference frequency Ff to
the feature amount adjustment unit 103 (step S4).
[0039]
Here, in this embodiment, for example, the case where the reference frequency Ff is 1 kHz will be
described.
However, the reference frequency Ff is not limited to 1 kHz, and may be another frequency or a
dynamic frequency value.
For example, the ambient environment signal NI may be calculated at regular intervals using a
known autocorrelation function, and the frequency calculated from the maximum delay amount
of the autocorrelation function may be used as the reference frequency Ff.
[0040]
The calculation method of the difference value of the sound pressure level for every said
frequency component is illustrated using FIG.
[0041]
Here, the equal loudness curve indicates a change in sound pressure level obtained by physically
measuring that a human senses listening at the same loudness level based on the change in
frequency for each level of loudness. It is a curve.
[0042]
The equal loudness curve is replaced with a table listing a plurality of frequency characteristics
07-05-2019
10
as shown in FIG. 3 (A).
FIG. 3A shows sound pressure levels of frequency components of equal loudness curves (a1, a2,
a3,... An,...) As frequency components (F1, F2,..., Ff,. Are listed.
[0043]
For example, the loudness estimation unit 102 sets the sound pressure level of each equal
loudness curve at the reference frequency Ff as exemplified in FIG. 3A (for example, a1 (f1), a1
(f2), in the case of equal loudness curve a1). An equal loudness curve that matches the sound
pressure level Pn of the ambient environment signal NI is selected from ..., a1 (ff, ... etc.) (see FIG.
3B).
In FIG. 3B, it is assumed that the equal loudness curve an is selected.
[0044]
The loudness estimation unit 102 determines the sound pressure level (an (f1), an (f2),..., An
(ff),...) At each frequency component of the equal loudness curve an and the sound pressure level
at the reference frequency Ff. The difference with an (Ff) is calculated, and the difference value
Dn of the sound pressure level is as shown in FIG. 3 (C). Therefore, the sound pressure level
difference at the reference frequency Ff is 0 dB.
[0045]
The loudness correction value is used as the difference value Dn of the sound pressure level
calculated and obtained as described above. The difference value Dn corresponds to the
frequency characteristic illustrated in FIG. 5B described later.
[0046]
07-05-2019
11
Here, in this embodiment, the equal loudness curve illustrated in FIG. 3A exemplifies the case of
using the equal loudness curve described in International Standardized Standard ISO 226: 2003.
However, the equal loudness curve may be changed in consideration of the frequency
characteristics of the microphone device 201 and the speaker device 200. When the sound
pressure level of the equal loudness curve is larger than the sound pressure level Pn of the
acquired ambient environment signal NI, the value of the sound pressure level difference
becomes a positive number, and is higher than the sound pressure level Pn of the acquired
ambient environment signal NI. When the sound pressure level of the equal loudness curve is
small, the value of the sound pressure level difference is a negative number. Thus, the sound
output from the speaker device 200 can be masked by the microphone device 201 by setting the
sound to be heard (loudness) equivalent to the ambient environmental sound. The loudness
estimation unit 102 outputs the loudness correction value estimated as described above to the
feature amount adjustment unit 103 as feature adjustment information LFx.
[0047]
Next, the adjustment method of the frequency component in the feature amount adjustment unit
103 will be described using FIG.
[0048]
FIG. 5 is an explanatory diagram for explaining frequency characteristics of signals input and
output by the feature amount adjustment unit 103 according to the embodiment.
[0049]
The feature amount adjustment unit 103 receives the input frequency signal SFx and the feature
adjustment information LFx.
In the feature amount adjustment unit 103, the frequency component of the input frequency
signal SFx is SFn = {SF1, SF2,...}, The frequency component at the reference frequency Ff is the
reference gain SF (Ff), and the frequency component LFn of the feature adjustment information
LFx = {LF1 , LF2, ...}, and adjust as follows for each frequency.
[0050]
Feature amount adjustment section 103 has a value (SF (Ff) + LFn) obtained by adding the value
of reference gain SF (Ff) at reference frequency Ff to the value of frequency component LFn of
07-05-2019
12
feature adjustment information LFx, and the frequency component of input frequency signal SFx
The value of the frequency component SFn of the input frequency signal SFx is adjusted
according to the comparison result with the value of SFn.
[0051]
When SFn ≦ (SF (Ff) + LFn) is satisfied, the feature amount adjustment unit 103 does not adjust
the value of the frequency component SFn of the input frequency signal SFx.
That is, the feature amount adjustment unit 103 outputs the value of the frequency component
SFn of the input frequency signal SFx as the value of the frequency component LSn of the
loudness adjustment signal LSx (that is, LSn = SFn).
[0052]
When SFn> (SF (Ff) + LFn) is satisfied, the feature amount adjustment unit 103 adjusts the value
of the frequency component SFn of the input frequency signal SFx.
That is, feature amount adjustment section 103 adds a value (SF (Ff) + LFn) obtained by adding
the value of reference gain SF (Ff) at reference frequency Ff to the value of frequency component
LFn of feature adjustment information LFx. It adjusts and outputs so that it may become the
value (namely, LSn = (SF (Ff) + LFn)) of the frequency component LSn.
[0053]
FIG. 5A is an explanatory view showing a frequency characteristic of the input frequency signal
SFx, and FIG. 5B is an explanatory view showing a frequency characteristic of the feature
adjustment information LFx.
[0054]
In FIG. 5C, the solid line indicates the characteristic of the frequency component of the input
frequency signal SFx of FIG. 5A, and the dotted line indicates the value of the frequency
component of the feature adjustment information LFx, the reference gain SF at the reference
frequency Ff. The characteristic (ie, SF (Ff) + LFn) when the value of (Ff) is added is shown.
07-05-2019
13
[0055]
As shown in FIG. 5C, the feature amount adjustment unit 103 uses the characteristic (solid line)
of the frequency component of the input frequency signal SFx and the value of the frequency
component of the feature adjustment information LFx as the reference gain SF at the reference
frequency Ff. By comparison with the characteristic (dotted line) when the value of Ff) is added,
the frequency characteristic of the loudness adjustment signal LSx as shown in FIG.
[0056]
The frequency components in FIG. 5A have the characteristics of the solid line in FIG. 5D due to
the adjustment.
However, the suppression amount may be changed in accordance with the frequency
characteristic of the sound output from the speaker device 200 as the output signal SO.
[0057]
The environmental characteristic conversion unit 104 acquires the frequency component of the
reference frequency Ff of the ambient environmental signal NI and outputs it as the
environmental characteristic signal NFx.
In this embodiment, after frequency conversion of the ambient environment signal NI by a
known FFT method, the component NS (Ff) of the reference frequency Ff is used as the
environment characteristic signal NFx.
That is, the frequency component at the reference frequency Ff in FIG. 6B becomes the
environmental characteristic signal NFx. However, the method of acquiring the environmental
characteristic signal NFx is not limited to the above method. For example, a coefficient taking into
consideration the frequency characteristic of the sound output from the speaker device 200 may
be added to the frequency component NS (Ff) acquired by the above method. That is, when the
ambient environment signal NI is output from the speaker device 200, frequency characteristic
conversion coefficients may be added so as to have the same frequency characteristics as the
07-05-2019
14
sound around the microphone device 201, and the result may be used as the environment
characteristic signal NFx. Further, the reference frequency Ff is the same as the reference
frequency Ff used in the loudness estimation unit 102, and is 1 kHz in this embodiment.
[0058]
FIG. 6 is an explanatory diagram for explaining frequency characteristics of signals input and
output by the environment characteristic adjusting unit 105 according to the embodiment.
[0059]
The loudness adjustment signal LSx and the environmental characteristic signal NFx are input to
the environmental characteristic adjustment unit 105.
[0060]
In the environmental characteristic adjustment unit 105, the loudness adjustment signal LSx is
set such that the value of the frequency component of the loudness adjustment signal LSx and
the value of the frequency component of the input environmental characteristic signal NFx at the
reference frequency Ff have similar magnitudes. The entire signal is amplified and output to the
frequency inverse converter 106 as the environmental characteristic adjustment signal NSx.
[0061]
FIG. 6A is an explanatory view showing the frequency characteristic of the loudness adjustment
signal LSx, and the solid line shows the frequency characteristic of the loudness adjustment
signal LSx.
The loudness adjustment signal LSx shown in FIG. 6 (A) is the same as that shown in FIG. 5 (D)
described above.
[0062]
FIG. 6B shows the frequency characteristic of the environmental characteristic signal NFx.
[0063]
As shown in FIG. 6C, the environmental characteristic adjusting unit 105 has the value of the
07-05-2019
15
frequency component LS (Ff) of the loudness adjustment signal LSx at the reference frequency Ff
and the frequency component NF (Ff) of the environmental characteristic signal NFx at the
reference frequency Ff. The difference value with the value of) is set as the amplification amount
A.
[0064]
That is, the environmental characteristic adjustment unit 105 calculates the value of
amplification amount A = NF (Ff) −LS (Ff) and amplifies it to the entire loudness adjustment
signal LSx (that is, the value of each frequency component of the loudness adjustment signal
LSx). Add (amplify) the amount A.
[0065]
The frequency characteristic of the loudness adjustment signal LSx indicated by the solid line in
FIG. 6A is adjusted to the frequency characteristic of the environment characteristic adjustment
signal NSx indicated by the solid line in FIG. Be done.
[0066]
However, the method of determining the amplification amount A may be changed in accordance
with the characteristics of the output signal SO output from the speaker device 200.
In addition, even in the determination that the component of the reference frequency Ff of the
loudness adjustment signal LSx matches the input environmental characteristic signal NFx, it is
not limited to this. For example, it is determined that the case where it is within LS (Ff) ± 3 dB is
matched You may
Furthermore, the frequency component LS (Ff) at the reference frequency Ff of the loudness
adjustment signal LSx may not coincide with the environmental characteristic signal NFx.
[0067]
Furthermore, in the environmental characteristic adjustment unit 105 of this embodiment, the
case where the amplification amount A is added to the entire loudness adjustment signal LSx is
07-05-2019
16
illustrated.
However, for example, only the frequency component higher than the reference frequency Ff
may be added with the amplification amount A, and as a result, it may be adjusted so that the
characteristic that the output signal SO is output from the speaker device 200 becomes
appropriate.
[0068]
The environmental characteristic adjusting unit 105 sets the signal subjected to the adjustment
as the environmental characteristic adjusting signal NSx and outputs the signal to the frequency
inverse converter 106.
In the frequency inverse converter 106, the environmental characteristic adjustment signal NSx
is converted from the frequency domain to the time domain, and is output to the speaker device
200 as the output signal SO.
[0069]
In order to suppress the diffusion of the output sound, without using the masking sound, by
making the component close to the surrounding environmental sound to suppress the output
sound, it becomes difficult to audibly listen to the sound and the surroundings to suppress the
output sound It is possible to suppress the spread of the output sound without causing another
sound to be output.
In this embodiment, the feature amount adjustment unit 103 and the environment characteristic
adjustment unit 105 are applied to the input frequency signal SFx in this order, but the effect is
not affected even if the application order is reversed.
[0070]
(A-3) Effects of the Embodiment As described above, according to this embodiment, in order to
suppress the diffusion of the output sound, no masking sound is used, and a component close to
the ambient sound that suppresses the output sound is used. This makes it difficult to audibly
07-05-2019
17
listen to the sound and to suppress the diffusion of the output sound without causing another
sound to be output around the output sound to be suppressed.
[0071]
(B) Other Embodiments In the above-described embodiment, various modified embodiments are
mentioned, but the present invention can be applied to the following embodiments.
[0072]
In the above-mentioned embodiment, the case where the present invention is an audio
adjustment device was illustrated.
However, the present invention can be embodied as, for example, a system, a method, a program,
a storage medium, and the like.
Specifically, the present invention may be applied to a system configured of a plurality of devices,
or may be applied to an apparatus composed of one device and a program installed in the
apparatus.
[0073]
The above-described embodiment exemplifies the case where there is one microphone device.
However, a plurality of microphone devices may be provided, and environmental sound
characteristics may be calculated in consideration of sound pressure levels of environmental
sound signals from the plurality of microphone devices.
[0074]
In the above-described embodiment, the input signal input to the speech diffusion suppressing
apparatus may be, for example, a recorded sound signal or a sound signal received through a
07-05-2019
18
connected communication line. In addition, it may be a sound signal acquired by a separately
provided microphone device.
[0075]
DESCRIPTION OF SYMBOLS 10 Speech diffusion suppression apparatus 200 Speaker apparatus
201 Microphone apparatus 100 Audio adjustment apparatus 101 Frequency conversion part
102 Loudness estimation part 103 Feature quantity adjustment part 104 Environment
characteristic conversion part 105 ... environment characteristic adjustment unit, 106 ...
frequency inverse conversion unit.
07-05-2019
19
Документ
Категория
Без категории
Просмотров
0
Размер файла
27 Кб
Теги
jp2017147636
1/--страниц
Пожаловаться на содержимое документа