close

Вход

Забыли?

вход по аккаунту

?

JP2006201286

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2006201286
The present invention provides an onboard sound pickup apparatus and an in-vehicle sound
pickup information display method capable of grasping the direction of a sound source and the
volume of the sound source more intuitively than in the prior art. SOLUTION: The voice of a
speaker is picked up in a car by two microphones 17A and 17B installed in a car, and image data
representing an input sound pressure level and image data representing directivity by two
microphones 17A and 17B are obtained. The display data is read out from the data memory 14
and dot-expanded display data is generated from the image data and displayed on the display 15.
Therefore, the direction and volume of the speaker's voice as a sound source are made more
intuitive than before. It becomes possible to grasp, for example, the navigation device can
perform voice operation at an appropriate utterance volume. [Selected figure] Figure 1
In-vehicle sound collection device and in-vehicle sound collection information display method
[0001]
The present invention relates to an in-vehicle sound collection device and an in-vehicle sound
collection information display method suitable for use in a navigation device which enables an
operation by voice recognition, a hands free unit for using a mobile phone in a car, and the like.
[0002]
Heretofore, there have been developed a navigation device capable of the above-described
operation by voice recognition and a hands-free unit for using a mobile phone in a car.
08-05-2019
1
As a navigation device that enables an operation by voice recognition, one that can distinguish
voice against noise has been proposed (see, for example, Patent Document 1). In this proposal,
the noise input to the microphone and the input voice are compared to determine whether the
ratio of the voice signal to the noise signal is equal to or greater than a predetermined value, and
the ratio of the voice signal is equal to or greater than the predetermined value. Make a
predetermined display on. The speaker speaks while watching this display, and adjusts the
volume of the voice at the time of speaking.
[0003]
By the way, in patent document 1, although there is no problem in particular when using what
does not have directivity as a microphone, when using what has directivity, it is natural to use an
audio signal within the range and out of the range. Since the ratio will be different, the display
will not be constant. That is, the ratio of the noise signal to the voice signal within the directivity
range and the ratio of the noise signal to the voice signal outside the directivity range will be
different, and naturally, the ratios will not be the same.
[0004]
Even if there is no directivity, it is possible to estimate the direction of the sound source by
preparing at least two microphones, and by estimating the direction of the sound source, it is
constant no matter from which position the sound is emitted Display becomes possible.
[0005]
Here, a system has been proposed in which the direction of the sound source is estimated (see,
for example, Patent Document 2).
In this proposal, a plurality of microphones are arranged at regular intervals in the vertical and
horizontal directions, and the arrival time difference of the collected sound caused by the
arrangement position of each microphone and the eigenvalue and eigenvector for each frequency
of the collected sound signal are calculated. The noise component of the collected signal is
calculated by the equation (4), and the arrival direction of the sound source is estimated based on
the calculated noise component. Further, in this proposal, the estimated sound source direction is
displayed so that the sound source direction can be visually grasped.
08-05-2019
2
[0006]
JP-A-9-44183 JP-A-2002-186084
[0007]
However, although the technique disclosed in Patent Document 1 gives an indication as to how
loud the noise should be produced with respect to noise, it is intuitive to know how loud it is
even if it is above the noise level. I don't know.
The technique disclosed in Patent Document 2 makes it possible to visually grasp the estimated
sound volume of the sound source by changing the color and density of the displayed plot based
on the magnitude, but intuitively It is difficult to do.
[0008]
The present invention has been made in view of such circumstances, and it is an object of the
present invention to provide an in-vehicle sound collection device and an in-vehicle sound
collection information display method capable of grasping the direction of a sound source and its
volume more intuitively than before. To aim.
[0009]
The above object is achieved by the following configurations and methods.
According to the present invention, there is provided an on-vehicle sound collection device
comprising: sound collection means for collecting a speaker's voice in a car; and directionality of
the sound collection means when sound collection of the speaker is performed by the sound
collection means And a display means for simultaneously displaying an image representing the
image and an image representing the input sound pressure level of the speaker voice to the
sound collection means.
[0010]
08-05-2019
3
With this configuration, when the voice of the speaker is picked up, an image showing the
directivity of the sound pickup means and an image showing the input sound pressure level of
the speaker voice to the sound pickup means are simultaneously displayed. The direction of the
sound source and the volume thereof can be grasped more intuitively than before.
[0011]
Further, in the on-vehicle sound collection device of the present invention, the display unit
displays an image representing an input sound pressure level of sound to the sound collection
unit when the sound collection is performed by the sound collection unit. .
[0012]
With this configuration, when the sound such as noise is picked up, an image representing the
input sound pressure level of the sound to the sound pickup means is displayed, so the noise etc.
in the car can be intuitively grasped .
[0013]
Further, in the on-vehicle sound pickup apparatus of the present invention, the sound pickup unit
includes two microphones and spectral subtraction processing unit that obtains narrow
directivity with the two microphones.
[0014]
With this configuration, two microphones and spectral subtraction processing can electrically
produce narrow directivity, and can efficiently pick up the respective utterances of the driver's
seat speaker and the passenger's seat speaker, etc. it can.
[0015]
Further, the on-vehicle sound pickup apparatus of the present invention comprises arithmetic
means for estimating the sound source direction from the sound signal picked up by the sound
pickup means, and the display means is directed according to the direction estimated by the
arithmetic means. Change the display of the image representing
[0016]
According to this configuration, the directivity can be automatically changed, and the
convenience can be improved as compared with the case of changing it manually.
08-05-2019
4
[0017]
Further, in the on-vehicle sound collection device of the present invention, the display means
displays an image representing an input sound pressure level of the speaker's voice for each of
the estimated sound source directions when there are a plurality of speakers.
[0018]
With this configuration, an image representing directivity and an image representing the input
sound pressure level of the speaker's voice can always be displayed in the same positional
relationship, so that the utterance volume can be correctly grasped for each speaker.
[0019]
Further, in the vehicle-mounted sound collection device of the present invention, the display
means automatically starts display at the start of communication using the sound collection
means.
[0020]
With this configuration, when a hands free call using a mobile phone is received, etc., the display
is automatically started. Therefore, the timing of the voice and the display coincide with each
other, and the speaker can correctly grasp the speech volume.
Also, manual operation for starting display can be eliminated.
[0021]
Further, in the on-vehicle sound collection device of the present invention, the display means
automatically starts display when the voice recognition function of the navigation device is
activated.
[0022]
According to this configuration, since the display is automatically started when the speech
recognition function of the navigation device is activated, the timing of the speech and the
display matches, and the speaker can correctly grasp the speech volume.
08-05-2019
5
[0023]
Further, in the in-vehicle collected sound information display method of the present invention, an
input sound pressure level when sound of a speaker in the car is collected by a microphone is
measured, and an image representing the measured input sound pressure level and the
microphone An image showing directivity is visibly displayed on a display.
[0024]
By this method, the speaker can more intuitively grasp the direction of the sound source and the
volume thereof than before.
[0025]
According to the vehicle-mounted sound collection device of the present invention, when sound
collection of a speaker's voice is performed, an image representing the directivity of the sound
collection means and an input sound pressure level of the speaker voice to the sound collection
means are displayed. Since the image is simultaneously displayed, the direction of the sound
source and the volume thereof can be grasped more intuitively than before.
[0026]
Hereinafter, preferred embodiments for carrying out the present invention will be described in
detail with reference to the drawings.
[0027]
FIG. 1 is a block diagram showing a schematic configuration of a vehicle-mounted sound
collecting device according to an embodiment of the present invention.
In this figure, an on-vehicle sound pickup device 10 according to the present embodiment
includes a CPU 11 (arithmetic means) for controlling each part of the device, a program memory
12 storing a program for operating the CPU 11, and a work used in the operation of the CPU 11.
A memory 13, a data memory 14 storing various image data, a display 15 using a color liquid
crystal panel, and a display control unit 16 for controlling display of an image related to sound
collection on the display 15, respectively It is equipped with a spectral subtraction processing
unit 18 which realizes narrow directivity with two non-directional microphones 17A and 17B,
two non-directional microphones 17A and 17B, and an operation unit 19 for performing various
08-05-2019
6
operations in the present apparatus. ing.
The microphones 17A and 17B and the spectral subtraction processing unit 18 constitute a
sound collection means.
Also, the sound collection means may be composed of one directional microphone.
In this embodiment, an example in which two microphones are used to change the directivity of
sound collection by the sound collection means will be described.
In the case of one directional microphone, it can be implemented by mounting the microphone in
a rotating manner.
[0028]
The CPU 11, the program memory 12, the work memory 13, the data memory 14, the display
control unit 16, the spectral subtraction processing unit 18 and the operation unit 19 are
connected by a bus line 20.
[0029]
Although the two microphones 17A and 17B are nondirectional, respectively, the spectral
subtraction processing unit 18 can obtain an electrically narrow directivity.
Further, the direction of the directivity is automatically changed by the CPU 11 performing the
sound source direction estimation process.
For example, when speaking at the driver's seat, the directivity is directed to the driver's seat.
In this case, as shown in FIG. 2, an image G3 representing narrow directivity directed toward the
driver's seat is displayed on the display 15.
08-05-2019
7
In this figure, G1 is an image representing a vehicle, G2 is an image representing a microphone,
G4 is an image representing the input sound pressure level (the loudness of the utterance)
(details will be described later), and G5 is a vehicle seat. It is an image to represent.
The images of the vehicle G1, the microphone G2 and the sheet G5 may be images such as
bitmaps stored in the data memory 14 in advance.
Further, it may be an image of a camera installed indoors on the ceiling of the vehicle.
[0030]
The CPU 11 displays the above-described sound source direction estimation processing
(processing for estimating the sound source direction from the voice output of the microphones
17A and 17B, that is, the direction in which the passenger who utters is located) Processing,
processing of changing directivity to the estimated sound source direction, processing of
displaying an image representing the input sound pressure level on the display 15, and sound
pickup using one of the two microphones 17A and 17B. And processing for obtaining the noise
level and displaying the input sound pressure level.
[0031]
FIG. 3 is a view showing an example of an image representing an input sound pressure level.
In the example shown in this figure, the input sound pressure level is represented by four images
of "small", "medium", "large" and "maximum".
In this case, the area of the image is increased from “small” to “maximum”, and in
particular, in the “maximum” image, for example, to notify that the speaker's voice volume is
level over. In order to understand the severity, we are trying to express "Twigly."
In addition, colors are also added to each image.
08-05-2019
8
For example, "green" is added to "small" and "medium", "orange" is added to "large", and "red" is
added to "maximum".
Specifically, assuming that the passenger (driver) at the driver's seat utters a sound volume
corresponding to "high" with the directivity of the microphones 17A and 17B directed to the
driver's seat, as shown in FIG. As shown in FIG. 2, an image G3 representing directivity and an
image G4 representing a "large" input sound pressure level are simultaneously displayed.
For example, “blue” is added to the image G3 representing the directivity described above.
[0032]
On the other hand, estimation of the sound source direction is performed using two microphones
17A and 17B. FIG. 4 is a functional block diagram of the CPU 11 that performs sound source
direction estimation. In this figure, the two microphones 17A and 17B are arranged at a known
distance ΔL. When audio signals are output from the microphones 17A and 17B, the crosscorrelation calculating unit 110 correlates two time waveforms Twa and Twb in a frame length N
(can be T, and the unit is time). That is, it is calculated to what extent the delayed time waveform
Twb is shifted to coincide with the time waveform Twa.
[0033]
Next, from the correlation waveform Twc calculated by the cross-correlation calculation unit 110,
the Δt estimation unit 111 estimates a temporal shift Δt of the two time waveforms Twa and
Twb. Then, the arrival angle θ calculation unit 112 calculates the arrival angle θ of the sound
from the estimated temporal deviation Δt. In this case, the arrival angle θ is obtained using the
equation Δx = ΔL · cos θ = v · Δt (where 0 ° ≦ θ ≦ 90 °), where v is the speed of sound.
The CPU 11 obtains the arrival angle θ of the sound in this manner. Then, change the directivity
in the direction determined.
[0034]
In the noise level display by the CPU 11, as described above, one of the two microphones 17A
08-05-2019
9
and 17B is collected to obtain the noise level, and the obtained noise level is displayed in an area
other than the directivity display area. In this case, for example, if a color is used, "black" is
displayed when the noise level is low, and "yellow" is displayed when the noise level is high.
Image data for performing this display is also stored in the data memory 14. The data memory
14 stores image data for representing the images G1 to G5 described above.
[0035]
Next, the operation of the on-vehicle sound collection device having the above configuration will
be described. In this case, a case where an on-vehicle sound pickup device is used for a
navigation device will be described. When the passenger performs an operation of activating the
voice recognition function of the navigation device, the CPU 11 starts voice collection triggered
by this operation. Further, image data for representing a vehicle, image data for representing a
microphone, and image data for representing a seat of a vehicle are read out from the data
memory 14 and input to the display control unit 16. The display control unit 16 generates
display data in which dots are expanded from each input image data, and displays the display
data on the display 15. Thereby, an image G1 representing a vehicle, an image G2 representing a
microphone, and an image G5 representing a seat of the vehicle as shown in FIG. 2 are displayed.
[0036]
In sound collection, first, one of the two microphones 17A and 17B is used to measure the noise
level. When the noise level is measured, image data for representing the noise level is read out
from the data memory 14 and input to the display control unit 16. The display control unit 16
generates display data in which dots are expanded from the input image data, and displays the
display data on the display 15. At this time, color is also added. For example, "black" is added
when the noise level is below a predetermined level, and "yellow" is added when the noise level
exceeds the predetermined level.
[0037]
After displaying the noise level, when there is an audio input, the two microphones 17A and 17B
are used to estimate the sound source direction. After estimating the sound source direction, the
input sound pressure level is detected. If the input sound pressure level can be detected, the
08-05-2019
10
image data for representing directivity corresponding to the sound source direction and the
image data for representing the input sound pressure level are read out from the data memory
14 and these are input to the display control unit 16 . The display control unit 16 generates
display data in which dots are expanded from each input image data, and displays the display
data on the display 15.
[0038]
For example, if an utterance is made at the driver's seat and the input sound pressure level at
that time is "high", as shown in FIG. 2, an image G4 representing the input sound pressure level
"high" and the directivity toward the driver's seat The image G3 showing the sex is displayed. The
above process is repeated at a predetermined timing, and in each case, an image representing the
noise level, the input sound pressure level, and the directivity is updated and displayed. For
example, when the driver's seat changes from the driver's seat to the passenger's seat, an image
G3 showing the directivity toward the passenger's seat is displayed at the next timing after the
change, and the input sound pressure level is An image G4 to be displayed is displayed. Thus, the
display of the directivity is switched following the change of the microphone directivity, and the
input sound pressure level is displayed.
[0039]
As described above, according to the on-vehicle sound collection device 10 of the present
embodiment, the two microphones 17A and 17B installed in the vehicle collect the voice of the
speaker in the vehicle, and the image data representing the input sound pressure level The image
data representing the directivity of the two microphones 17A and 17B is read out from the data
memory 14, and dot-expanded display data is generated from these image data and displayed on
the display 15, so that the speaker's utterance is used as a sound source. The direction and the
volume can be grasped more intuitively than in the past, and the navigation device can perform
voice operation at an appropriate vocal volume.
[0040]
In addition, since the noise level is detected using either one of the two microphones 17A and
17B and an image representing the noise level is displayed on the display 15, the noise level can
be intuitively grasped. It becomes possible.
[0041]
In addition, since the spectral subtraction processing unit 18 electrically produces narrow
08-05-2019
11
directivity, it is possible to efficiently collect the respective utterances of the speakers on the
driver's seat side and the front passenger's side.
[0042]
In addition, since the direction of the sound source is estimated and the directivity is changed in
that direction, the convenience can be improved as compared with the case of manual change.
[0043]
Further, since the image representing the directivity and the image representing the input sound
pressure level of the speaker's voice are always displayed in the same positional relationship, it is
possible to correctly grasp the speech volume for each speaker.
[0044]
Although two microphones 17A and 17B are provided in the present embodiment, the number is
not limited and may be two or more.
In addition, it may be one directional microphone installed on a movable mount.
[0045]
Further, although the present embodiment is applied to the navigation device, the present
invention can, of course, be applied to a hands-free unit using a mobile phone.
When applied to a hands-free unit, it is good to start the display automatically when an incoming
call.
[0046]
Further, in the present embodiment, the directivity is displayed for the driver's seat and the
passenger's seat, but the directivity for both the driver's seat and the passenger's seat may be
displayed.
08-05-2019
12
[0047]
The present invention has an effect that the direction of the sound source and the volume thereof
can be grasped more intuitively than in the past, and can be applied to a navigation device, a
hands free unit of a mobile phone, and the like.
[0048]
A block diagram showing a schematic configuration of a vehicle-mounted sound collecting device
according to an embodiment of the present invention A diagram showing an example of a display
of the vehicle-mounted sound collecting device according to the embodiment A figure showing
an example of a picture showing pressure level A figure for explaining sound source direction
presumption processing of a vehicle-mounted sound collection device concerning the abovementioned embodiment
Explanation of sign
[0049]
DESCRIPTION OF SYMBOLS 10 in-vehicle sound collection apparatus 11 CPU (calculation means)
12 program memory 13 work memory 14 data memory 15 display (display means) 16 display
control part 17A, 17B Microphone (sound collection means) 18 spectral subtraction processing
part (sound collection means) 19 operation unit 110 cross correlation calculation unit 111 Δt
estimation unit 112 arrival angle θ calculation unit
08-05-2019
13
Документ
Категория
Без категории
Просмотров
0
Размер файла
23 Кб
Теги
jp2006201286
1/--страниц
Пожаловаться на содержимое документа