JP2004147205

Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2004147205
[PROBLEMS] To provide an image / sound recording device capable of stereo recording of sound
and capable of recording a realistic moving image. An image and sound recording apparatus 10
captures an object scene and forms an image signal 103 representing the object scene. Also, the
left and right sounds of the object are collected to form a left sound signal 108 and a right sound
signal 110, respectively. Further, a motion vector is detected from the image signal 103 by signal
processing, and the most likely moving direction in the image is determined from the motion
vector. The left audio signal 108 and the right audio signal 110 are respectively adjusted so that
the left and right volume balance changes according to the moving direction, and these audio
signals are stereo-recorded to enhance the sense of movement of the audio, so that realistic video
recording is realized. [Selected figure] Figure 1
Image and voice recording device
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an
image and sound recording apparatus capable of stereo sound recording and capable of
recording moving pictures or still moving pictures. 2. Description of the Related Art
Conventionally, in order to reproduce sound with a sense of reality when playing back moving
pictures, it is known for a video and audio recording apparatus, for example when photographing a
moving sounding body, to apply specific signal processing to the sound signal and to record these
signals on a recording medium. For example, since the time required for sound image localization
processing is longer than that for image signal processing, the position of the sounding body on
the reproduction screen may not coincide with the sound image localization
position in the reproduction sound. In order to eliminate this inconsistency, the motion vector of
the sounding body is detected from the image signal obtained by capturing the moving sounding
body, and the position of the sounding body after a certain period of time is predicted based on
10-05-2019
1
this motion vector. A sound image localization control device is known that reproduces sound
realistically by performing sound image localization processing of the sound signal in this way
(Japanese Patent Application Laid-Open No. 2000-112118). [Patent Document 1] JP-A-8-181962.
The sound image localization control device described in the above-mentioned Patent Document 1
moves the sound image localization in accordance with the movement of the sounding body, so it
can be advantageously applied to video conference systems and the like in which the sounding body
is specified. However, this device performs sound image localization processing of the sound
emitted by the sounding body according to the movement of the sounding body, and the sound source
from which sound is collected is specified. Consequently, when the scene contains a sound source
other than the sounding body, it is not possible to reproduce sound that represents the movement
of that sound source. For example, even when the other sound source and the sounding body move in
different directions and the amount of movement of the sound source is larger than that of the
sounding body, the sound from the sound source is subjected to sound image localization processing
according to the movement of the sounding body; and when the sounding body does not move, the
sound from the sound source is not subjected to sound image localization processing even if the
sound source moves. As a result, the sound reproduced for the object scene as a whole expresses a
sense of movement that feels unnatural, and the sense of realism is lost.
[0006] By the way, some image and sound recording devices such as video cameras generally
have microphones on the left and right for stereo recording, but these are provided close to each
other because of mechanical limitations of the cameras. The distance between microphones is
short. Therefore, when shooting an object scene using this apparatus, it is difficult to reproduce a
three-dimensional effect even if sound is collected in a sound field corresponding to the object
scene.
In particular, when the sounding body moves, it has been difficult to give the reproduced sound a
proper sense of movement. The present invention is intended to overcome these drawbacks of the
prior art and to provide an image and sound recording apparatus that can record audio in stereo so
as to increase the sense of reality during reproduction, and that can record moving pictures or
still moving pictures. According to the present invention, a video / audio recording apparatus
includes an imaging section that images an object field to form an image signal representing the
object field, and a plurality of microphones that collect the sound of the object field to form
audio signals, the plurality of microphones including a left microphone that collects left audio
of the object field to form a first left audio signal and a right microphone that collects right
audio to form a first right audio signal. The apparatus comprises detection means for detecting a
motion vector from the image signal, and adjusting means for adjusting the first left audio signal
and the first right audio signal so as to change the left and right volume balance according to
the motion vector, thereby forming a corresponding second left audio signal and second right audio
signal, respectively. The adjusting means determines the dominant moving direction of the image
from this motion vector and adjusts the signals so that the volume on the side of that direction
increases.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Referring now to the attached
drawings, an embodiment in which a video and audio recording apparatus according to the
present invention is applied to a digital camera will be described in detail. As shown in FIG. 1,
in the image / sound recording apparatus 10 according to the embodiment, the signal processing
unit 16 detects a motion vector of an image captured by the imaging unit 14 through the optical
lens system 12, the audio information calculation unit 18 determines the volume balance according
to the motion vector, and the audio signal preprocessing unit 24 preprocesses the audio signals
collected by the left microphone 20 and the right microphone 22 according to the volume balance.
The signal processing unit 16 processes the image signal and the preprocessed audio signals and
outputs them to the information recording medium 26. Note that parts not directly related to the
understanding of the present invention are not shown, and redundant explanations are avoided. In
the image and sound recording apparatus 10, the optical lens system 12 is an imaging optical
system that forms an object scene image on the imaging surface of the imaging unit 14. The imaging
unit 14 has a function of photoelectrically converting the object scene image into the electric
signal 102, and may be, for example, a solid-state imaging device such as a charge coupled device.
The electric signal 102 is an analog image signal. In this embodiment, the analog-to-digital
converter 15 performs analog-to-digital (A / D) conversion processing on it and outputs the
resulting digital image signal 103 to the signal processing unit 16.
In the following description, each signal is identified by the reference numeral of the connection
line on which it appears. The signal processing unit 16 has a function of processing an audio
signal, an image signal, and the like as digital signals. In the present embodiment in particular,
it includes the motion vector detection unit 17 and the memory buffer 27. The motion vector
detection unit 17 has a function of detecting a motion vector of the pattern represented by the
digital image signal 103, and outputs the detected motion vector as an information signal 104 to
the audio information calculation unit 18. The memory buffer 27 is an internal memory that
temporarily stores the digital image signal of the immediately preceding frame or of several
frames before, and the stored digital image signal is used in the motion vector detection unit 17.
The signal processing unit 16 also performs digital signal processing on the digital image signal
103 and on the audio signals supplied from the analog-to-digital converter 25, which in this
embodiment are the digital left audio signal 112 and the digital right audio signal 114. For
example, this signal processing generates a recording signal 116 that can be recorded on the
information recording medium 26, and the recording signal is recorded on the information recording
medium 26. The audio information calculation unit 18 obtains an adjustment amount for changing the
left and right volume balance according to the motion vector represented by the supplied
information signal 104. For example, the position of each part of the image and the direction and
amount of its movement are determined from this motion vector, and when parts move in one
direction and that direction is the most dominant in the entire image, an adjustment amount that
emphasizes the volume on that side is obtained. In this embodiment, so as to correspond to the
left and right volume, this adjustment amount may, for example, change the left gain and the right
gain, and it is supplied to the audio signal preprocessing unit 24 as the information signal 106.
Further,
a microcomputer or the like may be used as the audio information calculation unit 18. The left
microphone 20 and the right microphone 22 have a function of collecting audio; they generate an
analog left audio signal 108 and an analog right audio signal 110, respectively, and output them
to the audio signal preprocessing unit 24. In the present invention, three or more microphones may
be provided in order to collect the sound of the object scene from various directions and make it
more realistic. In the present embodiment in particular, the left microphone 20 and the right
microphone 22 may be disposed on the left side and the right side of the imaging unit 14 in order
to collect sounds from the left and right of the object field and perform stereo recording.
The audio signal preprocessing unit 24 has a function of processing the analog audio signals to
generate audio signals that emphasize a sense of movement, and in the present embodiment it
contains the left audio signal amplifier 21 and the right audio signal amplifier 23. These
amplifiers 21 and 23 may be, for example, analog amplifiers such as microphone amplifiers; they
amplify or attenuate the supplied analog left audio signal 108 and analog right audio signal 110
with the left gain and the right gain to generate the left audio signal 109 and the right audio
signal 111, respectively. These gains are variable and are adjusted by the information signal 106,
which represents the adjustment amount of the audio. In the present embodiment, the audio signals
109 and 111 are supplied to the analog-to-digital converter 25, subjected to A / D conversion
processing, and output to the signal processing unit 16. The information recording medium 26 has a
function of
recording an image signal and an audio signal, and a memory card on which a semiconductor memory
is mounted, or a package containing a rotary recording medium such as a magneto-optical disk, may
be used. Next, the operation of the image and sound recording apparatus 10 in this embodiment will
be described for the case of shooting a moving image. In the image and sound recording apparatus
10, an object scene image is formed on the imaging unit 14 by the optical lens system 12, and the
object scene image is photoelectrically converted by the imaging unit 14 to generate the electric
signal 102, which is an analog image signal. At the same time, the sound from the object scene is
collected by the left microphone 20 and the right microphone 22, and the analog left audio signal
108 and the analog right audio signal 110 are generated, respectively. The analog image signal 102
is supplied to the analog-to-digital converter 15, and the analog left audio signal 108 and the
analog right audio signal 110 are supplied to the audio signal preprocessing unit 24. The analog
image signal 102 is A / D converted by the analog-to-digital converter 15 to generate the digital
image signal 103, which is output to the signal processing unit 16. The digital image signal 103
is subjected to digital signal processing in the signal processing unit 16 to detect a motion
vector. The motion vector may be detected, for example, by taking an inter-frame difference from a
previous image signal, but it is preferable to shift the two images relative to each other so as
to minimize the variance or norm of their difference. Advantageously, motion vectors can also be
determined effectively by using motion compensation.
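The shift-and-compare detection mentioned above can be illustrated with a minimal block-matching sketch. It is not taken from the patent: the function names, the use of the sum of absolute differences as the norm, the search range, and the test frames are all assumptions chosen for illustration. Frames are plain lists of rows of grayscale values, and a positive dx means the pattern moved to the right.

```python
def block_cost(prev, curr, top, left, size, dx, dy):
    """Sum of absolute differences (an L1 norm) between a block of the current
    frame and the block of the previous frame displaced by (-dx, -dy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(curr[top + y][left + x] - prev[top + y - dy][left + x - dx])
    return total


def block_motion_vector(prev, curr, top, left, size, search):
    """Return the (dx, dy) within +/-search pixels that minimises block_cost.
    Ties are broken in favour of the smaller displacement."""
    height, width = len(prev), len(prev[0])
    best_key, best_vec = None, (0, 0)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            src_top, src_left = top - dy, left - dx
            # Skip candidates whose source block would leave the previous frame.
            if src_top < 0 or src_left < 0:
                continue
            if src_top + size > height or src_left + size > width:
                continue
            key = (block_cost(prev, curr, top, left, size, dx, dy), abs(dx) + abs(dy))
            if best_key is None or key < best_key:
                best_key, best_vec = key, (dx, dy)
    return best_vec


def frame_with_square(width, height, row, col):
    """Hypothetical test frame: dark background with a bright 2x2 square."""
    frame = [[0] * width for _ in range(height)]
    for y in range(row, row + 2):
        for x in range(col, col + 2):
            frame[y][x] = 255
    return frame


previous = frame_with_square(8, 8, 3, 2)
current = frame_with_square(8, 8, 3, 4)  # the square moved 2 pixels to the right
vector = block_motion_vector(previous, current, top=2, left=3, size=4, search=2)
print(vector)
```

Running the search over several blocks of the frame would give one vector per image part, which is the kind of input the audio information calculation unit 18 is described as using.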
These processes require not only the current image but also a past image, that is, the image of
the immediately preceding frame or of several frames before, so the memory buffer 27 provided in
the signal processing unit 16 may store the past digital image signal. In the signal processing
unit 16, an information signal 104 representing the detected motion vector is also generated and
supplied to the audio information calculation unit 18. The audio information calculation unit 18
uses the motion vector represented by the information signal 104 to determine the dominant moving
direction in the image, and determines an adjustment amount that changes the volume balance
according to that direction. For example, the horizontal component in each part of the image can
be extracted from the motion vector, and the direction toward which the movement is biased can be
determined from the position of each part in the image and the direction and amount of its
horizontal component. In this embodiment, in order to express the sense of presence through
changes in the left and right volume, the left and right volume balance is changed according to
the dominant moving direction and its amount, as follows. As shown in FIG. 2, when the subject 32
is stationary and it is determined that the motion vector of the image 30 has no dominant moving
direction, the left volume 34 and the right volume 36 have the same magnitude, as shown in FIG. 3,
i.e. the normal state. However, as shown in FIG. 4, when the subject 32 moves in the right
direction 202 from the center of the image 30 and it is determined that the motion vector of the
image 30 indicates movement in the right direction, the right volume 36 is made larger than the
left volume 34, as shown in FIG. 5. Conversely, as shown in FIG. 6, when the subject 32 moves in
the left direction 204 from the center of the image 30 and it is determined that the motion vector
of the image 30 indicates movement in the left direction, the left volume 34 is made larger than
the right volume 36, as shown in FIG. 7. When the subject 32 moves as shown in FIG. 4 or FIG. 6,
the volume on the side opposite to the moving direction may also be reduced. Changing the left and
right volumes 34 and 36 gradually makes the result more realistic. Furthermore, when the subject
32 goes from moving to stationary and it is determined that the dominant movement in the motion
vector of the image 30 decreases and disappears, the left and right volumes 34 and 36 may be
gradually returned to the normal state.
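As a rough sketch of the behaviour described above, the following shows one way a dominant horizontal direction could be turned into left and right gains and then approached gradually. The patent does not specify formulas: averaging the horizontal components (ignoring the position of each part), the threshold, max_dx, depth and max_step values, and all function names are assumptions made for illustration only.

```python
def dominant_horizontal_motion(vectors, threshold=1.0):
    """Average the horizontal components of per-part motion vectors (dx, dy).
    Returns 0.0 when the average is too small to count as a dominant direction."""
    if not vectors:
        return 0.0
    mean_dx = sum(dx for dx, _ in vectors) / len(vectors)
    return mean_dx if abs(mean_dx) >= threshold else 0.0


def target_gains(mean_dx, max_dx=8.0, depth=0.5):
    """Map the dominant motion to (left_gain, right_gain).
    Rightward motion raises the right gain and lowers the left gain."""
    balance = max(-1.0, min(1.0, mean_dx / max_dx))
    return 1.0 - depth * balance, 1.0 + depth * balance


def step_toward(current, target, max_step=0.05):
    """Move a gain toward its target by at most max_step per frame,
    so that the volume balance changes gradually."""
    delta = target - current
    if abs(delta) <= max_step:
        return target
    return current + max_step if delta > 0 else current - max_step


# Hypothetical per-part vectors for a subject moving to the right.
vectors = [(4, 0), (3, 1), (5, -1)]
left_target, right_target = target_gains(dominant_horizontal_motion(vectors))
print(left_target, right_target)
```

With no dominant motion, target_gains returns equal gains, which corresponds to the normal state of FIG. 3; calling step_toward once per frame also covers the gradual return to that state when the subject stops.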
The audio information calculation unit 18 determines the adjustment amount so as to change the
left and right volume as described above, generates an information signal 106 indicating the
adjustment amount, and supplies it to the audio signal preprocessing unit 24. In the audio signal
preprocessing unit 24, the left gain of the left audio signal amplifier 21 and the right gain of
the right audio signal amplifier 23 are adjusted according to the adjustment amounts represented
by the information signal 106. The analog left audio signal 108 and the analog right audio signal
110 are amplified or attenuated by these left and right gains, respectively, so that their balance
is adjusted, and the left audio signal 109 and the right audio signal 111 are generated. The left
and right audio signals 109 and 111 are supplied to the analog-to-digital converter 25 and
subjected to A / D conversion processing to generate the digital left audio signal 112 and the
digital right audio signal 114, respectively. These left and right digital audio signals 112 and
114 are then supplied to the signal processing unit 16. The digital image signal 103, the digital
left audio signal 112, and the digital right audio signal 114 are further subjected to digital
signal processing in the signal processing unit 16, and in the present embodiment the recording
signal 116, which can be recorded on the information recording medium 26, is generated and
recorded on the information recording medium 26. As described above, according to the present
invention, when capturing a moving image, the image and sound recording apparatus can generate an
audio signal with an enhanced sense of movement by changing the left and right volume balance in
accordance with the motion vector of the image. The present device is effectively applied to, for
example, a video camera, and by recording this sound in stereo it can record realistic images and
sounds. Advantageously, by determining the dominant left or right motion of the image from the
horizontal component of the motion vector of the image signal and adjusting the left and right
audio signals according to the result of the determination, it is possible to generate an audio
signal in which the feeling of left and right movement is emphasized. A more realistic sound can
be recorded by adjusting the left and right volume so that it changes gradually. Further, the
image and sound recorded in this way can convey a sense of presence even when played back on a
reproduction device that cannot itself express a sense of presence. The present invention can be
effectively applied not only to complete moving images but also to recording sound accompanying a
plurality of continuous still images, such as still moving images (still movies) and images taken
by continuous shooting.
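The gain adjustment of the left and right audio signals can also be pictured on sampled audio. In the embodiment the gains are applied by analog amplifiers 21 and 23 before A / D conversion; the sketch below is a digital stand-in under that stated difference, and the function names and the linear ramp between the previous and the new gain are assumptions, not the patent's method.

```python
def apply_gain_ramp(samples, start_gain, end_gain):
    """Scale samples with a gain that moves linearly from start_gain to end_gain,
    avoiding an audible jump when the balance changes between blocks."""
    count = len(samples)
    if count == 0:
        return []
    if count == 1:
        return [samples[0] * end_gain]
    return [
        sample * (start_gain + (end_gain - start_gain) * index / (count - 1))
        for index, sample in enumerate(samples)
    ]


def balance_stereo_block(left, right, previous_gains, next_gains):
    """Apply (left_gain, right_gain) ramps to one block of stereo samples."""
    return (
        apply_gain_ramp(left, previous_gains[0], next_gains[0]),
        apply_gain_ramp(right, previous_gains[1], next_gains[1]),
    )


# Hypothetical block: the balance shifts toward the right channel.
left_out, right_out = balance_stereo_block(
    [2.0, 2.0], [2.0, 2.0], previous_gains=(1.0, 1.0), next_gains=(0.5, 1.5)
)
print(left_out, right_out)
```

Ramping within each block is one simple way to realise the gradual change of the left and right volume; the block length and ramp shape would need tuning on real recordings.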
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram showing an embodiment of an image and sound recording apparatus
according to the present invention.
FIG. 2 is an explanatory view showing an example of an object scene image determined to have no
dominant moving direction in the image and sound recording apparatus of the embodiment shown in
FIG. 1.
FIG. 3 is an explanatory view corresponding to FIG. 2 showing the left and right volume balance
when it is determined that there is no dominant moving direction in the image in the image and
sound recording apparatus of the embodiment shown in FIG. 1.
FIG. 4 is an explanatory view showing an example of an object scene image in which it is
determined that the image is moving in the right direction in the image and sound recording
apparatus of the embodiment shown in FIG. 1.
FIG. 5 is an explanatory view corresponding to FIG. 4 showing the left and right volume balance
when it is determined that the image is moving in the right direction in the image and sound
recording apparatus of the embodiment shown in FIG. 1.
FIG. 6 is an explanatory view showing an example of an object scene image in which it is
determined that the image is moving in the left direction in the image and sound recording
apparatus of the embodiment shown in FIG. 1.
FIG. 7 is an explanatory view corresponding to FIG. 6 showing the left and right volume balance
when it is determined that the image is moving in the left direction in the image and sound
recording apparatus of the embodiment shown in FIG. 1.
[Description of the code]
10 image / sound recording device
12 optical lens system
14 imaging unit
15 analog-to-digital converter
16 signal processing unit
17 motion vector detection unit
18 audio information calculation unit
20 left microphone
21 left audio signal amplifier
22 right microphone
23 right audio signal amplifier
24 audio signal preprocessing unit
25 analog-to-digital converter
26 information recording medium
27 memory buffer