close

Вход

Забыли?

вход по аккаунту

?

JP2010166534

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2010166534
To provide a theater system capable of automatically performing lip sync correction and time
alignment even when the topology of the surround system is changed. A topology detection unit
(19) inside an AV controller (2) detects the topology of a surround system and records it as
topology information in an audio / video correction value memory (18). A video signal delay unit
(13) and an audio processing unit (14) When the topology is changed, lip-sync correction and
time alignment are performed by adjusting the delay amount of the video signal and the audio
signal according to the topology information. [Selected figure] Figure 1
Control device, audio output device, theater system, control method and program
[0001]
The present invention relates to, for example, a control device, an audio output device, a theater
system, a control method, and a program, which reproduce video and audio signals to be output
synchronously.
[0002]
The surround system has more channels (multi-channel) than 2 channels (stereo sound), and can
reproduce the details of the sound image and the atmosphere of the sound field more faithfully
at the listening position (listening point) Refers to an acoustic system.
FIG. 17 shows a configuration diagram of the 5.1 ch surround system. As shown in FIG. 17, the
09-05-2019
1
5.1ch surround system is composed of six speakers: front speakers L and R, center speakers C,
surround speakers SL and SR, and subwoofer SW. The front speakers L · R are disposed on the
front left and right of the listening position (the position indicated by the broken line with the
microphone 1010). Further, the center speaker C is disposed at the front center of the listening
position, and the surround speakers SL and SR are disposed at the rear left and right of the
listening position. The subwoofer SW is a speaker that produces a bass effect. By outputting a
defined sound for each speaker, it is possible to reproduce a more realistic three-dimensional
sound than stereo. The source device 1001 as a source source of the surround system includes a
DVD audio, a super audio CD (SACD), a terrestrial digital radio, and the like.
[0003]
Further, as described in Patent Document 1 and Patent Document 2, there is also a configuration
example in which the speakers are connected in a daisy chain or tree topology (connection form)
using an IEEE 1394 network or the like as an interface. Constructing a surround system with
such a topology has an advantage that wiring can be made simpler, such as improvement in the
degree of freedom in cable connection and reduction in power wiring.
[0004]
Since the surround system is composed of a plurality of speakers as described above, the
reproduction sound field at the listening position is optimum depending on the difference in
distance between the listening position and each speaker, the shape of the room, the presence or
absence of an obstacle, etc. Many cases do not become. Therefore, there is one that has a time
alignment function to correct the reproduction sound field before performing the sound
reproduction.
[0005]
Generally, the time alignment function is provided to the AV controller. The time alignment will
be described with reference to FIG. When performing time alignment, first, the AV controller
1002 sequentially supplies and outputs test signals for sound field correction to the speakers (C,
L, R, SL, SR) connected to the AV controller 1002. And the sound output from each speaker (C, L,
R, SL, SR) is collected using the microphone 1010 installed in the listening position. Then, the
microphone 1010 feeds back the collected audio signal to the AV controller 1002. By the above
09-05-2019
2
operation, the AV controller 1002 can grasp for each speaker the time until the audio signal
output from the own device is output from each speaker and is propagated to the listening
position (microphone 1010). Therefore, the AV controller 1002 can perform so-called time
alignment which adjusts the delay amount of the audio signal supplied to each speaker so that
the sound output from each speaker is simultaneously transmitted to the listening position.
[0006]
In recent years, home theater systems combining a surround system and a television receiver
such as a liquid crystal television or a plasma television have become widespread. Along with
such image quality improvement of video signals, processing load such as scanning line number
conversion and image quality enhancement processing in a television receiver increases, and the
time taken for video signal processing becomes longer than audio signal processing. As a result,
there is a problem that the video and audio are out of sync. For this reason, recent television
receivers are provided with a so-called lip sync correction function for delaying the audio output
from the built-in speaker of the television receiver according to the time required for video signal
processing.
[0007]
Here, as shown in FIG. 18, a home theater system is constructed in which the video signal and
the audio signal from the source device 1001 are reproduced by the television receiver 1003 and
the multi-channel speakers (C, L, R, SL, SR, SW) Think about the case. In this case, since the
television receiver 1003 and the external speakers (C, L, R, SL, SR, SW) operate independently,
the external speakers take time for the video signal processing performed in the television
receiver 1003. I can not grasp. Therefore, lip-sync correction can not be performed, and the
video and audio are out of synchronization.
[0008]
Patent document 3 is mentioned as a prior art which solves the above-mentioned problem. The
lip-sync correction of the home theater system described in Patent Document 3 will be described
below with reference to FIG. First, the AV controller 1002 supplies a test signal for correction to
external speakers (C, L, R, SL, SR, SW) connected to its own device and the television receiver
1003 built-in speaker 1011 and sequentially Make it output. And the sound which each speaker
09-05-2019
3
output is collected using the microphone 1010 installed in the listening position. By the above
operation, the AV controller 1002 can grasp for each speaker the time taken for the test signal
supplied to the speaker to be output from the speaker and propagated to the listening position
(the installation position of the microphone 1010). As described above, the built-in speaker 1011
incorporated in the television receiver delays the supplied audio signal by the time required for
the video signal processing, and then performs the output. Therefore, lip-sync correction and
time alignment can be performed by adjusting the output timing of the external speakers (C, L, R,
SL, SR, SW) in accordance with the output timing of the television receiver built-in speaker 1011.
Hereinafter, the above-described conventional lip-sync correction and time alignment are
referred to as initial lip-sync correction. JP-A-2002-199487 JP-A-2003-61186 JP-A-2007329633
[0009]
However, in the case where external speakers are connected by daisy chain connection or tree
connection as described in Patent Document 1 and Patent Document 2 described above, there
are problems described below.
[0010]
FIG. 19 shows a configuration example of a home theater system in which external speakers are
connected in a daisy chain.
In the surround system shown in FIG. 19, according to the topology of the external speakers (C,
L, R, SL, SR, SW) on the network, the audio signal from the AV controller 1002 is propagated to
each speaker The time (hereinafter, voice transmission time) changes. This will be described with
reference to FIGS. 19A and 19B as an example.
[0011]
In the topology (A) shown in FIG. 19A, the audio signal output from the AV controller 1002
passes through two speakers before propagating to the surround speakers SL and SR which are
end speakers. On the other hand, in the topology (B) shown in FIG. 19B, the audio signal supplied
by the AV controller 1002 passes through five speakers before propagating to the subwoofer SW
which is the terminal speaker. That is, when the topology (A) and the topology (B) are compared,
the topology (B) passes through three more speakers before the audio signal reaches the end
09-05-2019
4
speaker. That is, in the topology (B), the audio transmission time taken to the end speaker is
longer.
[0012]
Here, a case is considered where initial lip-sync correction (lip-sync correction and time
alignment) is performed using the method of Patent Document 3 in the surround system of
topology (A). Then, after performing the initial lip-sync correction, it is assumed that only the
topology is changed from the topology (A) to the topology (B) without changing the position of
the speaker. Then, as described above, since the audio transmission time of each speaker
changes, the audio and video, and the audio are out of synchronization with each other. The
above problems become more pronounced as the number of speaker channels increases to 7.1ch
and 9.1ch. Further, it becomes more remarkable when an interface for digitally modulating and
transmitting an audio signal and performing digital modulation / demodulation every time it
passes through a node is adopted.
[0013]
In a surround system in which external speakers are connected in a daisy chain shape or tree
shape, it may be considered that there is a case where the topology is changed without changing
the positions of the speakers as described above due to room remodeling. For example, although
the user was enjoying the video appreciation with the home theater system of the topology (B)
shown in FIG. 19, the sofa was placed at the listening position, and the cable 2001 became an
obstacle. Therefore, in this case, the connection form of the external speakers is changed to the
topology (A).
[0014]
In such a case, in the prior art, each time the user changes the topology of the speakers, it is
necessary to install a microphone at the listening position and perform initial lip-sync correction
such as emitting a test signal from each speaker. , Very laborious. In addition, since the position
of the speaker is not changed depending on the user, it may be considered that the sound field is
not changed only by the change of the topology, and the user may make an erroneous judgment.
09-05-2019
5
[0015]
In view of the above problems, according to the present invention, even if the topology of the
audio output device is changed after the initial lip-sync correction, lip-sync correction can be
automatically performed by estimating the audio transmission time to the audio output device
and correcting it appropriately. The purpose is to enable time alignment.
[0016]
In order to achieve the above object, the present invention has the following configuration.
A control device connected to a plurality of audio output devices and a video display device,
outputting a video signal to the video display device, and outputting an audio signal to the audio
output device, the audio output device comprising And detecting means for detecting the
connection form, and audio delay means for delaying the audio signal to be output to the audio
output device based on the connection form detected by the detection means.
[0017]
According to the present invention, even when the topology of the audio output device is
changed after the initial lip-sync correction, lip-sync correction and time alignment can be
performed automatically. Therefore, it is possible to improve the convenience by saving the
trouble of the user consciously performing the correction.
[0018]
Hereinafter, an embodiment of a home theater system proposed by the present invention will be
described with reference to the drawings.
[0019]
First Embodiment FIG. 1 is a block diagram of a home theater system according to the present
embodiment.
09-05-2019
6
The home theater system of FIG. 1 includes external speakers (C, L, R, SL, SR, SW) which are a
plurality of audio output devices independent of the source device 1, the AV controller 2, the
television set 3, and the television set 3. And consists of. At this time, the external speakers (C, L,
R, SL, SR, SW) are daisy-chained as shown in FIG. Further, a microphone 10 for sound field
correction can be connected to the AV controller 2. The daisy chain connection is a connection
form in which devices are connected in series, and if focusing on the topology, it can be regarded
as a branch-free tree connection.
[0020]
The source device 1 is a video and audio reproduction device such as a DVD player and a home
server. The video and audio signals output from the source device 1 pass through the AV
controller 2, the video signal to the television receiver 3, and the audio signal to external
speakers (C, L, R, SL, SR, SW). It is transmitted. The AV controller 2 is an audio / video control
device that totally controls an audio signal and a video signal, and an AV amplifier or the like is a
typical example.
[0021]
The configuration of the AV controller 2 will be described. The I / F 12 inside the AV controller 2
separates the video and audio signals transmitted from the source device 1 into video and audio
signals. Here, the I / F 12 is an interface such as HDMI.
[0022]
The video signal output from the I / F 12 is output to the television receiver 3 through the video
signal delay unit 13. On the other hand, the audio signal output from the I / F 12 is subjected to
signal processing such as adjustment of the delay amount in the audio processing unit 14, and
then passed through the I / F 15 to external speakers (C, L, R, SL, SR , SW). That is, the audio
processing unit 14 also has the function of an audio signal delay unit. Although details will be
described later, the lip-sync correction and time alignment in the home theater system of FIG. 1
will be briefly described below.
[0023]
09-05-2019
7
First, the test signal generation unit 16 in the AV controller 2 generates and outputs a test signal
for correction. The output test signal is sequentially supplied to the external speakers (C, L, R, SL,
SR, SW) and the television receiver 3 built-in speaker 11 through the audio processing unit 14.
Then, the external speakers (C, L, R, SL, SR, SW) and the television receiver 3 built-in speaker 11
sequentially reproduce the supplied test signals. The reproduced sound is collected using the
microphone 10 installed at the listening position, and is fed back to the sound field analysis unit
17 as a response signal. The sound field analysis unit 17 analyzes the input response signal, and
stores the analysis result in the audio / video correction value memory 18 as delay time
information. The audio processing unit 14 reads the delay time information stored in the audio /
video correction value memory 18 and adjusts the delay amount of the audio signal. As described
above, in the present embodiment, lip-sync correction and time alignment are performed by
providing a feedback system for audio signals. Hereinafter, the above-described lip-sync
correction and time alignment processing will be referred to as initial lip-sync correction.
[0024]
Further, the topology detection unit 19 detects the topology on the network of the speakers
constituting the surround system at an arbitrary timing. Then, it is updated and stored in the
audio / video correction value memory 18 as arbitrary topology information or at every arbitrary
timing or when there is a change in connection configuration. The topology detection method
will be described later with reference to FIGS. 4 and 5.
[0025]
Although the test signal generating unit 16 is provided in the AV controller 2 in the present
embodiment, the present invention is not limited to this form. For example, it may be provided
outside the AV controller 2 or may be reproduced from the source device 1 using a DVD medium
or the like on which a test signal is recorded.
[0026]
Further, in the present embodiment, the explanation will be made as a 5.1ch surround system,
but the present invention is applicable to multichannels such as 7.1ch and 9.1c, and the number
09-05-2019
8
of channels is not particularly limited. Further, in the present embodiment, two are provided as
output ports from the AV controller 2 to the external speaker, but it is not particularly limited
because it is sufficient to have one or more and less than the number of channels.
[0027]
<Propagation of Audio Signal> Next, a method of propagating an audio signal in the surround
system of the present embodiment will be described. First, the I / F 15 of the AV controller 2 will
be described. The I / F 15 is composed of the modulation unit 20 and the demodulation unit 21
as shown in FIG. The modulation unit 20 performs predetermined modulation on the voice signal
from the voice processing unit 14 or the topology diagnostic signal from the topology detection
unit 19 described later, and outputs the signal to external speakers (C, L, R, SL, SR, SW). . Further,
the demodulation unit 21 performs predetermined demodulation on a topology notification
signal from each speaker to be described later, and outputs the signal to the topology detection
unit 19.
[0028]
The modulation / demodulation method applied by the modulation unit 20 and the demodulation
unit 21 is not particularly limited. Any modulation / demodulation scheme may be used as long
as it is a modulation / demodulation scheme capable of voice transmission. In FIG. 1, the AV
controller 2 and the external speaker, or the external speaker and the external speaker are
connected by one cable, but this is not a limitation, and two or more cables are used. Also good.
[0029]
Next, the configuration and operation of the external speakers (C, L, R, SL, SR, SW) will be
described. FIG. 2 is a block diagram of an external speaker. The I / F 22, which is an interface
unit of external speakers (C, L, R, SL, SR, SW), includes a modulator 23, a modulator 25, a
demodulator 24, and a demodulator 26. The demodulation unit 24 performs predetermined
demodulation on the audio signal or the topology diagnostic signal from the AV controller 2 or
the upstream external speaker, and outputs the audio signal to the own channel extraction unit
28 and the topology diagnostic signal to the topology notification unit 27. Here, it is assumed
that the audio signal has a configuration in which data of all CHs is included in one frame as
shown in FIG. The own channel extraction unit 28 extracts only the audio signal of the own
09-05-2019
9
channel from the input audio signal, and outputs the input audio signal to the modulation unit 25
as it is. Then, the modulation unit 25 performs predetermined modulation on the input audio
signal, and outputs it to the downstream external speaker. The audio signal of the own channel
extracted by the own channel extraction unit 28 is converted into an analog signal by the DAC
30, the analog signal is amplified by the AMP 31, and the sound is emitted as sound by the
speaker 32.
[0030]
In the present embodiment, it is assumed that the sound signal is propagated as described above.
In the present embodiment, it is assumed that an ID (hereinafter referred to as role ID
information) indicating the role of the own CH is held in advance in the role notification unit 33
in each speaker as information unique to the device. Here, the role means C, L, R, SL, SR, SW. The
own channel extraction unit 28 and the topology notification unit 27 described later grasp the
role of the own channel by reading the role ID information stored in the role notification unit 33,
for example, at the time of system activation.
[0031]
Next, the operation of the topology notification unit 27 of FIG. 2 will be described. The topology
notification unit 27 receives the topology diagnosis signal from the topology detection unit 19
and sends a topology notification signal, which is connection form information including the role
ID information of the own machine and information on the network configuration, to the
modulation unit 23 as a new topology. The diagnostic signal is output to the modulation unit 25
respectively. The details of the topology notification signal and the topology diagnostic signal will
be described later with specific examples. The modulation unit 23 having received the topology
notification signal performs predetermined modulation and outputs the signal to the upstream
external speaker or the AV controller 2. Further, the modulation unit 25 having received the
topology diagnostic signal performs predetermined modulation and outputs the signal to the
downstream external speaker.
[0032]
The demodulation unit 26 demodulates the topology communication signal from the downstream
external speaker and outputs the result to the topology notification unit 27. The topology
09-05-2019
10
notification unit 27 that has received the topology notification signal outputs the topology
notification signal to the modulation unit 23 as it is. At this time, the topology notification signal
of the own device may be added to the topology notification signal from the downstream external
speaker and output.
[0033]
Next, a specific example of topology detection will be described using FIGS. 4 and 5. FIG. 4 shows
an example of the topology diagnostic signal and the topology notification signal. The topology
diagnosis signal is configured by 3 bits as shown in FIG. 4, and includes topology information
(hereinafter referred to as stage number information) indicating the number of stages of the
hierarchy from the AV controller 2. On the other hand, the topology notification signal is
configured by 6 bits, and is configured by adding 3 bits of role ID information indicating the role
of the own CH to 3 bits including stage number information.
[0034]
A topology detection method using the topology diagnostic signal and the topology notification
signal of FIG. 4 will be described with reference to FIG. First, the topology diagnosis signal 000 is
output from the AV controller 2. The center speaker C and the subwoofer SW receiving the
topology diagnostic signal 000 transfer 001 obtained by adding 1 to the number-of-stages
information to the downstream front speaker L and front speaker R as a new topology diagnostic
signal 001. Similarly, the front speaker L and the front speaker R having received the topology
diagnostic signal 001 transfer 010 obtained by adding 1 to the stage number address as the new
topology diagnostic signal 010 to the surround speaker SL and the surround speaker SR.
[0035]
Here, it is assumed that 3-bit role ID information indicating the role of the own channel is added
to each speaker in advance. As described above, the role ID information is stored in the role
notification unit 33 inside the external speaker. Here, as shown in FIG. 5, role IDs of 000, 001,
010, 011, 100, and 101 are set for the center speaker C, front speaker L, front speaker R,
surround speaker SL, surround speaker SR, and subwoofer SW, respectively. It is done. The
external speaker receiving the topology diagnostic signal outputs a topology notification signal in
which the role ID information of the own channel is added to the topology diagnostic signal to
09-05-2019
11
the upstream external speaker or the AV controller 2. Also, the topology notification signal from
the downstream external speaker is output to the upstream external speaker or the AV controller
2 as it is or by adding its own topology notification signal. As described above, each speaker can
transmit a topology notification signal including the role ID information of the own CH and the
topology information (stage number information) of the own CH to the AV controller 2. Then, the
topology detection unit 19 of the AV controller 2 that receives the topology notification signal
from each external speaker grasps the topology of the external speakers (C, L, R, SL, SR, and SW)
constituting the surround system on the network can do. The audio / video correction value
memory 18 updates and records the topology information grasped by the topology detection unit
19 every arbitrary timing or when there is a change in connection configuration.
[0036]
The above-described topology detection method is an example, and another method may be used.
Also, the configurations of the topology diagnostic signal and the topology notification signal
may be changed as needed.
[0037]
<Initial Lip-Sync Correction Method> Next, in the present embodiment, a method of initial lipsync correction and time alignment when the home theater system is installed will be described.
As described above, the means of this initial correction is hereinafter referred to as initial lip-sync
correction.
[0038]
First, when the initial lip-sync correction is performed, the microphone 10 is placed at the
listening position as shown in FIG. The microphone 10 is connected to the sound field analysis
unit 17 of the AV control 2, and the sound field analysis unit 17 converts the sound signal
collected by the microphone 10 into a digital signal and performs signal analysis.
[0039]
09-05-2019
12
When performing the initial lip-sync correction, first, the test signal generation unit 16 in the AV
controller 2 generates a test signal for correction and outputs the test signal to the audio
processing unit 14. At this time, the test signal generation unit 16 outputs time information
(hereinafter, output timing information) regarding the timing at which the test signal is output to
the sound field analysis unit 17. The audio processing unit 14 first supplies a test signal to the
television receiver 3. The television receiver 3 having received the test signal delays the audio
signal by the time required for video signal processing inside the television receiver 3 and then
outputs it from the built-in speaker 11.
[0040]
The sound output from the television receiver 3 propagates in space and is collected using the
microphone 10. Then, an audio signal collected by the microphone 10 is output to the sound
field analysis unit 17 as a response signal. Upon receiving the response signal, the sound field
analysis unit 17 compares the time difference between the timing at which the response signal is
input and the output timing of the test signal. Then, the voice delay time (voice transmission time
up to the built-in speaker of the television receiver 3 + space propagation time until voice output
from the speaker is propagated to the microphone) is grasped.
[0041]
Similarly, the test signal is supplied to the external speakers (C, L, R, SL, SR, SW), and the
microphone 10 collects the sound emitted from the external speakers. Then, the response signal
is received and analyzed by the sound field analysis unit 17 to understand the voice delay time
(voice transmission time to each external speaker + space propagation time until voice output
from the speaker is propagated to the microphone) Do.
[0042]
The sound field analysis unit 17 converts the information related to the audio delay time of the
television receiver built-in speaker 11 and the external speakers (C, L, R, SL, SR, SW) grasped by
the above operation (hereinafter referred to as delay time information) Write to the image
correction memory 18 Then, based on the delay time information written in the audio / video
correction memory 18, the audio processing unit 14 delays the delay until the audio signal is
supplied to the external speaker in accordance with the audio delay time of the television
09-05-2019
13
receiver built-in speaker 11. Adjust the amount. Thereafter, the built-in speaker 11 of the
television receiver 3 is controlled to mute.
[0043]
By performing the above correction, in the present embodiment, initial lip-sync correction (lipsync correction and time alignment) is performed.
[0044]
In the present embodiment, the above-described method is used as the initial lip-sync correction
method, but this method is not specific, and lip-sync correction and time alignment may be used
by another method.
[0045]
<Rip-Sync Correction and Time Alignment Corresponding to Topology Change> Next, a lip-sync
correction method in the case where the topology of the external speakers constituting the
surround system is changed after the above-described initial lip-sync correction process is
performed will be described.
Here, as a specific example, a case where the initial lip-sync correction is performed in the
topology (A) shown in FIG. 6 (A) and then the topology (B) shown in FIG. 6 (B) is changed will be
described.
[0046]
When the topology is changed from the topology (A) to the topology (B), the number of stages of
the front speakers R, the surround speakers SR, and the subwoofer SW via the speakers from the
AV controller changes.
Therefore, the voice transmission time will also change.
[0047]
09-05-2019
14
FIG. 7 illustrates changes in voice transmission time in external speakers (C, L, R, SL, SR, SW)
caused by a change in topology. Here, it is assumed that the audio signal and the video signal
shown in FIG. 7 are signals to be simultaneously propagated to the listening position at
synchronous timing.
[0048]
Before changing the topology, that is, in the state of the topology (A), since the initial lip-sync
correction is performed, as shown in FIG. 7A, each audio signal and video signal are
simultaneously at listening positions at synchronous timing. It is being propagated. However, if
the topology (B) is changed in this state, the number of external speakers through which the
audio information passes increases. Therefore, as shown in FIG. 7B, a delay (a shift in sound
transmission time) of Td occurs in the surround speaker SR, 3Td in the front speaker R, and 5Td
in the subwoofer. Here, Td is a delay time that occurs when an audio signal passes through one
speaker, that is, a time taken for modulation / demodulation processing. In addition, although the
delay in a cable also changes in fact, since it is minute compared with the time concerning a
modulation-demodulation process, it disregards in this embodiment.
[0049]
When the topology of the external speakers constituting the surround system is changed from
the topology (A) to the topology (B) as described above, a delay of Td occurs in the surround
speaker SR. The reason is that the number of vias of the speakers (hereinafter, the number of
vias) between the AV controller 2 and the surround speakers SR is increased by one. Similarly, as
the front speaker R has three stages, the subwoofer SW has five stages, and the number of relay
stages increases with the topology change, delays of 3Td and 5Td respectively occur as shown in
FIG. 7B.
[0050]
As described above, due to the change in the topology of the external speakers, a deviation
occurs in the audio transmission time in each external speaker. As a result, the video and the
audio and the audio are out of synchronization with each other. However, the AV controller 2
09-05-2019
15
grasps in advance the change in topology and the delay time when the audio signal passes
through one speaker: Td in advance, and the audio processing unit 14 appropriately adjusts the
delay amount of the audio signal. Therefore, lip sync correction and time alignment
corresponding to the topology change can be performed. The method of lip-sync correction and
time alignment corresponding to the topology change will be described below.
[0051]
As described above, the topology detection unit 19 of FIG. 1 detects the topology of the external
speaker at an arbitrary timing. Here, the arbitrary timing is an arbitrary period at the time of
system startup, at the start of sound reproduction, or during sound reproduction, and is not
particularly specified if a change in topology can be detected.
[0052]
As described above, the topology detection unit 19 outputs the topology diagnosis signal to each
external speaker at an arbitrary timing, and detects the topology of the external speaker by
receiving the topology notification signal from the external speaker. The topology information
detected by the topology detection unit 19 is stored in the audio / video correction memory 18 at
arbitrary timing or when there is a change in the topology. By this, it has a function as
connection form storage means.
[0053]
The voice processing unit 14 performs lip-sync correction and time alignment processing based
on the delay time information and the topology information described above. When the topology
of the external speaker is changed, the topology detection unit 19 stores the topology
information after the topology change in the audio / video correction value memory 18. The
audio / video correction value memory 18 calculates the difference between the stored topology
information before and after the topology change (hereinafter, topology change information) and
stores it as topology change information. The audio processing unit 14 estimates a change in
audio transmission time of each speaker based on the topology change information, and adjusts
the delay amount of the audio signal based on the estimated value. By the above-described
operation, the synchronization deviation between the voices caused by the topology change is
corrected. At the same time, the video signal delay unit 13 adjusts the delay amount of the video
09-05-2019
16
signal based on the topology change information of the audio / video correction memory so that
the video and audio synchronization does not occur. As described above, lip sync correction and
time alignment after topology change are performed.
[0054]
Hereinafter, a specific correction method will be described by taking a case where the topology
(A) of FIG. 6 is changed to the topology (B) as an example. FIG. 8 shows a specific example of the
topology information held in the audio / video correction memory 18. The information held in
the table 1 of FIG. 8 is stage number information of each speaker in the topology (A) when the
initial lip sync correction is performed. For example, since the center speaker C is directly
connected to the AV controller 2, the stage number information is held as 0. Further, since the
front speaker L is connected via the center speaker C, the step number information is held at 1.
As described above, the table 1 holds the number-of-steps information of each speaker when the
initial lip-sync correction is performed, and this information is not updated until the next initial
lip-sync correction.
[0055]
On the other hand, the topology information held in Table 2 is the number-of-stages information
of each speaker in the topology of the surround system at the present time, and this information
is updated every time the topology detection unit 19 performs topology detection. Then, the
table 3 holds the difference between the stage number information held in the table 1 and the
table 2 (topology change information).
[0056]
FIG. 8I is topology information held before the topology change, that is, in the audio / video
correction memory of the topology (A). In the table 1, stage number information of the external
speakers in the topology (A) subjected to the initial lip-sync correction is registered. Further, in
the table 2, stage number information of the same topology (A) as the table 1 is registered. Table
3 holds topology change information. Since no topology change has been made at this stage, all
0's are held in Table 3. The voice processing unit 14 reads in the topology change information of
Table 3 at an arbitrary timing. In FIG. 8I, since all the values held in Table 3 are 0, the audio
processing unit 14 considers that the topology of the external speaker has not been changed, and
09-05-2019
17
the correction value of the previous time is the value of the audio signal. Adjust the amount of
delay.
[0057]
When the topology is changed from the topology (A) to the topology (B), the topology
information is updated as shown in FIG. 8 (II). Although the topology information in the topology
(A) is held as it is in the table 1 of FIG. 8 (II), the topology information in the topology (B) after
the change is registered in the table 2. The table 3 holds the difference in the number of stages
information between the topology (A) and the topology (B).
[0058]
The voice processing unit 14 determines that the topology change has been performed because
the values held in the table 3 include things other than 0. By reading Table 3, the voice
processing unit 14 compares the current topology (topology (B)) with the topology (topology (A))
in which the initial lip sync was performed. As a result, it can be understood that the number of
passing stages of the surround speaker SR is increased by one step, the front speaker R by three
steps, and the subwoofer SW by five steps. The audio processing unit 14 performs correction
based on a value at which the number of relay stages increases most, in this example, five stages
of the subwoofer SW.
[0059]
The audio processing unit 14 adjusts the delay amount at the time of supplying the audio signal
to the external speaker so that the audio transmission time to all the speakers is delayed in the
subwoofer SW similarly to 5 Td. FIG. 7C shows the timing of audio and video after correction.
Specifically, the center speaker C delays 5 Td, the front speaker L 5 Td, the front speaker R 2 Td,
the surround speaker SL 5 Td, and the surround speaker SR 4 Td to transmit an audio signal. By
the correction operation using the correction value described above, as shown in FIG. 7C, the
audio signal is propagated to the listening position in synchronization with the synchronization
timing. At this time, the video signal delay unit 13 also adjusts the video signal to be delayed by
5Td as shown in FIG. 7C, and performs video display.
09-05-2019
18
[0060]
By performing the correction as described above, in the present embodiment, even when the
topology of the external speaker is changed after the initial lip-sync correction is performed, the
lip-sync correction can be automatically performed.
[0061]
Next, contrary to the above-described example, a case where the topology (B) of FIG. 6 is changed
to the topology (A) will be described.
A state of output timing of audio and video at this time is shown in FIG. 9 in the same format as
FIG. The topology information held in the audio / video correction memory is shown in FIG. 10 in
the same format as FIG.
[0062]
FIG. 10 (I) is topology information held before the topology change, that is, inside the audio /
video correction memory of the topology (B) as in FIG. 8 (I). In the table 1, stage number
information of the external speakers in the topology (B) subjected to the initial lip-sync
correction is registered. Further, in the table 2, stage number information of the same topology
(B) as that of the table 1 is registered. Table 3 holds topology change information. Since no
topology change has been made at this stage, all 0's are held in Table 3. Since all the values held
in the table 3 are 0, the audio processing unit 14 considers that the topology of the external
speaker is not changed, and adjusts the delay amount of the audio signal with the correction
value up to this point.
[0063]
When the topology is changed from the topology (B) to the topology (A), the topology
information is updated as shown in FIG. 10 (II). Although topology information in the topology
(B) is held as it is in the table 1 of FIG. 10 (II), topology information in the topology (A) after
change is registered in the table 2. The table 3 holds the difference in the number of stages
information between the topology (B) and the topology (A).
09-05-2019
19
[0064]
The voice processing unit 14 determines that the topology change has been performed because
the values held in the table 3 include things other than 0. The audio processing unit 14 reads the
table 3 and in the current topology (A), compared with the topology (B) where initial lip
synchronization was performed, the surround speaker SR is one stage, the front speaker R is
three stages, and the subwoofer SW is five stages , It can be seen that the number of passing
stages has decreased. Therefore, as shown in FIG. 9C, the audio processing unit 14 adjusts only
the difference value of the audio transmission time so as to delay each speaker in which the
difference has occurred. That is, by delaying the output of the surround speaker SR by Td, the
front speaker R by 3Td, and the subwoofer SW by 5Td, lip sync correction and time alignment
can be performed as shown in FIG. 9C.
[0065]
As described above, even if the topology of the speaker is changed after the initial lip-sync
correction, lip-sync correction and time alignment can be performed automatically without the
user's awareness, and convenience is improved. be able to.
[0066]
Second Embodiment In the first embodiment, the case where a home theater system is
constructed using a surround system in which external speakers are daisy-chained has been
described.
In this embodiment, a case where a home theater system is constructed using a surround system
in which external speakers are tree-connected and / or daisy-chain connected will be described.
[0067]
FIG. 11 is a block diagram of a home theater system in this embodiment. Similar to FIG. 1, the
home theater system of FIG. 11 includes an external speaker (C, L, R, SL, SR, SW) independent of
the source device 1, the AV controller 2, the television receiver 3, and the television receiver 3. It
consists of At this time, the external speakers (C, L, R, SL, SR, SW) are tree-connected as shown in
09-05-2019
20
FIG. Further, a microphone 10 for sound field correction can be connected to the AV controller 2.
[0068]
Here, since the source device 1, the AV controller 2, and the television receiver 3 have the same
configuration as in the first embodiment, detailed operations will be omitted. The difference from
the first embodiment is that the number of output ports owned by the external speaker is
increased from 1 to 2, and the number of other external speakers that can be connected as an
output destination is increased. In the present embodiment, the number of output ports of the
external speaker is limited to two, but two or more output ports may be possessed.
[0069]
<Propagation of Audio Signal> Next, the configuration and operation of the external speakers (C,
L, R, SL, SR, SW) will be described. FIG. 12 is a block diagram of external speakers (C, L, R, SL, SR,
SW) in the present embodiment. A different point from the external speaker configuration (FIG.
2) described in the first embodiment is that the downstream speaker has two output ports.
[0070]
The operations of blocks other than the modulation unit 33 and the demodulation unit 34 and
the propagation method of the audio signal are the same as in the first embodiment, and thus
detailed description will be omitted. The modulation unit 33 performs predetermined modulation
on the new topology diagnostic signal from the topology notification unit 27 and the audio signal
from the own channel extraction unit 28, and outputs the result to two (or one) external speakers
connected downstream. Do. Further, the demodulation unit 34 receives topology notification
signals from two (or one) external speakers connected downstream, performs predetermined
demodulation, and outputs the result to the topology notification unit 27.
[0071]
Next, a specific example of topology detection according to this embodiment will be described
09-05-2019
21
with reference to FIGS. 4 and 13. The topology diagnostic signal and the topology notification
signal have the configuration shown in FIG. 4 as in the first embodiment.
[0072]
A topology detection method using the topology diagnostic signal and the topology notification
signal of FIG. 4 will be described with reference to FIG. First, the AV controller 2 outputs a
topology diagnostic signal 000 to an external speaker. The front speaker L having received the
topology diagnostic signal 000 outputs 001 obtained by adding 1 to the number-of-stages
information as a new topology diagnostic signal 001 to the downstream center speaker C and the
surround speaker SL, which are external speakers. Similarly, the subwoofer SW having received
the topology diagnostic signal 000 outputs 001 obtained by adding 1 to the number-of-stages
information as a new topology diagnostic signal 001 to the front speaker R and the surround
speaker SR which are external speakers downstream.
[0073]
In addition, 3-bit role ID information indicating the role of the own channel is added to each
speaker. Here, it is assumed that role ID information of 000, 001, 010, 011, 100, and 101 is set
in advance for the center speaker C, front speaker L, front speaker R, surround speaker SL, and
surround speaker SR, respectively. . The external speaker receiving the topology diagnostic signal
outputs a topology notification signal in which the role ID information of the own channel is
added to the topology diagnostic signal to the upstream external speaker or the AV controller 2.
Further, the topology notification signal from the downstream external speaker is output to the
upstream external speaker or the AV controller 2 as it is or by adding the topology notification
signal of its own device. As described above, each external speaker can transmit to the AV
controller 2 a topology notification signal including the role ID information of the own CH and
the topology information (stage number information) of the own CH. Then, the topology
detection unit 19 of the AV controller 2 that has received the topology notification signal from
each external speaker can grasp the topology of the surround system.
[0074]
<Rip-Sync Correction Method after Topology Change> Next, a lip-sync correction method in the
case where the topology of the surround system is changed after the initial lip-sync correction
09-05-2019
22
process is performed in the home theater system of the present embodiment will be described.
The initial lip-sync correction method is the same as in the first embodiment, and thus the
description thereof is omitted.
[0075]
Here, as a specific example, a case will be described where the initial lip-sync correction is
performed in the topology (C) shown in FIG. 14 (C) and then the topology (D) shown in FIG. 14
(D) is changed. When the topology is changed from the topology (C) to the topology (D), the
number of transmission stages from the AV controller 2 changes in the center speaker C, the
front speaker L, the surround speaker SL, the surround speaker SR, and the subwoofer SW.
Therefore, the voice transmission time will also change.
[0076]
FIG. 15 illustrates the change in voice transmission time in the external speakers (C, L, R, SL, SR,
SW) caused by the change in topology. Before changing the topology, that is, in the state of
topology (C), since initial lip-sync correction is performed, as shown in FIG. 15 (a), each audio
data and video data are listened simultaneously at synchronous timing. Propagated to However,
when the topology (D) is changed in this state, as shown in FIG. 15 (b), a shift from the
synchronization timing before the topology change (topology (C)) occurs. Here, the center
speaker C is Td advanced (difference in audio transmission time, in other words, -Td delay), and
the front speaker L, surround speaker SL, and surround speaker SR are Td delayed, and the
subwoofer is 2Td delayed. It will occur. Here, as in the first embodiment, Td is a delay time that
occurs when an audio signal passes through one speaker. As described above, the topology
detection unit 19 outputs the topology diagnosis signal to each external speaker at an arbitrary
timing, and detects the topology of the external speaker by receiving the topology notification
signal from each speaker. The topology information detected by the topology detection unit 19 is
stored in the audio / video correction memory 18.
[0077]
FIG. 16 shows a specific example of the topology information held in the audio / video correction
memory. As in the first embodiment, the information held in the table 1 is the number-of-stages
information of each speaker in the topology (C) when the initial lip-sync correction is performed.
09-05-2019
23
Further, the topology information held in the table 2 is the number-of-stages information of
speakers in the topology of the surround system at the present time. The table 3 holds the
difference value (topology change information) of the stage number information held in the table
1 and the table 2.
[0078]
FIG. 16I is topology information held before the topology change, that is, in the audio / video
correction memory of the topology (C). In the table 1, stage number information of the external
speakers in the topology (C) subjected to the initial lip-sync correction is registered. Further, in
the table 2, stage number information of the same topology (C) as the table 1 is registered. Also,
topology change information is registered in Table 3. Since no topology change has been made at
this stage, all 0's are held in Table 3.
[0079]
As in the first embodiment, the voice processing unit 14 reads the topology change information
in the table 3 at an arbitrary timing. Since all the values held in Table 3 are 0, the audio
processing unit 14 considers that the topology of the external speaker is not changed, and
adjusts the delay amount of the audio signal with the correction value so far.
[0080]
When the topology is changed from the topology (C) to the topology (B), topology information is
updated as shown in FIG. 16 (II). Although the topology information in the topology (C) is held as
it is in the table 1 of FIG. 16 (II), the topology information in the topology (D) after the change is
registered in the table 2. The table 3 holds the difference in the number of stages information
between the topology (C) and the topology (D).
[0081]
The voice processing unit 14 determines that the topology change has been performed because
the values held in the table 3 include something other than 0. By reading Table 3, the voice
09-05-2019
24
processing unit 14 knows that the number of passing stages has increased in the current
topology (D) as compared with the topology (C) in which the initial lip synchronization was
performed. Here, it can be understood that the center speaker C has one stage, the front speaker
L has one stage, the surround speaker L has one stage, the surround speaker R has one stage, the
subwoofer SW has two stages, and the number of passing stages has increased. The audio
processing unit 14 performs correction based on a value at which the number of relay stages
increases most, in this example, two stages of the subwoofer SW.
[0082]
The audio processing unit 14 adjusts the delay amount at the time of supplying the audio signal
to the external speaker so that the audio transmission time of all the speakers is delayed in the
subwoofer SW similarly to 2Td. Specifically, the center speaker C delays 3 Td, the front speaker L
Td, the front speaker R 2 Td, the surround speaker SL Td, and the surround speaker SR delays Td
to transmit an audio signal. At the same time, the video signal delay unit 13 also adjusts the video
signal to be delayed by 2Td and performs video display. By the operation as described above,
audio and video are simultaneously propagated to the listening position as shown in FIG. 15 (c).
[0083]
As described above, even if the topology of the speaker is changed after the initial lip-sync
correction, the audio delay and the video delay can be appropriately performed to automatically
perform lip-sync correction and time alignment. As a result, the user need not consciously make
corrections, and convenience can be improved.
[0084]
<Notes> The object of the present invention is to supply a recording medium storing a program
for realizing the functions of the above-described embodiments to a system or apparatus, and a
computer of the system or apparatus reads the program stored in the storage medium. It is also
achieved by executing. In this case, the executable program itself read from the storage medium
implements the functions of the above-described embodiments, and the program itself and the
storage medium storing the program constitute the present invention.
09-05-2019
25
[0085]
For example, the processing of the voice processing unit 14 and the topology detection unit 19
can be implemented by software.
[0086]
In the present invention, an operating system (OS) or the like running on a computer performs
part or all of the actual processing based on the instructions of the program, and the functions of
the above-described embodiment are realized by the processing. Also included.
Furthermore, the present invention is also applied to the case where a program read from a
storage medium is written to a memory provided in a function expansion card inserted in a
computer or a function expansion unit connected to the computer. In that case, the CPU provided
in the function expansion card or the function expansion unit performs part or all of the actual
processing based on the written program instruction, and the functions of the above-described
embodiment are realized by the processing.
[0087]
In addition, the embodiments of the invention describe an apparatus or method configured
around the present invention. For this reason, in the present embodiment, additional constituent
features are described in addition to the essential parts of the present invention. That is, having
the configuration requirements of the apparatus or method described in the embodiments of the
invention is a sufficient condition for realizing the present invention, but is not a necessary
condition.
[0088]
FIG. 1 is an example of a block diagram of the entire home theater system according to a
proposed embodiment 1 to which the present invention is applicable. It is an example of the
block diagram of the external speaker in the home theater system of Embodiment 1 which this
invention can apply. It is an example figure of frame composition of a frame of an audio signal in
a proposed home theater system to which the present invention is applicable. FIG. 2 is an
example of a configuration of a topology diagnostic signal and a topology notification signal in a
09-05-2019
26
proposed home theater system to which the present invention is applicable. It is a figure
explaining the topology detection method in the home theater system of Embodiment 1 which
this invention can apply. It is a figure which shows the example of a topology structure of the
home theater system of Embodiment 1 which this invention can apply. It is a figure showing the
change of the synchronous timing of an audio | voice and an image by the case where it changes
from topology (A) to topology (B) in the home theater system of Embodiment 1 which this
invention can apply. It is a table figure regarding the topology information memorize | stored in
an audio / video correction value memory, when it changes from topology (A) to topology (B) in
the home theater system of Embodiment 1 which this invention can apply. It is a figure showing
the change of the synchronous timing of an audio | voice and an image by the case where it
changes from a topology (B) to a topology (A) in the home theater system of Embodiment 1
which this invention can apply. It is a table figure regarding the topology information memorize |
stored in an audio / video correction value memory, when changing from a topology (B) to a
topology (A) in the home theater system of Embodiment 1 which this invention can apply. FIG. 7
is an example of a block diagram related to the entire home theater system according to a
proposed embodiment 2 to which the present invention is applicable. It is an example of the
block diagram of the external speaker in the home theater system of Embodiment 2 which this
invention can apply. It is an example of the figure explaining the topology detection method in
the home theater system of Embodiment 2 which this invention can apply. It is an example figure
showing the example of topology composition of the home theater system of Embodiment 2
which the present invention can apply. It is a figure showing the change of the synchronous
timing of an audio | voice and an image at the time of changing from a topology (C) to a topology
(D) in the home theater system of Embodiment 2 which this invention can apply. It is a table
figure regarding the topology information memorize | stored in audio | voice / imaging | video
correction value memory, when it changes from topology (C) to topology (D) in the home theater
system of Embodiment 2 which this invention can apply.
It is a block diagram of a general 5.1 CH surround system. It is a block diagram of a common
home theater system. FIG. 7 is an example block diagram of a home theater system in which
external speakers are daisy-chained together.
[0089]
1 source device 2 AV controller 3 television receiver 10 microphone 11 television receiver builtin speaker 12 I / F 13 video signal delay unit 14 audio processing unit 15 I / F 16 test signal
generation unit 17 sound field analysis unit 18 audio / video correction Value memory 19
topology detection unit 20 modulation unit 21 demodulation unit
09-05-2019
27
Документ
Категория
Без категории
Просмотров
0
Размер файла
44 Кб
Теги
jp2010166534
1/--страниц
Пожаловаться на содержимое документа