close

Вход

Забыли?

вход по аккаунту

?

JP2009171150

код для вставкиСкачать
Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2009171150
[Problem] To provide a technology that allows a listener to listen to a lecturer's voice without a
sense of discomfort in a seminar or a presentation. A delay time for each terminal 2 is stored in a
storage unit 14 of a server device 1. When the lecturer A1 holds the microphone MA to give a
lecture, the voice of the lecturer A1 is picked up by the microphone MA, converted into an audio
signal, and output to the server device 1. The server device 1 converts the audio signal output
from the microphone MA into audio data in digital format, refers to the table stored in the
storage unit 14, and transmits to each terminal 2 only the delay time corresponding to each
terminal 2. Deliver audio data with delay. Each terminal 2 emits noise according to the
distributed voice data. [Selected figure] Figure 1
Voice delivery device
[0001]
The present invention relates to an audio delivery device.
[0002]
In class sessions and presentations, lecturers and presenters give classes and presentations to
multiple listeners.
In such a seminar or the like, Patent Document 1 proposes a voice electronic blackboard which
can be controlled by voice in order to more suitably conduct the course or the like. According to
09-05-2019
1
this voice electronic blackboard, characters which can not be written even if the reading is
understood can be entered by voice inputting. Unexamined-Japanese-Patent No. 06-85983
[0003]
By the way, when giving a lecture etc. in a large meeting place, the listener of a position far from
a lecturer etc. may be hard to catch a teacher's voice. Even in the case where a lecture or the like
is conducted using a microphone and a speaker, a listener at a position away from the speaker
may have difficulty in hearing the voice of the lecturer. The present invention has been made
under the above-described background, and it is an object of the present invention to provide a
technique that enables a listener to more suitably listen to the voice of a lecturer or the like in a
training session or the like. Do.
[0004]
In order to solve the above problems, according to the present invention, the sound emitted from
the sound emitting means installed at a predetermined position reaches the respective positions
of a plurality of terminals installed at a predetermined position. Transmission time storage means
for storing the transmission time required for each terminal, and network delay time generated in
delivery of each of the plurality of terminals to the voice signal output from the sound collection
means is stored for each terminal For each of the plurality of terminals, the network delay time
storage means delays the voice signal output from the sound collection means by a time
corresponding to the difference between the transfer time corresponding to each terminal and
the network delay time. A voice distribution apparatus is provided, comprising: distribution
means for distributing to each of the terminals.
[0005]
In a preferred aspect of the present invention, transmission time measuring means for measuring
the transmission time for each terminal, and transmission time storage for storing the
transmission time for each terminal measured by the transmission time measuring means in the
transmission time storage means Control means may be provided.
[0006]
Further, in a further preferred aspect of the present invention, a network delay time measuring
means for measuring the network delay time for each of the terminals, and a network delay time
for each of the terminals measured by the network delay time measuring means. And network
delay time storage control means for storing in the storage means.
09-05-2019
2
[0007]
Further, in another preferable aspect of the present invention, a distance measurement pulse
emitting means for emitting a distance measurement pulse from the sound emitting means, and a
distance measurement pulse emitted by the distance measurement pulse emitting means are
each of the plurality of terminals. The transmission time measuring means may measure the
transmission time for each of the terminals based on the detection result of the pulse detection
means.
[0008]
Further, in another preferred aspect of the present invention, a correspondence relationship
memory that stores the correspondence relationship between the position in the space where the
plurality of terminals are installed and the transmission time, and stores the correspondence
relationship between the position and the terminals. The transmission time measuring unit
specifies the transmission time corresponding to the position of each of the plurality of terminals
with reference to the correspondence relationship stored in the correspondence relationship
storage unit. Good.
[0009]
Further, in another preferred aspect of the present invention, a representative position
determination means for determining a representative position in the space where the plurality
of terminals are installed is provided, and the delivery means is determined by the representative
position determination means. The voice signal output from the sound collecting means may be
delayed for distribution to each of the terminals by a time according to the difference between
the transmission time of the terminal corresponding to the position and the network delay time.
[0010]
Further, in another preferable aspect of the present invention, a distribution detection means for
detecting a distribution of listeners in a space where the plurality of terminals are installed is
provided, and the delivery means is responsive to the detection result of the distribution
detection means. The delay time may be determined, and the audio signal output from the sound
collection means may be delayed for distribution by the determined delay time.
[0011]
In a further preferred aspect of the present invention, the video signal output means may be
provided for delaying the video signal by a time according to the delay time of the distribution
09-05-2019
3
means and outputting the delayed video signal to the display means.
[0012]
According to the present invention, in a seminar or the like, the listener can listen more favorably
to the voice of the lecturer or the like in comparison with the prior art.
[0013]
<A: Configuration> FIG. 1 is a diagram showing a configuration of a system according to an
embodiment of the present invention.
In this embodiment, a case where a training session is performed at a certain venue will be
described.
In the figure, the hall S has a lecturer A1 and a plurality of listeners A211, A212, ..., A2 mn (m
and n are natural numbers) who listen to the lecture of the lecturer A1, and a plurality of
listeners A211, A212, ... , A2 mn are located in the seats provided in advance, and listen to the
lectures of the lecturer A1 at their respective positions.
The lecturer A1 gives a lecture using the microphone MA.
The microphone MA picks up and outputs an audio signal representing the picked up voice.
The audio signal output from the microphone MA is output to the power amplifier PA and to the
server device 1.
The power amplifier PA amplifies the audio signal output from the microphone MA, and outputs
the amplified audio signal to the speaker SP.
The speaker SP emits sound at an intensity corresponding to the audio signal supplied from the
power amplifier PA.
09-05-2019
4
The speaker SP is installed at a predetermined position in the hall S (for example, on the center
front side of the hall S).
The listener of the lecture can listen to the lecture content of the lecturer A1 by listening to the
sound emitted from the speaker SP.
[0014]
Terminals 211, 212,..., 2 mn are respectively installed in the plurality of seats. The server device
1 and the plurality of terminals 211, 212,..., 2mn are communicably connected via a
communication network 3 such as a LAN (Local Area Network). In this system, the listener can
listen to the sound emitted from the speaker SP, and listens to the sound emitted from the
terminals 211, 212,..., 2mn assigned to each of the listeners. It can be done. In the following
description, for convenience of explanation, when it is not necessary to distinguish between the
terminals 211, 212,... 2mn, these will be referred to as “terminal 2”. Similarly, in the case
where it is not necessary to distinguish the listener terminals A211, A212,..., A2mn, these will be
described as “listener A2”.
[0015]
FIG. 2 is a block diagram showing an example of the hardware configuration of the server device
1. In the figure, a central processing unit (CPU) 11 reads out a computer program stored in a
read only memory (ROM) 12 or a storage unit 14, loads it into a random access memory (RAM)
13, and executes the program. , Controls each unit of the server device 1. The storage unit 14 is,
for example, a large-capacity storage unit such as a hard disk. The display unit 15 includes a
liquid crystal display and the like, and displays various screens such as a menu screen for
operating the server device 1 under the control of the CPU 11. The operation unit 16 includes a
mouse and a keyboard, and outputs a signal corresponding to the content of the operation to the
CPU 11. The communication unit 17 communicates with devices connected via the
communication network 3.
[0016]
09-05-2019
5
The storage unit 14 has a delay time table storage area 141 as illustrated. The delay time table
storage area 141 stores a delay time table to be referred to when the CPU 11 distributes voice
data to the terminal 2. FIG. 3 is a diagram showing an example of the contents of the delay time
table. As illustrated, in the table, items of “terminal ID”, “transmission time”, “network
delay time”, and “delay time” are stored in association with one another. The identification
information which identifies the terminal 2 is memorize | stored in the item of "terminal ID"
among these items. Transmission time data indicating the transmission time required for the
sound wave emitted from the speaker SP at the hall S to reach the terminal 2 is stored in the item
of “transmission time”. In the item of “network delay time”, network delay time data
indicating a network delay time that occurs when the server device 1 performs streaming
delivery of audio data representing voice collected by the microphone MA to the terminal 2 is
stored. . In the item “delay time”, delay time data indicating a delay time for each terminal 2
when the CPU 11 distributes sound data to the terminal 2 is stored.
[0017]
FIG. 4 is a block diagram showing an example of the hardware configuration of the terminal 2. In
the figure, the CPU 21 reads out a computer program stored in the ROM 22 or the storage unit
24, loads the read computer program into the RAM 23, and controls each unit of the server
apparatus 1 by executing this. The storage unit 24 is, for example, a large-capacity storage unit
such as a hard disk. The display unit 25 includes a liquid crystal display and the like, and displays
various screens such as a menu screen for operating the terminal 2 under the control of the CPU
21. The operation unit 26 includes a mouse and a keyboard, and outputs a signal corresponding
to the content of the operation to the CPU 21. The microphone 30 picks up and outputs an audio
signal representing the picked up voice. The audio processing unit 28 A (analog) / D (digital)
converts an audio signal output from the microphone 30 to generate audio data. Further, the
audio processing unit 28 converts audio data supplied from the CPU 21 into an analog signal by
D / A conversion and supplies the analog signal to the speaker 29. The speaker 29 emits sound
with an intensity corresponding to the analog signal output from the audio processing unit 28.
[0018]
In this embodiment, although the case where the microphone 30 and the speaker 29 are
included in the terminal 2 will be described, the audio processing unit 28 is provided with an
input terminal and an output terminal, and the audio processing unit 28 is externally connected
to the input terminal via the audio cable. A microphone may be connected, and similarly, an
09-05-2019
6
external speaker may be connected to the output terminal via an audio cable. In this
embodiment, an audio signal input from the microphone 30 to the audio processing unit 28 and
an audio signal output from the audio processing unit 28 to the speaker 29 are analog audio
signals, but digital audio data You may input and output. In such a case, the audio processing
unit 28 does not have to perform A / D conversion or D / A conversion. The same applies to the
display unit 25 and the operation unit 26, and an external output terminal may be provided and
an external monitor may be connected, or an external input terminal may be provided and an
operator such as a mouse and a keyboard may be connected.
[0019]
When the venue S is wide, a listener at a position far from the speaker SP may not easily hear the
sound emitted from the speaker SP. In order to cope with such a case, in this embodiment, the
audio data representing the audio collected by the microphone MA is streamed to the terminal 2
assigned to each of the listeners by the server device 1 and distributed. Is played on the terminal
2. Thus, the listener can listen to the voice of the lecturer A1 by the voice emitted from the
speaker 29 of the terminal 2 in addition to the voice emitted from the speaker SP.
[0020]
<B: Operation> Next, the operation of this embodiment will be described. <B-1: Transmission
Time Measurement Operation> First, the measurement operation of the transmission time will be
described. In this operation example, the distance measurement pulse is emitted from the
speaker SP, and the transmission time is measured by measuring the impulse response on the
terminal 2 side. A worker such as a system administrator installs a speaker SP at a predetermined
position in the hall S, and for each of the terminals 2, the delay time (transmission time) due to
the distance between each terminal 2 and the speaker SP The operation for measuring is
performed using the operation unit 16 of the server device 1. In this operation example, in order
to measure the delay time due to the distance between each terminal 2 and the speaker SP, the
impulse response at the position of each terminal 2 is measured, but it is not limited to
measuring the impulse response. As long as it measures the delay time due to the distance to the
speaker SP, any other method may be used. The operation unit 16 outputs a signal according to
the content of the operation to the CPU 11, and the CPU 11 supplies a signal indicating a
distance measurement pulse to the speaker SP in accordance with the signal supplied from the
operation unit 16. Thereby, the distance measurement pulse is emitted from the speaker SP.
09-05-2019
7
[0021]
The distance measurement pulse emitted from the speaker SP is collected by the microphone 30
of each terminal 2, converted into an audio signal, and output to the audio processing unit 28.
The audio processing unit 28 A / D converts an audio signal supplied from the microphone 30,
and the CPU 21 calculates a delay time (transmission time) by analyzing the converted audio
data. The CPU 21 transmits data indicating the calculated transmission time to the server device
1.
[0022]
Data indicating transmission time of each of the plurality of terminals 2 is transmitted to the
server device 1. The server device 1 registers data indicating the transmission time received from
each of the plurality of terminals 2 in the table stored in the delay time table storage area 141.
[0023]
<B-2: Network Delay Time Measurement Operation> Further, using the operation unit 16, the
operator performs an operation for measuring the network delay time for each of the terminals
2. The CPU 11 performs processing for measuring the delay time in accordance with the
operation signal from the operation unit 16. As this processing, for example, measurement may
be performed by measuring the time required for looping back Ping (Packet Internet Groper)
packets, and processing other than this may be used as long as it is processing to measure delay
time. . The CPU 11 of the server device 1 measures the network delay time for each of the
plurality of terminals 2 and registers the measurement result in the delay time table storage area
141.
[0024]
<B-3: Delay Time Calculation Operation> When storing the transfer time for each terminal 2 and
the network delay time in the delay time table, the CPU 11 calculates the time according to the
difference between the transfer time for each terminal 2 and the network delay time. The delay
time calculated and stored as the delay time is stored in the delay time table. In this operation
example, the CPU 11 reduces (or matches) the difference between the timing at which the
09-05-2019
8
speaker SP is emitted and reaches the position of each terminal 2 and the timing at which the
speaker 29 of the terminal 2 is emitted. To calculate the delay time. Specifically, for example,
when the transfer time of a certain terminal 2 is t11 and the network delay time is t21, and t11>
t21, the delay time is calculated as (t11-t21). .
[0025]
<B-4: Voice Distribution Operation> Next, the voice distribution operation will be described. The
lecturer gives a lecture using the microphone MA. At this time, the lecturer operates the
operation unit 16 of the server device 1 to cause the display unit 15 to display materials related
to the lecture. The server device 1 outputs the image data indicating the material to the display
unit 15 according to the signal output from the operation unit 16, and the display unit 15
displays the image indicating the material according to the signal supplied from the server device
1. Display
[0026]
The voice of the lecturer A1 is collected by the microphone MA, converted into a voice signal,
and output to the power amplifier PA and the server device 1. The power amplifier PA amplifies
the audio signal output from the microphone MA and supplies it to the speaker SP, and the
speaker SP emits sound at an intensity corresponding to the audio signal supplied from the
power amplifier PA. The listener of the lecture recognizes the content of the lecture of the
lecturer A1 by listening to the sound emitted from the speaker SP.
[0027]
Next, the operation of the server device 1 to which an audio signal is supplied from the
microphone MA will be described. The CPU 11 of the server device 1 distributes the audio signal
output from the microphone MA to the terminal 2. At this time, the CPU 11 delays the voice data
for each of the plurality of terminals 2 by the delay time stored in the delay time table storage
area 141 and distributes the voice data to each of the terminals 2. That is, the time when the
difference between the timing when the sound is emitted from the speaker SP and reaches the
position of each terminal 2 and the timing when the sound is emitted from the speaker 29 of the
terminal 2 becomes smaller (or matched) Only deliver audio data delayed. Specifically, for
example, when the transfer time of a certain terminal 2 is t11 and the network delay time is t21,
09-05-2019
9
and t11> t21, the CPU 11 sets the voice data to a time (t11). It is delivered to the terminal 2 with
a delay of -t21).
[0028]
When the terminal 2 receives audio data from the server device 1, the terminal 2 converts the
received audio data into an analog signal and outputs the analog signal to the speaker 29. The
speaker 29 emits sound at an intensity corresponding to the supplied audio signal. The listener
can recognize the lecture content of the lecturer A1 by listening to the sound emitted from the
speaker 29.
[0029]
Here, an example of a specific operation of the delay process performed by the server device 1
will be described. Here, it is assumed that the transfer time corresponding to the terminal 2m3
shown in FIG. 1 is t11 and the network delay time is t21. Further, it is assumed that the terminal
2m3 and the speaker SP are separated, and t11> t21. The CPU 11 distributes the voice data to
the terminal 2m3 with a delay of time (t11 to t21). At this time, the sound wave emitted from the
speaker SP reaches the position of the terminal 2m3 over time t11. Therefore, a listener at the
position of the terminal 2m3 listens to the emitted sound with a delay of time t11 after the sound
is emitted from the speaker SP. On the other hand, audio data is delivered to the terminal 2m3
with a delay of time (t11 to t21) from the server device 1. At this time, since network delay time
t21 occurs in data transmission from the server device 1 to the terminal 2m3, the voice data
distributed to the terminal 2m3 is delayed by {(t11−t21) + t21} = t11. . As described above, both
the sound emitted from the speaker SP and the sound emitted from the speaker 29 are delayed
by the time t11 and reach the listener's ear.
[0030]
By the way, when the voice of the lecturer is emitted by the speaker SP and output to the
terminal 2 of the listener, the timing at which the speaker 29 is emitted and reaches the listener's
ear and the output from the speaker 29 of the terminal 2 There may be a time lag between the
timing of reaching the listener's ear. The amount of deviation depends on the processing
capability of the terminal 2, the delay of the network, and the distance from the speaker SP to the
listener. In particular, the amount of delay increases as the distance from the speaker SP to the
09-05-2019
10
listener increases, so the amount of deviation increases.
[0031]
As described above, the listener may feel discomfort when there is a temporal difference between
the sound heard from the speaker SP and the sound emitted from the speaker 29 of the terminal
2. This discomfort may make it difficult for the listener to understand the contents of the course.
[0032]
On the other hand, in this embodiment, the CPU 11 delays the audio data so that the timing at
which the speaker SP emits a sound and reaches each listener matches the timing at which the
speaker 29 of the terminal 2 emits a sound. Since the content is distributed, the listener does not
feel different from the sound heard from the speaker SP and the sound heard from the speaker
29, and does not feel discomfort.
[0033]
As described above, according to the present embodiment, the server device 1 delays the voice
data to the terminal 2 so that the timing of sound generation between the sound from the
speaker SP and the sound from the speaker 29 of the terminal 2 does not shift. Since the
distribution is performed, this allows the listener to hear the presentation, the lecture, etc.
without feeling discomfort, without the occurrence timing of the sound from the two different
speakers being shifted.
[0034]
<C: Modified Example> Although the embodiment of the present invention has been described
above, the present invention is not limited to the above-described embodiment, and can be
implemented in other various forms.
An example is shown below.
(1) In the above embodiment, the CPU 11 measures the transmission time of the sound from the
speaker SP to the terminal 2 using the distance measurement pulse, but the method of specifying
09-05-2019
11
the transmission time is not limited thereto, for example, in the hall S The correspondence
relationship between the position and the transmission time at the position is stored in advance
in the storage unit 14, and the CPU 11 refers to the correspondence relationship stored in the
storage unit 14 for each of the plurality of terminals 2 The transmission time corresponding to
the position of may be specified.
[0035]
(2) In the above-mentioned embodiment, although the transmission time of each terminal 2 was
measured about each of a plurality of terminals 2, it replaces with this and the server apparatus
1 determines the position which becomes a representative within the hall S, The voice data may
be delayed and distributed to each of the terminals 2 by a time corresponding to the difference
between the transmission time of the terminal 2 corresponding to the determined representative
position and the network delay time. Specifically, for example, the CPU 11 specifies a central
position in the hall S, and only the time according to the difference between the transmission
time to the specified position and the network delay time for delivery to the terminal at that
position, Audio data may be delayed for delivery.
[0036]
As another example, the server device 1 detects the distribution of listeners in the hall S,
determines the delay time according to the detection result, and delays the audio data for
distribution by the determined delay time. May be Specifically, for example, when there are a
large number of listeners ahead in the hall S, the server device 1 delays and distributes the audio
data by the time according to the transmission time to the front position. It is also good. As a
method of detecting the distribution of listeners in the hall S, for example, the storage unit 14 of
the server device 1 stores in advance the correspondence between the identification information
for identifying the terminal 2 and the position at which the terminal is installed; The CPU 11
detects for each of the plurality of terminals 2 by monitoring whether the terminal 2 is being
used by the listener (whether the power is on, logged in, etc.), and is used by the listener The
distribution of the listener may be detected by detecting the distribution of the terminal 2
determined to be present.
[0037]
(3) In the above-mentioned embodiment, although the example by which the system concerning
the above-mentioned embodiment is used at a seminar was explained, the utilization form of the
09-05-2019
12
system may be not only a seminar but a presentation etc. For example, it may be used in a
theater performed in a theater or a hall.
[0038]
(4) In the above-described embodiment, the server device 1 delays and distributes voice data.
However, instead of the server device, a plurality of terminals 2 may perform delay processing.
In this case, a delay time storage area 241 (shown by a chain line in FIG. 4) for storing the delay
time corresponding to each terminal 2 is provided in the storage unit 24 in the terminal 2
respectively, and the CPU 21 of the terminal 2 When audio data is received from 1, the received
audio data may be delayed by the delay time stored in the storage unit 14 and output to the
speaker SP.
[0039]
Also, when the terminal 2 performs the delay processing, the user may be able to adjust the delay
time. Specifically, for example, the listener performs an operation for adjusting the delay time
using the operation unit 26, and the CPU 11 changes the delay time according to the operation
signal output from the operation unit 26. It is also good.
[0040]
(5) In the above-described embodiment, when lectures and the like are performed in a plurality
of venues, the delay time table may be stored in the storage unit 14 for each venue. In this case,
the CPU 11 may select a table to be referred to according to the venue.
[0041]
(6) In the above embodiment, the CPU 11 determines the delay time according to both the sound
wave transmission time and the network delay time, but the aspect of the delay time
determination is not limited to this, and the network delay time is considered. You may not do. In
09-05-2019
13
this case, the CPU 11 distributes the voice data to each terminal 2 by delaying the sound wave
transmission time. For example, when the network delay time is extremely small and can be
ignored, voice data is thus delayed according to only the transmission time without considering
the network delay time. Good.
[0042]
(7) In the above-described embodiment, when displaying the material relating to the lecture on
the display unit 15, the server device 1 delays the video signal representing the content of the
material by the time corresponding to the above-mentioned delay time. It may be output to 15.
By doing this, it is possible to synchronize the sound emission timing of the voice and the display
timing of the material.
[0043]
(8) The program executed by the CPU 11 of the server device 1 in the embodiment described
above is a computer such as a magnetic recording medium (magnetic tape, magnetic disc, etc.),
an optical recording medium (optical disc, etc.), May be provided stored in a readable recording
medium. Moreover, it is also possible to make it download to the server apparatus 1 via
networks, such as the internet.
[0044]
It is a figure which shows an example of a structure of a voice delivery system. FIG. 2 is a block
diagram showing an example of a hardware configuration of a server device 1; It is a figure
which shows an example of the content of a delay time table. FIG. 2 is a block diagram showing
an example of a hardware configuration of a terminal 2;
Explanation of sign
[0045]
DESCRIPTION OF SYMBOLS 1 ... Server apparatus, 2 ... Terminal, 3, 4 ... Communication network,
11, 21 ... CPU, 12, 22 ... ROM, 13, 23 ... RAM, 14, 24 ... Storage part, 15, 25 ... Display part, 16,
09-05-2019
14
26: operation unit, 17: communication unit, 28: voice processing unit, 29: speaker, 30:
microphone, 141: delay time table storage area.
09-05-2019
15
Документ
Категория
Без категории
Просмотров
0
Размер файла
26 Кб
Теги
jp2009171150
1/--страниц
Пожаловаться на содержимое документа