JP2017038342

Patent Translate
Powered by EPO and Google
Notice
This translation is machine-generated. It cannot be guaranteed that it is intelligible, accurate,
complete, reliable or fit for specific purposes. Critical decisions, such as commercially relevant or
financial decisions, should not be based on machine-translation output.
DESCRIPTION JP2017038342
Abstract: To provide an automatically controllable directional speaker and an LED lighting device.
The automatically controllable directional speaker 1 includes a speaker 10, a video capture 11,
a computing unit 12, a direction control motor 13, and an amplitude control unit 14. The video
capture 11 detects and captures the surrounding environment and generates video information.
The computing unit 12 is connected to the video capture 11 and determines, based on the video
information, whether at least one piece of face information is present. The direction control
motor 13 is connected to the speaker 10 and controls the audio output direction of the speaker 10
based on the face information. The amplitude control unit 14 is in electrical communication with
the speaker 10 and controls the volume of the speaker 10 based on the face information. The
automatically controllable directional speaker 1 can be combined with an LED lamp to form an LED
lighting device that provides both audio output and lighting. [Selected figure] Figure 1
Automatically controllable directional speaker and LED lighting
[0001]
The present invention relates to an audio output device, and in particular to an automatically
controllable directional speaker capable of reproducing audio toward a specific person by
automatically identifying a face, and to an LED lamp that incorporates the speaker to make
daily life more convenient.
[0002]
A speaker is a very widely used device that converts
03-05-2019
1
electrical signals into sound so that various sound files can be reproduced.
Furthermore, the speakers can be coupled to each other to be assembled as a playback device
such as a speaker box.
[0003]
The sound field of a speaker is limited by its mounting position. That is, the user can clearly
hear the output sound of the speaker only within a certain range. Sound as perceived by a person
is the result of auditory processing and analysis in the brain, and is judged by factors such
as its direction, distance, and loudness. For example, when listening to audio or a radio
broadcast indoors, if the mounting position of the speaker is fixed, the sound reaching the ear
changes as the user moves. As an example, when the user moves away from the speaker, the sound
becomes gradually quieter and harder to hear because of the increased distance, the angular
deviation, and the like. Therefore, in order to let the user hear the sound from any position,
the only options are to install a large number of speakers throughout the room or to install an
expensive speaker with a stereoscopic surround sound field effect. Either option, however,
entails a large installation cost.
[0004]
Furthermore, when many people share the same space, the sound reproduced from the speaker
may disturb the others, because the sound field of the speaker cannot be directed only at a
specific person. For this reason, known speakers cannot let only a specific person listen in a
crowded space, a public place, and the like. One known remedy is to control the output direction
of the speaker manually from a remote location, but the speaker must then be steered by hand so
that it constantly follows the movement of the intended listener. Moreover, such means of
controlling the direction of the speaker only direct the sound toward the listener and do not
adjust the volume heard by the listener. They may also be a poor solution because the result may
be affected by the presence of other people around the listener.
[0005]
Besides speakers, lamps are among the indispensable electronic devices today. LED lamps are now
used in large numbers in place of the former incandescent lamps, and this type of lamp is
mounted for lighting both indoors and outdoors. The lamp is connected to a power supply to
obtain operating power, and its light emission state can be adjusted through a control circuit.
Speakers and lighting devices are both electronic devices, but they are generally installed
separately, so they occupy more space and crowd the living space.
[0006]
As science and technology advance rapidly, users pursue an ever higher quality of life. The
present invention therefore aims to improve the effectiveness and practicality of the speaker
and, at the same time, to reduce the physical space it occupies. To provide the user with a more
convenient design, an automatically controllable directional speaker and an LED lighting device
have been devised that effectively solve the problems described above.
[0007]
SUMMARY OF THE INVENTION It is an object of the present invention to provide an automatically
controllable directional speaker that detects and identifies the presence of a person in the
environment and then automatically reproduces audio toward the designated person.
[0008]
Another object of the present invention is to provide an LED lamp combined with the automatically
controllable directional speaker, so that the LED lamp both reproduces sound toward and
illuminates the automatically designated person.
[0009]
In order to achieve the above-mentioned objects, the automatically controllable directional
speaker of the present invention comprises: a speaker; a video capture which shoots the
surrounding environment in real time to generate at least one piece of video information; a
computing unit which is in electrical communication with the video capture and determines
whether the video information contains at least one piece of face information; a direction
control motor which is connected to the speaker and controls the audio output direction of the
speaker based on the face information; and an amplitude control unit which is in electrical
communication with the speaker and controls the speaker volume based on the face information.
Preferably, the computing unit performs its analysis using fuzzy operations.
[0010]
The video capture has an infrared focusing device or an ultrasonic focusing device. The face
information includes the distance to the face of the target person and the direction of the face
of the target person, both of which are determined by the infrared focusing device or the
ultrasonic focusing device.
Determining the relative distance and direction between the person and the automatically
controllable directional speaker in this way allows the speaker to provide a more accurate audio
output.
[0011]
The present invention may additionally include an LED auxiliary light attached to the side of
the video capture. The light provides sufficient illuminance to the surrounding environment and
thereby improves the clarity of the video information.
[0012]
So that the video capture can provide video information covering a wider field of view, a
wide-angle imaging lens may be attached to the video capture, or the video capture may be
mounted on a pivoting tool. The shooting angle and range of the video capture can then be
adjusted vertically and horizontally through the pivoting tool.
[0013]
In addition, according to the present invention, the sound output unit of the speaker may
be provided with a movable wave guiding structure.
The movable wave guiding structure includes an audio input port, an audio output port, and an
audio path interposed between the audio input port and the audio output port. By adjusting the
relative aperture sizes of the audio input port and the audio output port, the speaker output
volume can be adjusted to provide a better listening effect.
[0014]
In another embodiment, the present invention provides an LED lighting device comprising the
automatically controllable directional speaker described above.
The LED lighting device takes the form of a light bulb or a downlight, and the computing unit
inside it likewise performs its analysis by fuzzy operation.
The present invention can thereby combine the speaker with the lamp, so that the speaker outputs
sound at the same time as the lamp illuminates a specific person.
[0015]
As described above, the present invention detects and analyzes the presence of people in the
surroundings and automatically directs the audio toward the designated person, so that the user
enjoys optimal listening even while moving. In addition, the audio can be set so that only a
specific person hears it, which prevents it from disturbing others. Furthermore, the present
invention combines the speaker with a lamp to form an LED lamp which, by recognizing a person's
face, outputs audio while illuminating that specific person, thereby making daily life more
convenient.
[0016]
FIG. 1 is a block diagram showing the modules of the first embodiment of the present invention.
FIG. 2 is a perspective view of the first embodiment of the present invention.
FIG. 3 shows an application of the first embodiment of the present invention (Part 1).
FIG. 4A shows an application of the first embodiment of the present invention (Part 2).
FIG. 4B shows an application of the first embodiment of the present invention (Part 3).
FIG. 5 is a perspective view of another variant of the first embodiment of the present invention.
A further figure is a perspective view of the second embodiment of the present invention.
A further figure is a perspective view of another variant of the second embodiment of the
present invention.
[0017]
To help the examiners further understand the contents of the present invention, embodiments
of the present invention are described below with reference to the drawings.
[0018]
Refer to FIGS. 1, 2, 3, 4A and 4B, which show a block diagram of the modules of the first
embodiment of the present invention, a perspective view, and applications (Part 1) to (Part 3),
respectively. The automatically controllable directional speaker 1 of the present invention
includes a speaker 10, a video capture 11, a computing unit 12, a direction control motor 13,
and an amplitude control unit 14.
[0019]
The video capture 11 detects and captures the surrounding environment in real time to generate
at least one piece of video information. The computing unit 12 is in electrical communication
with the video capture 11 and determines whether the video information generated by the video
capture 11 contains at least one piece of face information. Preferably, the video capture 11 is
provided with an infrared focusing device 111 or an ultrasonic focusing device, so that it
detects and captures the environment and obtains video information based on the infrared
principle or the ultrasonic principle. The face information includes the distance to the face of
the target person and the direction of the face of the target person, both of which are
determined by the infrared focusing device 111 or the ultrasonic focusing device. This
embodiment takes as an example the case in which the infrared focusing device 111 is installed
in the video capture 11.
[0020]
The direction control motor 13 is connected to the speaker 10 and controls the audio output
direction of the speaker 10 based on the direction of the face of the target person in the face
information. The amplitude control unit 14 is connected to the speaker 10 and controls the
volume of the speaker 10 based on the distance to the face of the target person in the face
information. In this embodiment, the video capture 11 first detects and photographs the
environment around the automatically controllable directional speaker 1, and the computing unit
12 then analyzes the resulting video information to determine whether face information is
present. If face information is present, the direction of the face and the distance to the face
of the target person are further determined, and the speaker 10 is controlled by the direction
control motor 13 and the amplitude control unit 14. The audio is thus reproduced accurately
toward the location of the person, and the volume is adjusted according to the distance between
the person and the speaker 10. Furthermore, so that the video information detected and captured
by the video capture 11 covers a wide field of view, a wide-angle imaging lens, for example, may
be attached to the video capture 11, which allows video information to be generated accurately
even in a large space.
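The control step described in this paragraph can be sketched as follows. This is only an
illustration under stated assumptions: the patent does not give a formula for the volume, so the
sketch assumes free-field point-source spreading (about 6 dB of level lost per doubling of
distance), and the function names, the 1 m reference distance, and the 12 dB clamp are all
hypothetical.

```python
import math


def gain_db_for_distance(distance_m, reference_m=1.0, max_gain_db=12.0):
    """Extra gain (dB) that keeps the level at the listener roughly constant.

    Assumption: free-field point-source spreading, where the level falls by
    20*log10(d / d_ref) dB, i.e. about 6 dB per doubling of distance.
    """
    if distance_m <= 0:
        raise ValueError("distance must be positive")
    gain = 20.0 * math.log10(distance_m / reference_m)
    # Clamp so a far (or mis-measured) face never overdrives the amplifier.
    return max(-max_gain_db, min(max_gain_db, gain))


def control_step(face_direction_deg, face_distance_m):
    """One control step: an aim command (for the direction control motor 13)
    and a gain command (for the amplitude control unit 14)."""
    return {
        "pan_deg": face_direction_deg,
        "gain_db": gain_db_for_distance(face_distance_m),
    }
```

For instance, `control_step(30.0, 2.0)` would aim the speaker 30 degrees off axis and raise the
gain by roughly 6 dB relative to the 1 m reference.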
[0021]
As described above, in the present invention the computing unit 12 analyzes the video
information captured by the video capture 11 to determine whether it includes face information.
Face detection determines whether a face is present in an image, for example by extracting
features from the image, and then reports the position of each face and the number of faces.
Face detection methods fall into two types: one is based on features and the other is based on
learning. Feature-based schemes can be further classified as follows. The first encodes
knowledge of what constitutes a typical face and determines the face area from the geometrical
properties of the face shape. For example, an ellipse is used as the reference shape for a face:
edge detection finds an elliptical area in the image, and that area is then tested to determine
whether it is a face. The second uses characteristics such as the structure of facial features,
skin color, facial symmetry, and the positional distribution of features as the basis of
determination. The third is template matching: the input image is matched against a plurality of
stored face sample images, and the presence of a face is determined from the correlation between
the image and the face samples. All of these are feature-based methods. They are seldom used
alone; face detection usually integrates several of them. For example, when skin color alone is
used for face detection, the background interferes with it, so other features must also be
evaluated to improve accuracy.
[0022]
Learning-based methods work as follows. An artificial neural network is trained on a large
number of variations in the appearance of the face part of images and, after many patterns have
been formed, the position of a face is detected from a single image whose size range is fixed.
However, this type of scheme is prone to detection failure when some features are unclear.
[0023]
As an example, consider a method in which skin color is the main determination condition for
face detection and the features of the face pattern are a secondary determination condition. The
method segments the input image, analyzes the color tone information of the skin color of the
face, applies morphological operations, and separates the candidate face areas. It then matches
the oval outline of the face contour against a length-to-width ratio and, together with the
geometric shape of the pattern, eliminates the areas that do not belong to a human face and
locates the face. In this method, the face position is determined by combining a plurality of
arithmetic methods and theoretical calculations. The color information of the image may deviate
from the ideal value under the influence of the color tone of the light or the color deviation
of the image capturing device. To eliminate such external environmental factors, the details of
the operations may be adjusted or supplemented as needed. To improve the accuracy of the
determination, the positions of a plurality of face areas are separated from the image and
combined. As described above, combining multiple features and processing them with multiple
operations allows the location of the face area to be separated from the digital image, and even
differences in the face caused by lighting conditions and angle can be accounted for and
eliminated by calculation.
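A minimal sketch of the skin-color-first approach described above, in plain Python. It is not
the patent's algorithm: the RGB thresholds are a commonly quoted heuristic used here purely as
an assumption, the morphological clean-up step is omitted, and the function names and the
accepted height-to-width range are hypothetical.

```python
def is_skin(r, g, b):
    """Crude RGB skin test; the thresholds are illustrative assumptions."""
    return (r > 95 and g > 40 and b > 20
            and max(r, g, b) - min(r, g, b) > 15
            and abs(r - g) > 15 and r > g and r > b)


def face_candidate(image, min_ratio=1.0, max_ratio=1.8):
    """Return the bounding box (top, left, bottom, right) of the skin pixels
    if its height-to-width ratio is plausible for an oval face, else None.

    `image` is a list of rows; each row is a list of (r, g, b) tuples.
    """
    coords = [(y, x)
              for y, row in enumerate(image)
              for x, (r, g, b) in enumerate(row)
              if is_skin(r, g, b)]
    if not coords:
        return None
    ys = [y for y, _ in coords]
    xs = [x for _, x in coords]
    top, bottom, left, right = min(ys), max(ys), min(xs), max(xs)
    ratio = (bottom - top + 1) / (right - left + 1)
    # Secondary condition: reject regions whose shape cannot be a face.
    if min_ratio <= ratio <= max_ratio:
        return (top, left, bottom, right)
    return None
```

A real implementation would also have to split the skin mask into separate connected regions
and compensate for lighting, as the paragraph notes; the sketch treats all skin pixels as one
region to stay short.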
[0024]
The determination steps of the various arithmetic methods described above each yield a definite
result. In other words, systems of this type conventionally use classical set operations, and
under classical set theory the relation between a single element and a set is limited to two
kinds: the membership of an element in a set is binary, either "0" or "1". Therefore, a face
recognition system built on these arithmetic methods segments the image or makes judgments based
on facial features and the like, and its result is a simple two-valued one, such as "present" or
"absent", "yes" or "no".
[0025]
In contrast to classical set theory, the extended algorithm is called fuzzy operation and
belongs to fuzzy set theory. Fuzzy set theory generalizes the characteristic function of
classical set theory, which takes only the two values "0" and "1", so that the membership
function of a fuzzy set may take any value between 0 and 1. Because the membership value ranges
over 0 to 1, there are infinitely many possible membership functions corresponding to the
conditions and factors of a fuzzy set. Thus, under fuzzy set theory, the relationship of an
element to a set is no longer two-valued; the similarity between the element and the set gives a
membership value. When there is a large amount of information, or when elements must be
classified into ambiguous, ill-defined clusters, the fuzzy operation is an effective and fast
mechanism. The computing unit 12 of the present invention determines whether the video
information captured by the video capture 11 includes face information by a fuzzy operation.
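The difference between a classical characteristic function and a fuzzy membership function can
be illustrated with a short sketch. The choice of hue as the measured quantity and every
breakpoint below are assumptions made for illustration only; the patent does not specify any
particular membership function.

```python
def trapezoid(x, a, b, c, d):
    """Trapezoidal membership: 0 outside (a, d), 1 on [b, c], linear between."""
    if x <= a or x >= d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    if x <= c:
        return 1.0
    return (d - x) / (d - c)


def crisp_is_skin_hue(hue_deg):
    """Classical set: the element is either in the set (1) or not (0)."""
    return 1 if 0.0 <= hue_deg <= 50.0 else 0


def fuzzy_is_skin_hue(hue_deg):
    """Fuzzy set: the degree of membership is any value between 0 and 1."""
    return trapezoid(hue_deg, -10.0, 5.0, 40.0, 60.0)
```

With these assumed breakpoints, a hue of 50 degrees is fully "in" the classical set and a hue of
51 degrees is fully "out", whereas the fuzzy set assigns both of them an intermediate degree of
membership close to 0.5.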
[0026]
The computing unit 12 according to the present invention analyzes the input video information to
extract a plurality of elements, uses human rules of thumb to construct the related fuzzy sets,
determines the membership value of each element in those fuzzy sets, and combines the membership
values to estimate whether the video information includes face information. As an example, a
human face is generally understood to be composed of features such as an approximately oval
outline, the facial features, and skin color. Therefore, after the video information is input
into the computing unit 12, the presence of face information in the video information can be
judged by using the fuzzy operation to analyze the membership value of each element of the video
information as described above.
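One way to combine per-element membership values into a single face estimate is sketched below.
The three input scores, their breakpoints, the use of the minimum (the standard fuzzy
intersection) as the combination rule, and the 0.5 decision threshold are all assumptions for
illustration; the patent only states that membership values are combined.

```python
def ramp(x, lo, hi):
    """Membership that rises linearly from 0 at `lo` to 1 at `hi`."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)


def face_membership(outline_ellipse_fit, skin_fraction, feature_layout_score):
    """Combine three hypothetical cues, each scored in [0, 1] upstream.

    Returns (overall, per_cue), where `overall` is the fuzzy AND (minimum)
    of the per-cue membership values.
    """
    per_cue = {
        "oval_outline": ramp(outline_ellipse_fit, 0.5, 0.9),
        "skin_color": ramp(skin_fraction, 0.3, 0.7),
        "facial_features": ramp(feature_layout_score, 0.4, 0.8),
    }
    return min(per_cue.values()), per_cue


def contains_face(overall, threshold=0.5):
    """Final yes/no decision derived from the graded membership value."""
    return overall >= threshold
```

Using the minimum means that one very weak cue vetoes the detection; a weighted average would be
a more forgiving alternative if single cues are unreliable.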
[0027]
Furthermore, depending on the spatial environment in which the automatically controllable
directional speaker 1 is installed and on the audio output conditions to be set, the present
invention provides a plurality of modes with different condition settings in order to achieve an
optimal audio output effect. Commonly encountered classifications include distance, direction,
and number of people. The modes available for each classification are described below. Regarding
distance, suppose that the automatically controllable directional speaker 1 has three audio
output modes, far, middle, and near. The user sets the distance at which the automatically
controllable directional speaker 1 is to output audio to at least one listener 2. Even after the
far, middle, and near distance ranges have been defined and the automatically controllable
directional speaker 1 has obtained the distance to the target person's face, it may not be
possible to judge immediately which distance range between the person and the video capture 11
that distance belongs to. The same fuzzy operation can therefore be used to infer, based on
rules of thumb, which distance range the face information belongs to. As shown in FIGS. 3 and
4A, the video capture 11 automatically detects the surrounding environment and generates video
information. When the computing unit 12 analyzes the video information and confirms that face
information is included, a person is present in the environment where the automatically
controllable directional speaker 1 is installed. The infrared focusing device 111 or the
ultrasonic focusing device then determines the distance to the face of the target person and the
direction of the face of the target person. If the selected mode of the automatically
controllable directional speaker 1 is the mode in which audio is output to a listener 2 at a
short distance, the fuzzy operation determines the membership value of the distance to the face
of the target person in each of the far, middle, and near fuzzy sets, which shows whether the
face information corresponds to a listener 2 at a relatively short distance. If so, the speaker
10 is driven and outputs audio to the listener 2 in the selected short-distance reproduction
mode. In this way, the automatically controllable directional speaker 1 can provide a variety of
more accurate audio output effects. In addition, the automatically controllable directional
speaker 1 can adjust the output amplitude level of the speaker 10 using the amplitude control
unit 14 based on the distance mode.
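The far, middle, and near distance modes can be sketched with overlapping fuzzy sets. The
breakpoints (1 m, 3 m, 5 m), the function names, and the rule "play only if the selected mode
has the highest membership" are assumptions for illustration; the patent leaves these choices to
the implementer.

```python
def falling(x, start, end):
    """1 up to `start`, then falls linearly to 0 at `end`."""
    if x <= start:
        return 1.0
    if x >= end:
        return 0.0
    return (end - x) / (end - start)


def rising(x, start, end):
    """0 up to `start`, then rises linearly to 1 at `end`."""
    return 1.0 - falling(x, start, end)


def triangle(x, left, peak, right):
    """Triangular membership with its maximum at `peak`."""
    return min(rising(x, left, peak), falling(x, peak, right))


def distance_memberships(distance_m):
    """Membership of a measured face distance in each distance mode."""
    return {
        "near": falling(distance_m, 1.0, 3.0),
        "middle": triangle(distance_m, 1.0, 3.0, 5.0),
        "far": rising(distance_m, 3.0, 5.0),
    }


def should_play(distance_m, selected_mode):
    """Drive the speaker only if the face best matches the selected mode."""
    memberships = distance_memberships(distance_m)
    return max(memberships, key=memberships.get) == selected_mode
```

For a face measured at 1.5 m, the memberships under these assumed breakpoints are 0.75 for
"near", 0.25 for "middle", and 0 for "far", so the speaker would play in the near mode but not
in the middle or far modes.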
[0028]
Similarly, direction classes can be determined by the method described above. The direction
control motor 13 can move the speaker 10 in the four directions up, down, left, and right
according to the face information. When a finer division is desired, the direction control motor
13 moves the speaker 10 in directions such as up, down, left, right, upper left, lower left,
upper right, and lower right. Further, when the division of directions is not clear-cut, so that
the direction of the face of the target person determined by the infrared focusing device 111 or
the ultrasonic focusing device does not make clear which direction the listener 2 is in, the
same fuzzy operation as in the distance judgment is used to calculate the membership value of
the face direction in the fuzzy set of each direction. The direction control motor 13 then moves
the speaker 10 to a better position, and the speaker outputs the sound.
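A sketch of how a fuzzy direction judgment might select one of the eight motor directions. The
representation of the face direction as a pan angle and a tilt angle, the 30 degree span, and
the "hold" output for a centered face are assumptions for illustration, not details from the
patent.

```python
def axis_memberships(angle_deg, span=30.0):
    """Membership of an angle in 'negative', 'center', and 'positive' sets."""
    return {
        "negative": min(1.0, max(0.0, -angle_deg / span)),
        "center": max(0.0, 1.0 - abs(angle_deg) / span),
        "positive": min(1.0, max(0.0, angle_deg / span)),
    }


def best_label(memberships):
    """Label with the highest membership value."""
    return max(memberships, key=memberships.get)


def motor_direction(pan_deg, tilt_deg):
    """Pick a move for the direction control motor.

    Convention assumed here: negative pan is left, negative tilt is down.
    """
    h = best_label(axis_memberships(pan_deg))
    v = best_label(axis_memberships(tilt_deg))
    if h == "center" and v == "center":
        return "hold"
    if v == "center":
        return {"negative": "left", "positive": "right"}[h]
    if h == "center":
        return {"negative": "down", "positive": "up"}[v]
    vertical = {"negative": "lower", "positive": "upper"}[v]
    horizontal = {"negative": "left", "positive": "right"}[h]
    return vertical + " " + horizontal
```

Because the sets overlap, a face slightly off axis still counts mostly as "center", which avoids
moving the motor back and forth in response to small measurement noise.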
[0029]
In addition, when multiple pieces of face information are detected at the same time in the
environment where the automatically controllable directional speaker 1 is installed, that is,
when multiple people are in the environment simultaneously, the setting mode can designate to
whom the automatically controllable directional speaker 1 outputs audio. For example, the number
of people may be used as the basis for setting the mode: the automatically controllable
directional speaker 1 is set to output audio toward the direction with many people or the
direction with few people. As shown in FIG. 4B, if the analysis by the computing unit 12 shows
that a plurality of people are in the environment in separate groups, and the number of people
in one place is larger than the number in another place, the automatically controllable
directional speaker 1 outputs audio toward the direction with many people or the direction with
few people, according to the setting. FIG. 4B illustrates a state in which the automatically
controllable directional speaker 1 is outputting audio toward the area where there are many
listeners 2. In another situation, when the analysis by the computing unit 12 shows that there
are a plurality of people in the environment at different distances from the automatically
controllable directional speaker 1, audio is output to one listener 2 or a plurality of
listeners 2 at a specific distance according to the distance determination method described
above.
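The "many people" and "few people" modes can be sketched by grouping detected faces into angular
sectors and aiming at the most or least populated one. The fixed 30 degree sector width, the
mode names, and the function name are assumptions for illustration; the patent does not say how
groups are formed.

```python
from collections import Counter


def pick_target_angle(face_angles_deg, mode="most", sector_width_deg=30.0):
    """Return the center angle of the sector to aim at, or None if no faces.

    `face_angles_deg` holds one horizontal angle per detected face.
    `mode` is "most" (aim at the largest group) or "fewest".
    """
    if mode not in ("most", "fewest"):
        raise ValueError("mode must be 'most' or 'fewest'")
    if not face_angles_deg:
        return None
    # Floor division assigns each face to a sector index, including negatives.
    counts = Counter(int(a // sector_width_deg) for a in face_angles_deg)
    chooser = max if mode == "most" else min
    sector = chooser(counts, key=counts.get)
    return (sector + 0.5) * sector_width_deg
```

With three faces around 40 degrees to the left and one face at 20 degrees to the right,
`mode="most"` aims at the left-hand group and `mode="fewest"` aims at the single listener.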
[0030]
Refer next to FIG. 5, a perspective view of another variant of the first embodiment of the
present invention. The environment in which the automatically controllable directional speaker 1
is installed may be relatively dark, in which case the brightness of the video information may
be insufficient and the determination may be difficult. For this reason, the present invention
may further include an LED auxiliary light 15 adjacent to the side of the video capture 11. A
sufficient amount of light can then be supplied to the environment even in such a situation, so
that the video information captured by the video capture 11 has sufficient brightness and
clarity. In addition, the video capture 11 may, for example, be attached to a pivoting tool 16
so that its shooting angle and range can be adjusted vertically and horizontally via the
pivoting tool 16. This expands the imaging range of the video capture 11 and makes it possible
to obtain wider image content as video information.
[0031]
The automatically controllable directional speaker 1 of the present invention may have a
movable wave guiding structure 17 attached to the audio output port of the speaker 10. The
movable wave guiding structure 17 includes an audio input port 171, an audio output port 172,
and an audio path 173 provided between the audio input port 171 and the audio output port 172,
and the relative aperture sizes of the audio input port 171 and the audio output port 172 can be
adjusted. Thus, the amplitude control unit 14 can first adjust the audio amplitude of the
speaker output in accordance with the distance to the face of the target person, and the
relative aperture between the audio input port 171 and the audio output port 172 can then be
adjusted. The audio amplitude of the speaker 10 output via the movable wave guiding structure 17
is thereby modified to provide a better listening experience to the listener.
[0032]
Refer next to the figure showing the three-dimensional schematic view of the second embodiment
of the present invention. Members and detailed features that are the same as in the first
embodiment are not described again here. The present embodiment discloses an LED lighting device
3 in which the automatically controllable directional speaker 1 as a whole forms a light bulb.
The LED lighting device 3 provides a lighting effect to the user, the video capture 11
automatically detects the environment around the LED lighting device 3 to generate video
information, the computing unit 12 analyzes whether face information is present, and the output
direction and amplitude of the sound of the speaker 10 are controlled based on the face
information. Having a specific person listen to the output sound of the speaker 10 improves the
practicality of the LED lighting device 3. The details of the automatically controllable
directional speaker 1 are the same as those of the first embodiment, and their description is
therefore omitted.
[0033]
Refer next to the figure showing a perspective view of another variant of the second embodiment
of the present invention. In the present embodiment, the LED lighting device 3 is a downlight
that can be attached to the ceiling of a room, so that the indoor lighting is provided with the
automatically controllable directional speaker 1. At the same time, because the automatically
controllable directional speaker 1 recognizes human faces, the speaker 10 can automatically
output audio to a specific person, which improves the practicality of the LED lighting device 3.
[0034]
As described above, the automatically controllable directional speaker according to the present
invention recognizes a person's face to judge whether a person is present in the surrounding
environment and, at the same time, outputs audio toward the place where the person is present,
thereby automatically steering the direction of the audio output. The video capture 11 detects
and captures the surrounding environment to generate video information, and the computing unit
12 then analyzes it for human faces to acquire face information. Furthermore, the infrared
focusing device 111 or the ultrasonic focusing device determines the distance to the face of the
target person and the direction of the face of the target person within the face information,
and these are used to control the direction and level of the audio output. The present invention
thus eliminates the drawback that a known speaker cannot adapt its audio output to the
listener's distance, and the user can obtain an optimal listening effect as needed. In addition,
various audio output modes can be selected according to the installation environment and the
user's needs, so that the output of the automatically controllable directional speaker 1 can be
controlled more accurately and on demand. Finally, the present invention can provide a lighting
effect by being combined and used integrally with a lamp, while also automatically detecting a
listener and outputting audio in that direction.
[0035]
DESCRIPTION OF SYMBOLS
1 Automatically controllable directional speaker
10 Speaker
11 Video capture
111 Infrared focusing device
12 Computing unit
13 Direction control motor
14 Amplitude control unit
15 LED auxiliary light
16 Pivoting tool
17 Movable wave guiding structure
171 Audio input port
172 Audio output port
173 Audio path
2 Listener
3 LED lighting device