US6072522A - Video conferencing apparatus for group video conferencing - Google Patents

Video conferencing apparatus for group video conferencing Download PDF

Info

Publication number
US6072522A
US6072522A US08/868,798 US86879897A US6072522A US 6072522 A US6072522 A US 6072522A US 86879897 A US86879897 A US 86879897A US 6072522 A US6072522 A US 6072522A
Authority
US
United States
Prior art keywords
audio
speaker
electronic
participants
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
US08/868,798
Inventor
Peter M. Ippolito
Caroline M. Cook
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CGC Designs
Original Assignee
CGC Designs
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CGC Designs filed Critical CGC Designs
Priority to US08/868,798 priority Critical patent/US6072522A/en
Application granted granted Critical
Publication of US6072522A publication Critical patent/US6072522A/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Definitions

  • This invention relates to a video conferencing apparatus that is optimally suited for application into a round table video conferencing environment.
  • the video conferencing apparatus that is described is comprised of a circular array of audio microphones and a video camera which is mounted onto a rotatable platform. Audio information captured by the circular array of microphones is processed to identify the azimuthal position of a principle speaker. The azimuthal position of the video camera is then electro-mechanically adjusted so as to accurately capture the image of the principle speaker.
  • Video conferencing is a popular means whereby individuals at physically distinct and remote locations are able to interact and exchange information through electronic means.
  • a typical video conferencing setup will include apparatus for the capture and playback of audio and video information, apparatus for the electronic exchange of this information among at least two physically distinct and remote locations, and apparatus for managing the exchange of this information in an orderly manner among the distinct loctations.
  • an audio microphone and speaker with their associated electronic circuitry will serve as the apparatus for the capture and playback, respectively, of the audio information; and similarly, a video camera and display with their associated electronic circuitry will serve as the apparatus for the capture and playback, respectively, of the video information.
  • the exchange of the audio and video information among the remote locations is typically carried across an electronic communications network which may be either analog or digital in nature, and the exchange of this information is typically managed by computerized equipment which has been programmed to assemble the various information into a virtual electronic meeting place.
  • the prior art contains many examples relating to the implementation of various apparatus which enable and facilitate video conferencing.
  • video conferencing environments there are two seperate and distinct types of video conferencing environments: single user video conferencing and group video conferencing.
  • each individual that is involved in the video conference will typically be positioned in front of a video conferencing apparatus which is dedicated to the capture and playback of audio and video information specific solely for that individual.
  • the video conferencing apparatus will capture and playback audio and video information from a group of individuals who are physically located within a same room.
  • the dynamics of the two separate types of video conferencing environments differ from each other substantially.
  • the dynamics of the single user video conferencing environment are generally well understood and therefore can be easily managed by most current art video conferencing apparatus.
  • a particular user is typically positioned before a video conferencing apparatus which is dedicated to the capture of audio and video information for that particular user.
  • the position of the camera is fixed, and it is the user's responsibility to be positioned within the camera's field of view.
  • the position of the microphone is typically also fixed and it is the user's reponsibility to then address the microphone correctly.
  • the video conference managing apparatus then manages the information from the various individuals participating in the video conference, but located at remote and distinct locations, such that the audio and the video information which represents each individual is then brought together into the virtual meeting place.
  • the dynamics of the group video conferencing environment are substantially more complex and difficult to manage.
  • the video conferencing apparatus must capture the audio and video information from a group of individuals who are located within a same room, and must present that information into the virtual meeting place in a manner that is natural and realistic.
  • the natural and realistic capture of audio information from the group environment is not problematic if the audio information is captured with omni-directional microphones, and if the audio information is then further processed using gain compensating electronics techniques. In this manner, good quality audio information can be captured into the virtula meeting place through the use of these well understood conventional means.
  • the capture of video information is more problematic.
  • the video information be captured naturally and realistically, and preferably in a manner which does not interfere with the natural group dynamics and group interactions of the individuals who are participating in the video conference.
  • current state of the art video conferencing apparatus is not capable of capturing the video information present in a group video conferencing environment in a manner that is natural and realistic and in a manner that does not impact the natural human dynamics and interactions of the local group.
  • the video camera is typically positioned before the group of participants, and, in order that a complete image of the various participants be captured, the location of the each participants is restricted to be within the field of view of the video camera.
  • the group of participants is made to act in many respects as a single participant with the natural dynamics of the local group interactions becoming unoviodably compromised, since now each member of the local group must face the video camera in order to be presented into the virtual meeting place in a naturalistic manner.
  • it is also problematic in this environment if a particalur individual member of the local group chooses not to face the video camera, since then the naturalness of the image of this paticular member as captured into the virtual meeting place is invariably compromised.
  • the second approach suggests the use of a mobile camera which can be operated to capture the specific image of the current designated speaker. With this second approach, it is possible to capture a more natural image of a group participant into a virtual meeting place. However this second approach necessitates the continuous manual operation of the video camera by one of the group members or by a dedicated camera operator, with both of these approaches again invariably imposing on and compromising the natural human dynamics of the group video conferencing environment.
  • the inventors propose the video conferencing apparatus for group video conferences that is described herein.
  • Two preferred embodiments of the invention are described by this specification. Both of the preferred embodiments of the invention are designed for ease of use and for ease of manufacture using current state of the art manufacturing processes. The design of both of the preferred embodiments described herein are also compatible with the current state of the art approaches and methodologies used to implement video conferences.
  • a video conferencing apparatus which is comprised of a video camera and of a circular array of audio microphones. Both the camera and the radial array of microphones are mounted onto an integral unit which is centrally located amidst those individuals participating in the local group video conferencing environment.
  • the azimuthal orientation of the camera is controlled by the audio information that is captured by the radial array of microphones. Audio information captured by the radial array of microphones is electronically processed such that a principle speaker is continuously identified based on a pre-programmed algorithm. With the principle speaker identified, the camera is then azimuthly positioned through electromechanical means such that the image of the principle speaker becomes accurately captured within the video camera's field of view.
  • a video conferencing apparatus which is comprised of a multiple number of video cameras and of a circular array of audio microphones.
  • the multiple video cameras are, like the microphones, also arranged into a radial array with each camera being dedicated to the capture of video information from within its own field of view.
  • audio information that is captured by the circular array of microphones is used to determine the approximate azymuthal location of a principle speaker. Once this location is identified, the appropriate video camera in the radial array of cameras is activated so as to capture the image of the recently identified speaker.
  • the second embodiment also allows for a further fine adjustment of the azymuthal positioning of the video camera through electromechanical means so that the image of the recently identified speaker may be more accurately captured into the activated camera's field of view.
  • the first embodiment offers the advantage of implementing the the desired function through the use of a single video camera, thus minimizing the cost of the implementation, whereas the second embodiment, while necessitaing a greater cost to implement, will implement the desired function in a manner which is potentially more responsive to the dynamics of the group video conferencing environment, and whose operation will be less ubiquitous during the course of the video conference.
  • both embodiments of the invention achieve the desired goals. Namely, both embodiments provide a means by which both audio and video information from a group video conferencing environment can be captured naturally and realistically and with a minimal disruption to the natural dynamics and interactions of the video conferencing group. Both embodiments provide a means for the natural and realistic capture of audio and video information with minimal continuous operator intervention. Finally, both embodiments demonstrate an inherently uncomplicated design which can be easily manufactured. A more detailed description of the preferred embodiments of the invention are provided by the insuing drawings and by the accompanying descriptions.
  • FIG. 1 shows a top plan view of a group video conferencing environment wherein a conventional group video conferencing apparatus is employed.
  • FIG. 2 shows a top plan view of a local group video conferencing environment wherein the first embodiment of the proposed group video conferencing apparatus is employed.
  • FIG. 3 shows a cross-sectional view of the local group video conferencing environment shown by FIG. 2 taken along line 3--3 of FIG. 2.
  • FIG. 4 shows a cross-sectional view taken along line 4--4 of FIG. 2 of the first embodiment of the proposed group video conferencing apparatus.
  • FIG. 5 shows a top plan view of the electronic circuit board of the first embodiment of the proposed group video conferencing apparatus.
  • FIG. 6 shows a block diagram depiction of the electronic componentry of the first embodiment of the group video conferencing apparatus.
  • FIG. 7 shows the general algorithm by which the video camera of the first embodiment is azimuthally positioned during the course of a group video conference.
  • FIG. 8 shows a top plan view of the second embodiment of the proposed group video conferencing apparatus.
  • FIG. 9 shows a cross-sectional view taken along line 9--9 of FIG. 8 of the second embodiment of the proposed group video conferencing apparatus.
  • FIG. 10 shows a top plan view of the electronic circuit board of the second embodiment of the proposed group video conferencing apparatus.
  • FIG. 11 shows a block diagram depiction of the electronic componentry of the second embodiment of the group video conferencing apparatus.
  • FIG. 12 shows the general algorithm by which a particular video camera of the second embodiment is activated and azimuthally positioned during the course of a group video conference.
  • FIG. 1 shows a symbolic top plan view of a group video conferencing environment which employs a conventional group video conferencing apparatus 10.
  • conventional group video conferencing apparatus 10 will consist of a video camera 11 for the capture of video information, a microphone 12 for the capture of audio information, a video display 13 to display video information from the remotely located participants of the video conference, an audio speaker 14 to provide audio from the remotely located participants of the video conference, a duplex unit 15 to enable the simultaneous exchange of audio information, and an electronic video conferencing management unit 16 that is connected into an electronic communications network 17.
  • a more detailed description of the operation of video conferencing management unit 16 lies beyond the scope of this specification, is not relevant to the claims of this specification, and so is not provided herein.
  • Video camera 11 connects into video conferencing management unit 16 by way of video camera cable 21, and the video output from video conferencing apparatus 10 connects into video display 13 by way of video display cable 23.
  • the audio output from audio microphone 12 and the audio input to audio speaker 14 are connected to duplex unit 15 by way of audio microphone cable 22 and audio speaker cable 24 respectively.
  • Duplex unit 15 is connected to video conferencing management unit 16 by way of duplex cable 25.
  • video conferencing apparatus 10 will typically be positioned at the head of a meeting table 18, and the local participants 19 involved in the video conference will typically be seated about meeting table 18.
  • Audio microphone 12 is positioned so as to capture the audio information that is produced by the various local participants 19 while audio speaker 14 is used to reproduce the audio information from the various remote participants in the video conference.
  • Duplex unit 15 performs the appropriate echo-cancellation functions so as to permit simultanueous full duplex conversation to take place betwen the local and the remote participants in the video conference. As with video conferencing managment unit 16, a more detailed description of the operation of duplex unit 15 lies beyond the scope of this specification, is not relevant to the claims of this specification and so is not provided herein.
  • Video camera 11 is typically positioned at one end of meeting table 18 so as to capture the video image of each local participant 19 that is seated about meeting table 18.
  • video display 13 is also positioned at one end of meeting table 18, usually next to video camera 11.
  • the fixed location of video camera 11 however imposes several restrictions on the natural and realistic quality of the video infromation that is captured into the virtual meeting place environment. For example, since the field of view of video camera 11 is limited, the location of each local participant 19 becomes similarly limited to being within this field of view. Additionally, each local participant 19 must face video camera 11 in order that his image be discernably captured into the virtual meeting place by video camera 11. For a large enough group of local particpants the danger will exist that the image of those local participants seated furthest from video camera 11 may be captured so as to not be discemable at all.
  • the fixed location of video camera 11 will also impose some unavoidable restrictions onto the natural human dynamics and interactions of the local video conferencing group.
  • the group dynamics will be such that each local participant 19 will interact with each other local participant 19, and also interact with the audio and video information that is provided by display 13 and speaker 14 of video conferencing apparatus 10.
  • display 13 and speaker 14 of video conferencing apparatus 10 Given these varied interactions, it becomes difficult for video camera 11 to realistically and naturally capture the various group dynamics from the local environment given its fixed position at the end of meeting table 11.
  • the requirement that each local participant 19 must face video camera 11 invariably also compromises and limits the local group interaction for that particular participant.
  • FIG. 2 a top plan view of a group video conferencing environment is shown wherein the first embodiment of the proposed video conferencing apparatus is employed.
  • first video conferencing apparatus 100 is placed onto and approximately at the center of meeting table 18, such that, during the course of the video conference, each local participant 19 in the video conference is seated about meeting table 18 so as to face first video conferencing apparatus 100 direcly.
  • first video conferencing apparatus 100 is used to capture both the audio and the video information from each local participant 19 in the local group video conferencing environment.
  • An audio speaker located onto the underside of first video conferencing apparatus 100 is used to reproduce audio information that is generated by the remote participants in the video conference.
  • Video display 13, which is located at one end of meeting table 19 is used to display the video information that is generated by the remote participants in the video conference.
  • video conferencing management unit 16 is used to manage the exchange of the various audio and video information over electronic communication network 17 and to appropriately manage and assemble this information into the virtual meeting place.
  • Duplex unit 15 is used to enable the full duplex simultaneous exchange of audio information among the local and the remote participants in the video conference.
  • Video display 13 connects into video conferencing management unit 16 by way of video display cable 23, and the video output from first video conferencing apparatus 100 connects into video conferencing management unit 16 by way of video camera cable 21.
  • the audio output and the audio input of first video conferencing apparatus 100 are connected to duplex unit 15 by way of audio speaker cable 24 and audio microphone cable 22 respectively, with duplex unit 15 then being connected to video conferencing management unit 16 by way of duplex cable 25.
  • first video conferencing apparatus 100 is similar to video conferencing apparatus 10, with the exception that the video capture, the audio capture, and the audio generation functions are integrated into a compact central module which can be centrally and unobtrusively located into the local group video conferencing environment so as to enable the more realistic capture and exchange of audio and video information during the course of the video conference.
  • first video conferencing apparatus 100 is comprised of a generally circular base unit 110 into which there is mounted a radial array of audio microphones 120 and also onto which is centrally mounted a video camera 130.
  • Each audio microphone 120 is of the type having strongly directional audio capture capability such that each audio microphone 120 will predominantly capture audio information which eminates from within the audio conic section 121 that is associated with each audio microphone 120.
  • video camera 130 which is directional by nature, will only capture video information that is present within the video field of view 131.
  • each audio microphone 120 will capture audio information that originates from within its corresponding audio conic section 121.
  • Electronic circuitry internal to novel video conferencing unit 100 then processes this audio information using a predefined algorithm to to identify the azymuthal position of that specific local participant 19 who is currently the primary speaker in the group of local particpants participating in the video conference.
  • video camera 130 is then azimuthally positioned in either the clockwise or the counter clockwise direction that is indicated by the azymuthal line of travel 132, with azimuthal line of travel 132 being essentially confined to a geometric plane which is parallel to the top surface of meeting table 18.
  • the azymuthal positioning of video camera 130 is realized by way of electromechanical means that are internal to first video conferencing apparatus 100 and which will azymuthally position video camera 130 such that the video image of the most currently identified principle speaker is brought into field of view 131 of video camera 130. In this manner then, the image of the principle speaker is captured by video camera 130 in a manner that is both natural and realistic and in a manner that is unobtrusive to the various dynamics and interactions that are ongoing in the local group video conferencing environment.
  • first video conferencing apparatus 100 For proper design, it is necessary that a sufficient number of audio microphones 120 are radially arranged onto first video conferencing apparatus 100 such that audio information can be adequately captured from the full 360 degrees of azymuthal span about first video conferencing apparatus 100.
  • first video conferencing apparatus 100 eight radially arranged audio microphones 120 are used, with each audio microphone 120 dedicated to the capture of audio information from a conic section which spans a 45 degrees of azimuth about first video conferencing apparatus 100. Using this arrangement, it is possible to azimuthally position video camera 120 to an accuracy of 45 degrees using a simple algorithm to process the audio information that is captured by the circular array of audio microphones 120.
  • a more accurate azymuthal positioning of video camera 130 is possible through the use of more sophisticated algorithms to process the audio information that is captured by the circular array of audio microphones and then subsequently interpolate a more accurate azymuthal position beyond an accuracy of 45 degrees.
  • first video conferencing apparatus 100 relies on a simple positioning algorithm to azymuthally position video camera 130 to the 45 degree accuracy that is obtainable by a simplified processing of the audio information that is captured by the circular array of eight audio microphones 120.
  • video camera 120 must have a field of view which is 45 degrees, or preferably greater, to capture a full image of the selected principle speaker.
  • FIG. 3 now shows a cross-sectional view of the group video conferencing environment depicted by FIG. 2 along line 3--3 of FIG. 2.
  • video camera 130 is typically positioned and tilted at an appropriate verticle angle 133 relative to the top surface of meeting table 18 such that an an adequate image of local participant 19 can be adequately captured into the virtual meeting place.
  • verticle angle 133 at which video camera 130 is tilted is initially set through manual means at the start of the video conference, and, once set, will remain so fixed throughout the course of the video-conference.
  • FIG. 4 now shows a cross-sectional view taken along line 4--4 of FIG. 2 of first video conferencing apparatus 100 in more detail.
  • first video coferencing apparatus 100 is comprised of the previously described base unit 110, a multiple number of audio microphones 120, and video camera 130.
  • the other principle components which also comprise first video conferencing apparatus 100 are a video camera positioning assembly 140, a first electronic circuit board 150, and an audio speaker 160.
  • base unit 110 is comprised of a circular base platform 111, an annular convex enclosure 112, and a circular platform 115.
  • Base platform 111 is a flat circular panel onto which most of the various components comprising novel video conferencing apparatus 100 are affixed.
  • the outer peripheri of annular convex enclosure 112 is affixed to the outer peripheri of circular base platform 111, with annular convex enclosure 112 rising upwards above the upper surface of circular base platform 111 to define an enclosed circular space for housing the majority of the components which comprise first video conferencing apparatus 100.
  • Circular base platform 111 is supported above the height of meeting table 18 by supporting posts 113 which are rigidly and and peripherally affixed to the lower surface of circular base platform 111.
  • Supporting posts 113 are of a sufficient height so as to define a space beneath circular base platform 111 which is adequate to contain audio speaker 160.
  • Audio speaker 160 is mounted onto the lower surface of circular base platform 111 with mounting posts 114.
  • mounting posts 114 should be formed of an accoustically insulative material so as to provide some degree of accoustical isolation between audio speaker 160 and circular base platform 111.
  • the audio signals produced by audio speaker 160 will emanate from the underside of novel video conferencing apparatus 100 while first video conferencing apparatus 100 is operational during the course of a video conference.
  • the inner periferi of convex annular enclosure 112 defines a circular opening into which is located circular platform 115.
  • Video camera 130 is mounted concentrically onto the upper surface of circular platform 1 15 by means of pivot joint 116.
  • Pivot joint 116 is rigidly affixed to the upper surface of circular platform 115, and novel video camera 130 is mounted onto pivot joint 116 in a manner that permits novel video camera 130 to be pivoted and fixed at the desired verticle angle 134.
  • Circular platform is mounted onto and positioned by the video camera positioning assembly 140, the operation of which is described subsequent to the description of first circuit board 150.
  • First circuit board 150 is a donut shaped electronic circuit board which is mounted onto the upper surface of circular base platform 111 and which contains the various electronic components and circuitry necessary for the proper operation of first video conferencing apparatus 100.
  • Each audio microphone 120 is mounted onto first circuit board 150 so as protrude out to the exterior of first video conferencing apparatus 100 through openings 117 that are formed into the structure of convex annular enclosure 112.
  • Video wire assembly 135 electrically connects the video signal from video camera 130 to video output connector 136.
  • Video output connector 136 feeds the video signal to the exterior of circular base platform 111 and connects the video signal into video camera cable 21.
  • Video wire assembly 131 enters into the interior of first video conferencing apparatus 100 through a first hole 118 formed into the structure of circular platform 115, and, for best operation it is preferable to incorporate sufficient slack into video wire assembly 135 such that circular platform 115 is free to rotate unimpeded.
  • Microphone wire assembly 122 is used to connect the composite microphone signal generated by first circuit board 150 to microphone output connector 123.
  • the composite microphone signal is the summed and amplified output of each audio microphone 120 that is located onto first circuit board 150 and is the audio input signal that first video conferencing apparatus 100 provides to duplex unit 15.
  • the composite microphone signal is connected to duplex unit 15 by way of audio microphone cable 22.
  • the audio output signal that is provided by duplex unit 15 is connected to first video conferencing apparatus 100 by audio speaker cable 23.
  • Audio speaker cable 23 connects to first video conferencing apparatus 100 at speaker input connector 161, and speaker wire assembly 162 is then used to connect the audio output signal from speaker input connector 161 to audio speaker 160.
  • the azimuthal orientation of video camera 130 is controlled by the workings of video camera positioning assembly 140, wherein video camera positioning assembly 140 is comprised of a platform motor 141, a verticle support rod 142, and a rotational positional indicator 143.
  • Platform motor 141 is rigidly and concentrically affixed to the top surface of circular base platform 111 in a manner that preferably provides a maximum degree of vibrational isolation between platform motor 141 and circular base platform 111.
  • Platform motor 141 is also rigidly affixed to circular platform 115 by way of verticle support rod 142, wherein the lower end of verticle support rod 142 is concentrically affixed to the rotor shaft 144 of platform motor 141, and the upper end of verticle support rod 142 is concentrically affixed to the lower surface of circular platform 115.
  • Video camera 130 is affixed onto the upper surface of circular platform 115 through pivot joint 115 wherein pivot joint 115 allows video camera 130 to be vertically pivoted to a desired angle of tilt.
  • the azimuthal orientation of video camera 130 is controlled by platform motor 141 whereby platform motor is capable of rotating in either a clockwise or a clockwise direction.
  • Rotational position indicator 143 is formed as an electrically conductive rod whose electrical potential is held at an electrical ground potential as indicated by schematic ground symbol 145.
  • the upper end of rotational position indicator 143 is affixed to the lower surface of circular platform 115 and hence will rotate azimuthally in keeping with the azimuthal orientation of circular platform 115.
  • the lower end of rotational position indicator 143 is made to come into contact with the upper surface of circuit board 150 and is arranged so as to swipe along the upper surface of circuit board 150 as circular platform 115 rotates azimuthally.
  • the azimuthal position of circular platform 115 is then sensed by a circular array of eight conductive pads 151 which are formed as exposed conductive patterns affixed onto the top surface of circuit board assembly 150.
  • Each respective conductive pad 151 is located so as to be in the path of the circular orbit of rotational position indicator 143 and located so as to make electrical contact with the lower end of rotational position indicator 143 once the lower end of rotational position indicator 143 swipes over each respective conductve pad 151.
  • a number of eight conductive pads 151 are used, thus providing an azimuthal position accuracy of 45 degrees for the azimuthal positioning of video camera 130.
  • the use of eight conductive pads 151 also permits the use of a simplified positioning algorithm wherein the azimuthal orientation of video camera 130 is directly correlated to the magnitude and the duration of the audio signal that is captured by each of the eight audio microphones 120 that are mounted onto first circuit board 150.
  • each conductive pad 150 is biased at a value above ground potential through a resistive element that is connected between a positive electric potential and each conductive pad 151.
  • rotational position indicator 143 makes contact with a specific conductive pad 151, the electrical potential of the contacted conductive pad 151 is then forced to the same ground potential that rotational position indicator 143 is biased to.
  • This change in the electrical potential of the contacted conductive pad 151 is sensed by electronic circuitry that is mounted onto first circuit board 150, and in this manner, the azimuthal orientation of rotational position indicator 143, and hence of circular platform 115, is then ascertained.
  • FIG. 5 now shows a top plan view of first circuit board 150 wherein the radial array of audio microphones 120 and the radial array of conductive pads 152 are both indicated.
  • the radial array of audio microphones 120 are best located at the outer peripheri of electronic circuit board 160 so that each audio microphone 120 is best positioned to capture and the audio information from the local group video conferencing environment.
  • the radial array of conductive pads 151 are best located at the inner peripheri of electronic circuit board 150 near to the location where platform motor 140 is mounted. When so located, conductive pads 151 will then be optimally positioned so as to easily make contact with the lower tip of rotational position indicator 143 as it swipes across the surface of first circuit board 150.
  • first circuit board 150 Also located onto first circuit board 150 but not shown by FIG. 5 are the various electronic circuitry and components that are necessary to implement the various electronic functions necessary for the proper operation of first video conferencing apparatus 100.
  • the detailed design of this electronic circuitry is arbitrary as there are numerous possible methods for the detailed implementation of the necessary functions.
  • FIG. 6 a broad description of the necessary electronic functions is subsequently provided by FIG. 6 for completeness.
  • FIG. 6 shows a block diagram depiction of the electronic circuitry 170 that is required for proper operation of first video conferencing apparatus 100.
  • the respective electrical audio signal 180 from each of the eight audio microphones 120 is input into a respective dedicated audio signal amplifier 171 which amplifies audio signal 180 to produce amplified audio signal 181.
  • Each respective amplified audio signal 181 is then fed into a respective audio signal rectifier 172 which electrically rectifies amplified audio signal 181 to produce a respective rectified audio signal 182.
  • Each respective rectified audio signal 182 is then fed to a respective audio signal integrator 173 each of which generates a respective averaged audio signal 183.
  • Each averaged audio signal 183 is a time averaged value of the respective rectified audio signals 182, wherein the integrating time constant of each audio signal integrator 173 is chosen so as to be appropriate for integrating the characteristics of human speech for the purposes of identifying the principle speaker from among the group of individuals participating in the local group video conference. Typically an integrating time constant which is of the order of 10-20 seconds will be appropriate for identifying the type of speech activity which is appropriately loud and appropriately sustained, as would be characteristic of the speech pattern of a principle speaker in the local group video conferencing environment.
  • Each respective averaged audio signal 183 is fed into a respective level comparator 174 which quantizes the average audio signal 183 into a respective logical ⁇ 0 ⁇ or a logical ⁇ 1 ⁇ digital signal 184 determined by whether the magnitude of average audio signal 183 is less than or greater than a preset threshold.
  • a preset threshold For stability, it is desirable that some degree of hysteresis is incorporated into the transfer characteristics of level comparator 174 such that the input threshold for declaring a logical ⁇ 1 ⁇ is chosen to be greater than the input threshold for declaring a logical ⁇ 0 ⁇ . This is a commonly accepted technique for minimizing oscillatory behavior in comparator circuitry.
  • control logic 175 is programmed to identify the principle speaker based on a preset algorithm, and once control logic 175 has made an identification of the principle speaker, the azimuthal positioning of video camera 130 is adjusted so as to capture the image of the principle speaker.
  • control logic 175 monitors the bias state of each conductive pad 151 and is thus able assertain the azimuthal position of circular platform 115, and hence of video camera 130. If video camera 130 needs to be repositioned so as to point to a newly identified principle speaker, control logic 175 will activate either the rotate clockwise output 185 or the rotate counter clockwise output 186 as required.
  • the rotate clockwise output 185 and the rotate counter clockwise output 186 are both inputs to the platform motor drive circuitry 176 wherein platform motor drive circuitry 176 supplies the correct bias voltage to platform motor 141 so as to bring about a clockwise or a counter clockwise rotation of circular platform 115. If both rotate clockwise output 176 and rotate counter clockwise output 177 are inactive, then platform motor 141 and hence circular platform 115 remain stationary.
  • rotational position indicator 143 is correspondingly made to swipe across the radial array of conductive pads 151.
  • each conductive pad 151 is biased through a respective biasing resistor 152 to an electric potential that is higher than electric ground.
  • Each respective biasing resistor 152 is connected between a respective conductive pad 151 and a positive non-ground electric potential Vplus 153.
  • control logic 175 will continue to request a rotation of platform motor 141 until an electrical ground potential is sensed on the specific conductive pad 151 which correlates to the desired azimuthal positioning of video camera 130.
  • the other electronic circuitry and components which are located onto circuit board 150 are the power supply and the composite audio output circuitry 177.
  • the power supply circuitry is neccessary for providing the necessary electric operating voltages to the various electronic components which have been described, but for conciseness, this power supply circuitry is not indicated by FIG. 6.
  • composite audio circuitry 177 which is responsible for generating an output composite audio signal 187 which is the summation of the each audio signal 180 from each respective audio microphone 120.
  • composite audio signal 187 represents the captured audio information from the local group video conferencing environment which is then subsequently provided to duplex unit 15.
  • First control algorithm 190 is shown by FIG. 7 in a general block diagram format and is presented herein as a possible algorithm which may be implemented for controlling the functioning of first video conferencing apparatus 100. It should be apparent to those skilled in the art that differing control algorithms may also be implemented, within the context of the components described by this specification, which would achieve similar results; and conversely, it would also be possible to apply control algorithm 190 to an implementation of components which differs from the particular implementation that comprises first video conferencing apparatus 100.
  • first control algorithm 190 begins with power-on reset step 191 wherein first video conferencing apparatus 100 is first powered on and activated. At this point in the operation of first video conferencing apparatus 100, a current principle speaker, CPS, is undetermined and is therefore unassigned. It is in new speaker detection step 192, that a new principle speaker, NPS, is identified. In first video conferencing apparatus 100 a detection of the NPS is made by way of a particular level comparator 174.
  • a particular digital signal 184 changes state from a logical ⁇ 0 ⁇ to a logical ⁇ 1 ⁇ , this indicates that an audio signal characteristic to that of a principle speaker has been captured by the audio microphone 120 that is respective to the particular digital signal 184 that is at a logical ⁇ 1 ⁇ .
  • new speaker affirmation step 193 wherein first control algorithm 190 insures that the CPS is inactive prior to designating the NPS to be the new CPS.
  • the CPS is determined to be inactive if the particular digital signal 184 respective to the particular audio microphone 120 which corresponds to the azimuthal location of the CPS has returned to a logical ⁇ 0 ⁇ level.
  • first control algorithm 190 continues to new speaker assignment step 194 wherein the NPS is assigned to be the new CPS. Else first control algorithm returns again to new speaker detection step 192. Once a new CPS assignment has been made, first control algorithm 190 will then adjust the azimuthal position of video camera 130 through the workings of camera positionong assembly 140 so that the video image of the newly designated CPS is captured by video camera 130.
  • FIG. 8 now shows a top plan view of a group video conferencing environment wherein a second embodiment of the invention is employed.
  • second video conferencing apparatus 200 like first video conferencing apparatus 100, is also placed onto and approximately at the center of meeting table 18. Again, each local participant 19 in the video conference is seated about meeting table 18 so as to face second video conferencing apparatus 200 directly.
  • Second video conferencing apparatus 200 shares many features in common with first video conferencing apparatus 100. Like first video conferencing apparatus 100, second video conferencing apparatus 200 is similarly comprised of a generally circular base unit 110 into which there is mounted a radial array of audio microphones 120 each having a directional audio signal capture characteristic. The audio information that is captured by each audio microphone 120 is electronically processed so as to identify the azimuthal orientation of the principle speaker from among the group of participants. Once the principle speaker has been identified, the electronic circuitry that is located within second video conferencing apparatus 200 will act to capture the video image of the principle speaker by activating the appropriate video camera from among the group of four video cameras 130 which are mounted in a circular array onto circular platform 115.
  • Second video conferencing apparatus 200 principally differs from first video conferencing apparatus 100 by employing a number of four video cameras 130 that are mounted onto second video conferencing apparatus 200 in a circular array. The video cameras are arranged so as to capture video information from the full azimuthal span about second video conferencing apparatus 200. Once a new principle speaker has been identified by second video conferencing apparatus 200, the appropriate video camera is activiated so as to capture the image of the new principle speaker. In addition, second video conferencing appartus 200 then also has the ability to center the the image of the new principle speaker into the field of view the activated video camera by subsequently mechanically adjusting the azimuthal orientation of the activated video camera.
  • the video signal output from second video conferencing apparatus 200 is also connected to video conferencing management unit 16 by way of video camera cable 21.
  • the audio output and the audio input of second video conferencing apparatus 100 are connected to duplex unit 15 by way of audio speaker cable 24 and audio microphone cable 22 respectively, with duplex unit 15 then being connected to video conferencing management unit 16 by way of duplex cable 25.
  • Video display 13 connects into video conferencing management unit 16 by way of video display cable 23, and video conferencing managment unit manages the exchange of the various audio and video information necessary for the video conference over electronic communication network 17.
  • first video conferencing apparatus 100 will be preferred when a low cost implementation of the video conferencing functions is desired, and second video conferencing apparatus 200 will be preferred when a higher degree of functionality is required.
  • FIG. 9 now shows a detailed cross sectional view of second video conferencing apparatus 200 along line 9--9 that is shown in FIG. 8. Again many of the functional elements found in first video conferencing apparatus 100 are also found in second video conferencing apparatus 200.
  • Second video conferencing apparatus 200 is similarly comprised of base unit 110, a multiple number of audio microphones 120, video camera positioning assembly 140, audio speaker 160, and a second electronic circuit board 250. Each video camera 130 is electrically connected to second electronic circuit board 250 with a respective video wire assembly 135.
  • Each video wire assembly 135 enters into the interior of second video conferencing apparatus 200 through a first hole 118 formed into the structure of circular platform 115, and, for best operation it is preferable to incorporate sufficient slack into each video wire assembly 135 such that the circular platform 115 is free to rotate unimpeded.
  • the video signal originating from the currently active video camera is electronically multiplexed onto second video wire assembly 235 which connects the multiplexed video image from second electronic circuit board 250 to video output connector 136.
  • Video output connector 136 then feeds the multiplexed video signal out to the exterior of second video conferencing apparatus 200 and connects the video signal into video camera cable 21.
  • Microphone wire assembly 122 is used to connect the composite microphone signal generated by second circuit board assembly 250 to microphone output connector 123.
  • the composite microphone signal is the summed and amplified output of each audio microphone 120 that is located onto second circuit board assembly 250 and is the audio input signal that second video conferencing apparatus 200 provides to duplex unit 15.
  • the composite microphone signal is connected to duplex unit 15 by way of audio microphone cable 22.
  • the audio output generated by duplex unit 15 is connected to second video conferencing apparatus 200 by audio speaker cable 23.
  • Audio speaker cable 23 connects to second video conferencing apparatus 200 at speaker input connector 161, and speaker wire assembly 162 is then used to connect the audio output signal from speaker input connector 161 to audio speaker 160.
  • the basic theory of operation which governs the functioning first video conferencing apparatus 100 also applies to the the functioning of second video conferencing apparatus 200 with the exception that second video conferencing apparatus 200 makes use of the added capability provided by having an array of four video cameras 130.
  • audio information that is captured by the circular array of audio microphones 120 is processed by electronic circuitry that is located onto second electronic circuit board 250 and the azimuthal location of the principle speaker is identified based on a preset algorithm. Once the azimuthal location of the principle speaker is identified, the appropriate video camera 130 in the radial array of video cameras is activated so as to capture the image of the recently identified speaker within the field of view of the activated camera.
  • the image of the principle speaker is then centered within the field of view of the activated video camera by rotating video platform 115 in a clockwise or in a counter clockwise direction by activating a corresponding clockwise or counterclockwise rotation of platform motor 141.
  • Platform motor 141 is deactivated once rotational position indicator 140 indicates that a rotation to the desired azymuthal orientation has been acheived.
  • second video conferencing apparatus 200 is able to capture the image of a newly identified principle speaker more quickly and with less mechanical activity and noise than is possible using first video conferencing apparatus 100.
  • FIG. 10 shows a top plan view of second electronic circuit board 250 wherein the radial array of audio microphones 120, and a pair of conductive pads 151 are indicated.
  • conductive pads 151 are positioned onto the top surface of second electronic circuit board 250 so as to make contact with the lower tip of rotational position indicator 143 as it swipes across the top surface of second electronic circuit board 250.
  • second video conferencing apparatus 200 requires only two conductive pads 151 to control the electromechanical azimuthal positioning of circular platform 115.
  • the reponsibility for capturing video images in the complete 360 azimuthal span in second video conferencing apparatus 200 is shared among the four video cameras 130, and each video camera 130 is assigned a coverage which spans a designated azimuthal range of 90 degrees.
  • the radial array of eight audio microphones 120 provide the capability to accurately identify the azimuthal position of a principle speaker with an azimuthal accuracy of 45 degrees using simple audio signal processing techniques.
  • the radial array of audio microphones 120 allows for an azimuthal positioning accuracy of 45 degrees if a simple non-interpolative algorithm is used to process the audio information that is captured by each audio microphone 120.
  • second electronic circuit board 250 The various other electronic components that are also located onto second electronic circuit board 250 are not shown by FIG. 10. As with first video conferencing apparatus 100, the detailed design of this electronic circuitry is arbitrary and there are numerous possible approaches for the detailed implementation of the necessary functions. However, as it is the object of this specification to teach the correct working of second video conferencing apparatus 200, a broad description of the necessary electronic functions is subsequently provided for completeness.
  • FIG. 11 shows a block diagram depiction of the second electronic circuitry 270 that implements the required electronic functions for second video conferencing apparatus 200.
  • the purpose and functionings of each audio signal amplifier 171, each audio signal rectifier 172, each audio signal integrator 173, and each level comparator 174 is the same as the implementation for electronic circuit 170 and so will not be described again repetitiously for second electronic circuitry 270.
  • the functioning of second control logic 275 however differs from the functioning of control logic 175 due to the added complexity of having to control the activation of one of four seperate video cameras 130.
  • second control logic 275 will arbitrate the outputs of each level comparator 174 to determine which audio channel is currently active so as to determine the azimuthal position of the current principle speaker.
  • second control logic 275 identifies the azimuthal position of the current speaker based on a pre-programmed algorithm, the appropriate video camera 130 is activated so as to capture the image of the principle speaker, and the video signal from the activated video camera is multiplexed onto second video wire assembly 235 by video multiplexer 276. Second control logic 275 also monitors the bias state of each conductive pad 151 and thus can assertain the current azimuthal position of circular platform 115. If the image of the principle speaker then needs to be centered within the field of view of the active video camera, second control logic 275 will activate either the rotate clockwise output 185 or the rotate counter clockwise output 186 as appropriate. Second control logic 275 will continue to reqeust a rotation of platform motor 141 until an electrical ground potential is sensed on the desired conductive pad 151 which correlates to the desired azimuthal positioning of circular platform 115.
  • Second positioning algorithm 290 is presented by FIG. 12 in a general block diagram form and is presented herein as a possible algorithm which may be implemented for controlling the functionings of second video conferencing apparatus 200. It should be apparent to those skilled in the art that differing algorithms may also be implemented, within the context of the components described by this specification, which would achieve similar results; and conversely it would also be possible to apply algorithm 290 to an implementation of components differing from the particular implementation of second video conferencing apparatus 200.
  • first control algorithm 290 begins with power-on reset step 291 wherein second video conferencing apparatus 200 is first powered on and activated. At this point in the operation of second video conferencing apparatus 200, a current principle speaker, CPS, is undetermined and is therefore unassigned. It is in new speaker detection step 292, that a new principle speaker, NPS, is identified. In second video conferencing apparatus 200 a detection of the NPS is made by way of a particular level comparator 174.
  • a particular digital signal 184 changes state from a logical ⁇ 0 ⁇ to a logical ⁇ 1 ⁇ , this indicates that an audio signal characteristic to that of a principle speaker has been captured by the audio microphone 120 that is respective to the particular digital signal 184 that is at a logical ⁇ 1 ⁇ .
  • new speaker affirmation step 293 wherein second control algorithm 290 insures that the CPS is inactive prior to designating the NPS to be the new CPS.
  • the CPS is determined to be inactive if the particular digital signal 184 respective to the particular audio microphone 120 which corresponds to the azimuthal location of the CPS has returned to a logical ⁇ 0 ⁇ level.
  • second control algorithm 290 continues to new speaker assignment step 294 wherein the NPS is assigned to be the new CPS. Else first control algorithm returns again to new speaker detection step 292. Once a new CPS assignment has been made, second control algorithm 290 will then to camera activation step 295 wherein the appropriate video camera 130 is activated to capture the image of the newly designated CPS. Second control algorithm 290 then advances to a subsequent step, camera positioning step 296 wherein the azimuthal position of the activated video camera 130 is appropriately adjusted through the workings of camera positioning assembly 140.
  • a digital microprocessor based method could concievably be employed to digitally process a digitized representation of the audio information that is captured by each audio microphone 110, and so determine the azymuthal position of the principle speaker through fully digital means, as opposed to the combined analog and digital means that are described by this specification.

Abstract

A group video conferencing apparatus for the purposes of facilitating a video conference involving a group of participants which are azymuthly located about said apparatus is described. Means are are provided for the identification of a principle speaker and for the positioning of the video camera so as to capture the image of the principle speaker. Identification of the azymuthal orientation of the principle speaker is realized through the electronic processing of audio signals generated by the group of participants, and the azymuthal positioning of the video camera is adjusted through electromechanical means so as to capture the image of the identified principle speaker.

Description

CROSS-REFERENCES TO RELATED APPLICATIONS
U.S. Pat. No. 3,958,084 May 1976 Nicholas
U.S. Pat. No. 4,054,908 October 1977 Poirier et al.
U.S. Pat. No. 4,267,593 May 1981 Regan et al.
U.S. Pat. No. 4,449,238 May 1984 Lee et al.
U.S. Pat. No. 5,003,532 March 1991 Ashida et al.
U.S. Pat. No. 5,117,285 May 1992 Nelson et al.
U.S. Pat. No. 5,347,306 September 1994 Nitta
U.S. Pat. No. 5,473,367 December 1995 Bales et al.
U.S. Pat. No. 5,479,203 December 1995 Kawai et al.
FIELD OF THE INVENTION
This invention relates to a video conferencing apparatus that is optimally suited for application into a round table video conferencing environment. The video conferencing apparatus that is described is comprised of a circular array of audio microphones and a video camera which is mounted onto a rotatable platform. Audio information captured by the circular array of microphones is processed to identify the azimuthal position of a principle speaker. The azimuthal position of the video camera is then electro-mechanically adjusted so as to accurately capture the image of the principle speaker.
DESCRIPTION OF THE PRIOR ART
Video conferencing is a popular means whereby individuals at physically distinct and remote locations are able to interact and exchange information through electronic means. A typical video conferencing setup will include apparatus for the capture and playback of audio and video information, apparatus for the electronic exchange of this information among at least two physically distinct and remote locations, and apparatus for managing the exchange of this information in an orderly manner among the distinct loctations. In practice, an audio microphone and speaker with their associated electronic circuitry will serve as the apparatus for the capture and playback, respectively, of the audio information; and similarly, a video camera and display with their associated electronic circuitry will serve as the apparatus for the capture and playback, respectively, of the video information. The exchange of the audio and video information among the remote locations is typically carried across an electronic communications network which may be either analog or digital in nature, and the exchange of this information is typically managed by computerized equipment which has been programmed to assemble the various information into a virtual electronic meeting place.
The prior art contains many examples relating to the implementation of various apparatus which enable and facilitate video conferencing. In general, there are two seperate and distinct types of video conferencing environments: single user video conferencing and group video conferencing. In a single user video conferencing environment, each individual that is involved in the video conference will typically be positioned in front of a video conferencing apparatus which is dedicated to the capture and playback of audio and video information specific solely for that individual. In a group video conferencing environment, the video conferencing apparatus will capture and playback audio and video information from a group of individuals who are physically located within a same room.
The dynamics of the two separate types of video conferencing environments differ from each other substantially. The dynamics of the single user video conferencing environment are generally well understood and therefore can be easily managed by most current art video conferencing apparatus. In the single user environment, a particular user is typically positioned before a video conferencing apparatus which is dedicated to the capture of audio and video information for that particular user. In the single user environment the position of the camera is fixed, and it is the user's responsibility to be positioned within the camera's field of view. Similarly, the position of the microphone is typically also fixed and it is the user's reponsibility to then address the microphone correctly. Once the single user audio and video information is captured, the video conference managing apparatus then manages the information from the various individuals participating in the video conference, but located at remote and distinct locations, such that the audio and the video information which represents each individual is then brought together into the virtual meeting place.
In contrast to the single user environment, the dynamics of the group video conferencing environment are substantially more complex and difficult to manage. In the group environment, the video conferencing apparatus must capture the audio and video information from a group of individuals who are located within a same room, and must present that information into the virtual meeting place in a manner that is natural and realistic. Typically the natural and realistic capture of audio information from the group environment is not problematic if the audio information is captured with omni-directional microphones, and if the audio information is then further processed using gain compensating electronics techniques. In this manner, good quality audio information can be captured into the virtula meeting place through the use of these well understood conventional means.
In a group video conferencing environment however the capture of video information is more problematic. In the group environment it is desirable that the video information be captured naturally and realistically, and preferably in a manner which does not interfere with the natural group dynamics and group interactions of the individuals who are participating in the video conference. For the most part, current state of the art video conferencing apparatus is not capable of capturing the video information present in a group video conferencing environment in a manner that is natural and realistic and in a manner that does not impact the natural human dynamics and interactions of the local group.
In current state of the art video conferencing apparatus, the video camera is typically positioned before the group of participants, and, in order that a complete image of the various participants be captured, the location of the each participants is restricted to be within the field of view of the video camera. Using this approach then the group of participants is made to act in many respects as a single participant with the natural dynamics of the local group interactions becoming unoviodably compromised, since now each member of the local group must face the video camera in order to be presented into the virtual meeting place in a naturalistic manner. Conversely, it is also problematic in this environment if a particalur individual member of the local group chooses not to face the video camera, since then the naturalness of the image of this paticular member as captured into the virtual meeting place is invariably compromised.
In order to provide an improved method for the the natural and realistic capture of video information in a group video conferencing environment, other implementations have proposed the use of fixed multiple video cameras so as to enable the capturing of video information from a multiplicity of differing angles, or the use of mobile video cameras to capture video information from the current active speaker in the group. In general, both of these alternate approaches also possess undesired shortcomings. The first approach involving the use of multiple fixed cameras necessitates a high equipment costs for the multiple cameras and for the required support electronics. Regardless of the additional cameras however, this approach still falls short of capturing each participant's image in a natural and realistic manner since the participants remain confined to the fields of view of the specific cameras. The second approach suggests the use of a mobile camera which can be operated to capture the specific image of the current designated speaker. With this second approach, it is possible to capture a more natural image of a group participant into a virtual meeting place. However this second approach necessitates the continuous manual operation of the video camera by one of the group members or by a dedicated camera operator, with both of these approaches again invariably imposing on and compromising the natural human dynamics of the group video conferencing environment.
SUMMARY OF THE INVENTION
Accordingly then, it is desirable to implement a video conferencing apparatus for specific application to a group video conferencing environment which overcomes the shortcomings of the various prior art devices. In particular, the following objects and advantages of a video conferencing apparatus for application to group video conferencing are desirable:
(a) to realize a video conferencing apparatus which is relatively uncomplicated in its design and therefore relatively inexpensive to manufacture,
(b) to realize a video conferencing apparatus which will capture natural and realistic audio and video information from a group video conferencing environment,
(c) to realize a video conferencing apparatus which will not interfere with the natural human dynamics which exist in a group video conferencing environment,
(d) to realize a video conferencing apparatus which will automatically capture audio and video information from a group video conferencing environment without the need for manual supervision by a group member or by a dedicated operator.
Therefore, in keeping with the above stated objectives the inventors propose the video conferencing apparatus for group video conferences that is described herein. Two preferred embodiments of the invention are described by this specification. Both of the preferred embodiments of the invention are designed for ease of use and for ease of manufacture using current state of the art manufacturing processes. The design of both of the preferred embodiments described herein are also compatible with the current state of the art approaches and methodologies used to implement video conferences.
In the first preferred embodiment of the invention, a video conferencing apparatus is described which is comprised of a video camera and of a circular array of audio microphones. Both the camera and the radial array of microphones are mounted onto an integral unit which is centrally located amidst those individuals participating in the local group video conferencing environment. The azimuthal orientation of the camera is controlled by the audio information that is captured by the radial array of microphones. Audio information captured by the radial array of microphones is electronically processed such that a principle speaker is continuously identified based on a pre-programmed algorithm. With the principle speaker identified, the camera is then azimuthly positioned through electromechanical means such that the image of the principle speaker becomes accurately captured within the video camera's field of view.
In the second preferred embodiment of the invention, a video conferencing apparatus is described which is comprised of a multiple number of video cameras and of a circular array of audio microphones. In this second embodiment, the multiple video cameras are, like the microphones, also arranged into a radial array with each camera being dedicated to the capture of video information from within its own field of view. As in the first embodiment, audio information that is captured by the circular array of microphones is used to determine the approximate azymuthal location of a principle speaker. Once this location is identified, the appropriate video camera in the radial array of cameras is activated so as to capture the image of the recently identified speaker. The second embodiment also allows for a further fine adjustment of the azymuthal positioning of the video camera through electromechanical means so that the image of the recently identified speaker may be more accurately captured into the activated camera's field of view.
In comparing the first with the second embodiment, the first embodiment offers the advantage of implementing the the desired function through the use of a single video camera, thus minimizing the cost of the implementation, whereas the second embodiment, while necessitaing a greater cost to implement, will implement the desired function in a manner which is potentially more responsive to the dynamics of the group video conferencing environment, and whose operation will be less ubiquitous during the course of the video conference.
Both embodiments of the invention achieve the desired goals. Namely, both embodiments provide a means by which both audio and video information from a group video conferencing environment can be captured naturally and realistically and with a minimal disruption to the natural dynamics and interactions of the video conferencing group. Both embodiments provide a means for the natural and realistic capture of audio and video information with minimal continuous operator intervention. Finally, both embodiments demonstrate an inherently uncomplicated design which can be easily manufactured. A more detailed description of the preferred embodiments of the invention are provided by the insuing drawings and by the accompanying descriptions.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 shows a top plan view of a group video conferencing environment wherein a conventional group video conferencing apparatus is employed.
FIG. 2 shows a top plan view of a local group video conferencing environment wherein the first embodiment of the proposed group video conferencing apparatus is employed.
FIG. 3 shows a cross-sectional view of the local group video conferencing environment shown by FIG. 2 taken along line 3--3 of FIG. 2.
FIG. 4 shows a cross-sectional view taken along line 4--4 of FIG. 2 of the first embodiment of the proposed group video conferencing apparatus.
FIG. 5 shows a top plan view of the electronic circuit board of the first embodiment of the proposed group video conferencing apparatus.
FIG. 6 shows a block diagram depiction of the electronic componentry of the first embodiment of the group video conferencing apparatus.
FIG. 7 shows the general algorithm by which the video camera of the first embodiment is azimuthally positioned during the course of a group video conference.
FIG. 8 shows a top plan view of the second embodiment of the proposed group video conferencing apparatus.
FIG. 9 shows a cross-sectional view taken along line 9--9 of FIG. 8 of the second embodiment of the proposed group video conferencing apparatus.
FIG. 10 shows a top plan view of the electronic circuit board of the second embodiment of the proposed group video conferencing apparatus.
FIG. 11 shows a block diagram depiction of the electronic componentry of the second embodiment of the group video conferencing apparatus.
FIG. 12 shows the general algorithm by which a particular video camera of the second embodiment is activated and azimuthally positioned during the course of a group video conference.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
Referring now to the drawings, FIG. 1 shows a symbolic top plan view of a group video conferencing environment which employs a conventional group video conferencing apparatus 10. Typically conventional group video conferencing apparatus 10 will consist of a video camera 11 for the capture of video information, a microphone 12 for the capture of audio information, a video display 13 to display video information from the remotely located participants of the video conference, an audio speaker 14 to provide audio from the remotely located participants of the video conference, a duplex unit 15 to enable the simultaneous exchange of audio information, and an electronic video conferencing management unit 16 that is connected into an electronic communications network 17. It is the function of video conferencing management unit 16 to manage the exchange of the various audio and video information over electronic communication network 17 and to appropriately manage and assemble this information into the virtual meeting place. A more detailed description of the operation of video conferencing management unit 16 lies beyond the scope of this specification, is not relevant to the claims of this specification, and so is not provided herein.
The disparate units are interconnected by way of various cables. Video camera 11 connects into video conferencing management unit 16 by way of video camera cable 21, and the video output from video conferencing apparatus 10 connects into video display 13 by way of video display cable 23. The audio output from audio microphone 12 and the audio input to audio speaker 14 are connected to duplex unit 15 by way of audio microphone cable 22 and audio speaker cable 24 respectively. Duplex unit 15 is connected to video conferencing management unit 16 by way of duplex cable 25.
In a group video conferencing environment, video conferencing apparatus 10 will typically be positioned at the head of a meeting table 18, and the local participants 19 involved in the video conference will typically be seated about meeting table 18.
Audio microphone 12 is positioned so as to capture the audio information that is produced by the various local participants 19 while audio speaker 14 is used to reproduce the audio information from the various remote participants in the video conference. Duplex unit 15 performs the appropriate echo-cancellation functions so as to permit simultanueous full duplex conversation to take place betwen the local and the remote participants in the video conference. As with video conferencing managment unit 16, a more detailed description of the operation of duplex unit 15 lies beyond the scope of this specification, is not relevant to the claims of this specification and so is not provided herein.
Video camera 11 is typically positioned at one end of meeting table 18 so as to capture the video image of each local participant 19 that is seated about meeting table 18. Similarly, video display 13 is also positioned at one end of meeting table 18, usually next to video camera 11. Using this arrangement, the video information required for the video conference can be easily presented to and captured from the group of local participants 19. The fixed location of video camera 11 however imposes several restrictions on the natural and realistic quality of the video infromation that is captured into the virtual meeting place environment. For example, since the field of view of video camera 11 is limited, the location of each local participant 19 becomes similarly limited to being within this field of view. Additionally, each local participant 19 must face video camera 11 in order that his image be discernably captured into the virtual meeting place by video camera 11. For a large enough group of local particpants the danger will exist that the image of those local participants seated furthest from video camera 11 may be captured so as to not be discemable at all.
The fixed location of video camera 11 will also impose some unavoidable restrictions onto the natural human dynamics and interactions of the local video conferencing group. For example, within the local group video conferencing environment, the group dynamics will be such that each local participant 19 will interact with each other local participant 19, and also interact with the audio and video information that is provided by display 13 and speaker 14 of video conferencing apparatus 10. Given these varied interactions, it becomes difficult for video camera 11 to realistically and naturally capture the various group dynamics from the local environment given its fixed position at the end of meeting table 11. Similarly, the requirement that each local participant 19 must face video camera 11 invariably also compromises and limits the local group interaction for that particular participant.
Referring now to FIG. 2, a top plan view of a group video conferencing environment is shown wherein the first embodiment of the proposed video conferencing apparatus is employed. As FIG. 2 indicates, first video conferencing apparatus 100 is placed onto and approximately at the center of meeting table 18, such that, during the course of the video conference, each local participant 19 in the video conference is seated about meeting table 18 so as to face first video conferencing apparatus 100 direcly.
During the course of a video conference first video conferencing apparatus 100 is used to capture both the audio and the video information from each local participant 19 in the local group video conferencing environment. An audio speaker located onto the underside of first video conferencing apparatus 100 is used to reproduce audio information that is generated by the remote participants in the video conference. Video display 13, which is located at one end of meeting table 19 is used to display the video information that is generated by the remote participants in the video conference.
As with video conferencing apparatus 10, video conferencing management unit 16 is used to manage the exchange of the various audio and video information over electronic communication network 17 and to appropriately manage and assemble this information into the virtual meeting place. Duplex unit 15 is used to enable the full duplex simultaneous exchange of audio information among the local and the remote participants in the video conference. Video display 13 connects into video conferencing management unit 16 by way of video display cable 23, and the video output from first video conferencing apparatus 100 connects into video conferencing management unit 16 by way of video camera cable 21. The audio output and the audio input of first video conferencing apparatus 100 are connected to duplex unit 15 by way of audio speaker cable 24 and audio microphone cable 22 respectively, with duplex unit 15 then being connected to video conferencing management unit 16 by way of duplex cable 25. Thus, in many respects, first video conferencing apparatus 100 is similar to video conferencing apparatus 10, with the exception that the video capture, the audio capture, and the audio generation functions are integrated into a compact central module which can be centrally and unobtrusively located into the local group video conferencing environment so as to enable the more realistic capture and exchange of audio and video information during the course of the video conference.
Describing first video conferencing apparatus 100 now in more detail, first video conferencing apparatus 100 is comprised of a generally circular base unit 110 into which there is mounted a radial array of audio microphones 120 and also onto which is centrally mounted a video camera 130. Each audio microphone 120 is of the type having strongly directional audio capture capability such that each audio microphone 120 will predominantly capture audio information which eminates from within the audio conic section 121 that is associated with each audio microphone 120. Similarly, video camera 130, which is directional by nature, will only capture video information that is present within the video field of view 131.
During the course of the group video conference, each audio microphone 120 will capture audio information that originates from within its corresponding audio conic section 121. Electronic circuitry internal to novel video conferencing unit 100 then processes this audio information using a predefined algorithm to to identify the azymuthal position of that specific local participant 19 who is currently the primary speaker in the group of local particpants participating in the video conference. Once the principle speaker in the local group of participants is identified, video camera 130 is then azimuthally positioned in either the clockwise or the counter clockwise direction that is indicated by the azymuthal line of travel 132, with azimuthal line of travel 132 being essentially confined to a geometric plane which is parallel to the top surface of meeting table 18. The azymuthal positioning of video camera 130 is realized by way of electromechanical means that are internal to first video conferencing apparatus 100 and which will azymuthally position video camera 130 such that the video image of the most currently identified principle speaker is brought into field of view 131 of video camera 130. In this manner then, the image of the principle speaker is captured by video camera 130 in a manner that is both natural and realistic and in a manner that is unobtrusive to the various dynamics and interactions that are ongoing in the local group video conferencing environment.
For proper design, it is necessary that a sufficient number of audio microphones 120 are radially arranged onto first video conferencing apparatus 100 such that audio information can be adequately captured from the full 360 degrees of azymuthal span about first video conferencing apparatus 100. In the design of first video conferencing apparatus 100, eight radially arranged audio microphones 120 are used, with each audio microphone 120 dedicated to the capture of audio information from a conic section which spans a 45 degrees of azimuth about first video conferencing apparatus 100. Using this arrangement, it is possible to azimuthally position video camera 120 to an accuracy of 45 degrees using a simple algorithm to process the audio information that is captured by the circular array of audio microphones 120. A more accurate azymuthal positioning of video camera 130 is possible through the use of more sophisticated algorithms to process the audio information that is captured by the circular array of audio microphones and then subsequently interpolate a more accurate azymuthal position beyond an accuracy of 45 degrees. As described herein however, first video conferencing apparatus 100 relies on a simple positioning algorithm to azymuthally position video camera 130 to the 45 degree accuracy that is obtainable by a simplified processing of the audio information that is captured by the circular array of eight audio microphones 120. Under this approach then, video camera 120 must have a field of view which is 45 degrees, or preferably greater, to capture a full image of the selected principle speaker.
FIG. 3 now shows a cross-sectional view of the group video conferencing environment depicted by FIG. 2 along line 3--3 of FIG. 2. As FIG. 3 indicates, video camera 130 is typically positioned and tilted at an appropriate verticle angle 133 relative to the top surface of meeting table 18 such that an an adequate image of local participant 19 can be adequately captured into the virtual meeting place. In general, verticle angle 133 at which video camera 130 is tilted is initially set through manual means at the start of the video conference, and, once set, will remain so fixed throughout the course of the video-conference. Since the majority of the participants in the local group video conference environment are typically seated at a similar height relative to the height of meeting table 18, there will not be a need to provide any automatic re-adjustment capability of vertical angle 133 provided that the verticle field of view 134 of novel video camera 130 is sufficient to capture a correct image of the selected principle speaker.
FIG. 4 now shows a cross-sectional view taken along line 4--4 of FIG. 2 of first video conferencing apparatus 100 in more detail. As FIG. 4 shows, first video coferencing apparatus 100 is comprised of the previously described base unit 110, a multiple number of audio microphones 120, and video camera 130. The other principle components which also comprise first video conferencing apparatus 100 are a video camera positioning assembly 140, a first electronic circuit board 150, and an audio speaker 160.
As FIG. 4 indicates, base unit 110 is comprised of a circular base platform 111, an annular convex enclosure 112, and a circular platform 115. Base platform 111 is a flat circular panel onto which most of the various components comprising novel video conferencing apparatus 100 are affixed. The outer peripheri of annular convex enclosure 112 is affixed to the outer peripheri of circular base platform 111, with annular convex enclosure 112 rising upwards above the upper surface of circular base platform 111 to define an enclosed circular space for housing the majority of the components which comprise first video conferencing apparatus 100. Circular base platform 111 is supported above the height of meeting table 18 by supporting posts 113 which are rigidly and and peripherally affixed to the lower surface of circular base platform 111. Supporting posts 113 are of a sufficient height so as to define a space beneath circular base platform 111 which is adequate to contain audio speaker 160. Audio speaker 160 is mounted onto the lower surface of circular base platform 111 with mounting posts 114. Preferably, mounting posts 114 should be formed of an accoustically insulative material so as to provide some degree of accoustical isolation between audio speaker 160 and circular base platform 111. The audio signals produced by audio speaker 160 will emanate from the underside of novel video conferencing apparatus 100 while first video conferencing apparatus 100 is operational during the course of a video conference.
The inner periferi of convex annular enclosure 112 defines a circular opening into which is located circular platform 115. Video camera 130 is mounted concentrically onto the upper surface of circular platform 1 15 by means of pivot joint 116. Pivot joint 116 is rigidly affixed to the upper surface of circular platform 115, and novel video camera 130 is mounted onto pivot joint 116 in a manner that permits novel video camera 130 to be pivoted and fixed at the desired verticle angle 134. Circular platform is mounted onto and positioned by the video camera positioning assembly 140, the operation of which is described subsequent to the description of first circuit board 150.
First circuit board 150 is a donut shaped electronic circuit board which is mounted onto the upper surface of circular base platform 111 and which contains the various electronic components and circuitry necessary for the proper operation of first video conferencing apparatus 100. Each audio microphone 120 is mounted onto first circuit board 150 so as protrude out to the exterior of first video conferencing apparatus 100 through openings 117 that are formed into the structure of convex annular enclosure 112. Video wire assembly 135 electrically connects the video signal from video camera 130 to video output connector 136. Video output connector 136 feeds the video signal to the exterior of circular base platform 111 and connects the video signal into video camera cable 21. Video wire assembly 131 enters into the interior of first video conferencing apparatus 100 through a first hole 118 formed into the structure of circular platform 115, and, for best operation it is preferable to incorporate sufficient slack into video wire assembly 135 such that circular platform 115 is free to rotate unimpeded.
Microphone wire assembly 122 is used to connect the composite microphone signal generated by first circuit board 150 to microphone output connector 123. The composite microphone signal is the summed and amplified output of each audio microphone 120 that is located onto first circuit board 150 and is the audio input signal that first video conferencing apparatus 100 provides to duplex unit 15. The composite microphone signal is connected to duplex unit 15 by way of audio microphone cable 22. Similarly, the audio output signal that is provided by duplex unit 15 is connected to first video conferencing apparatus 100 by audio speaker cable 23. Audio speaker cable 23 connects to first video conferencing apparatus 100 at speaker input connector 161, and speaker wire assembly 162 is then used to connect the audio output signal from speaker input connector 161 to audio speaker 160.
The azimuthal orientation of video camera 130 is controlled by the workings of video camera positioning assembly 140, wherein video camera positioning assembly 140 is comprised of a platform motor 141, a verticle support rod 142, and a rotational positional indicator 143.
Platform motor 141 is rigidly and concentrically affixed to the top surface of circular base platform 111 in a manner that preferably provides a maximum degree of vibrational isolation between platform motor 141 and circular base platform 111. Platform motor 141 is also rigidly affixed to circular platform 115 by way of verticle support rod 142, wherein the lower end of verticle support rod 142 is concentrically affixed to the rotor shaft 144 of platform motor 141, and the upper end of verticle support rod 142 is concentrically affixed to the lower surface of circular platform 115. Video camera 130 is affixed onto the upper surface of circular platform 115 through pivot joint 115 wherein pivot joint 115 allows video camera 130 to be vertically pivoted to a desired angle of tilt. The azimuthal orientation of video camera 130 is controlled by platform motor 141 whereby platform motor is capable of rotating in either a clockwise or a clockwise direction.
The azimuthal orientation of circular platform 115, and hence of novel video camera 130, is sensed through the use of rotational position indicator 143. Rotational position indicator 143 is formed as an electrically conductive rod whose electrical potential is held at an electrical ground potential as indicated by schematic ground symbol 145. The upper end of rotational position indicator 143 is affixed to the lower surface of circular platform 115 and hence will rotate azimuthally in keeping with the azimuthal orientation of circular platform 115. The lower end of rotational position indicator 143 is made to come into contact with the upper surface of circuit board 150 and is arranged so as to swipe along the upper surface of circuit board 150 as circular platform 115 rotates azimuthally. With this arrangement, the azimuthal position of circular platform 115 is then sensed by a circular array of eight conductive pads 151 which are formed as exposed conductive patterns affixed onto the top surface of circuit board assembly 150. Each respective conductive pad 151 is located so as to be in the path of the circular orbit of rotational position indicator 143 and located so as to make electrical contact with the lower end of rotational position indicator 143 once the lower end of rotational position indicator 143 swipes over each respective conductve pad 151. For the design of first video conferencing apparatus 100, a number of eight conductive pads 151 are used, thus providing an azimuthal position accuracy of 45 degrees for the azimuthal positioning of video camera 130. The use of eight conductive pads 151 also permits the use of a simplified positioning algorithm wherein the azimuthal orientation of video camera 130 is directly correlated to the magnitude and the duration of the audio signal that is captured by each of the eight audio microphones 120 that are mounted onto first circuit board 150.
Normally the electrical potential of each conductive pad 150 is biased at a value above ground potential through a resistive element that is connected between a positive electric potential and each conductive pad 151. Once rotational position indicator 143 makes contact with a specific conductive pad 151, the electrical potential of the contacted conductive pad 151 is then forced to the same ground potential that rotational position indicator 143 is biased to. This change in the electrical potential of the contacted conductive pad 151 is sensed by electronic circuitry that is mounted onto first circuit board 150, and in this manner, the azimuthal orientation of rotational position indicator 143, and hence of circular platform 115, is then ascertained.
FIG. 5 now shows a top plan view of first circuit board 150 wherein the radial array of audio microphones 120 and the radial array of conductive pads 152 are both indicated. For proper design, the radial array of audio microphones 120 are best located at the outer peripheri of electronic circuit board 160 so that each audio microphone 120 is best positioned to capture and the audio information from the local group video conferencing environment. Also for proper design, the radial array of conductive pads 151 are best located at the inner peripheri of electronic circuit board 150 near to the location where platform motor 140 is mounted. When so located, conductive pads 151 will then be optimally positioned so as to easily make contact with the lower tip of rotational position indicator 143 as it swipes across the surface of first circuit board 150.
Also located onto first circuit board 150 but not shown by FIG. 5 are the various electronic circuitry and components that are necessary to implement the various electronic functions necessary for the proper operation of first video conferencing apparatus 100. In general, the detailed design of this electronic circuitry is arbitrary as there are numerous possible methods for the detailed implementation of the necessary functions. However, as it is the object of this specification to teach the correct working of first video conferencing apparatus 100, a broad description of the necessary electronic functions is subsequently provided by FIG. 6 for completeness.
FIG. 6 shows a block diagram depiction of the electronic circuitry 170 that is required for proper operation of first video conferencing apparatus 100. As FIG. 5 indicates, the respective electrical audio signal 180 from each of the eight audio microphones 120 is input into a respective dedicated audio signal amplifier 171 which amplifies audio signal 180 to produce amplified audio signal 181. Each respective amplified audio signal 181 is then fed into a respective audio signal rectifier 172 which electrically rectifies amplified audio signal 181 to produce a respective rectified audio signal 182. Each respective rectified audio signal 182 is then fed to a respective audio signal integrator 173 each of which generates a respective averaged audio signal 183. Each averaged audio signal 183 is a time averaged value of the respective rectified audio signals 182, wherein the integrating time constant of each audio signal integrator 173 is chosen so as to be appropriate for integrating the characteristics of human speech for the purposes of identifying the principle speaker from among the group of individuals participating in the local group video conference. Typically an integrating time constant which is of the order of 10-20 seconds will be appropriate for identifying the type of speech activity which is appropriately loud and appropriately sustained, as would be characteristic of the speech pattern of a principle speaker in the local group video conferencing environment.
Each respective averaged audio signal 183 is fed into a respective level comparator 174 which quantizes the average audio signal 183 into a respective logical `0` or a logical `1` digital signal 184 determined by whether the magnitude of average audio signal 183 is less than or greater than a preset threshold. For stability, it is desirable that some degree of hysteresis is incorporated into the transfer characteristics of level comparator 174 such that the input threshold for declaring a logical `1` is chosen to be greater than the input threshold for declaring a logical `0`. This is a commonly accepted technique for minimizing oscillatory behavior in comparator circuitry.
The quantized outputs of each level comparator 174 are then provided to a block of control logic 175 which is used to arbitrate the logical outputs of each level comparator 174 to determine which audio channel is currently active as the principle speaker. Control logic 175 is programmed to identify the principle speaker based on a preset algorithm, and once control logic 175 has made an identification of the principle speaker, the azimuthal positioning of video camera 130 is adjusted so as to capture the image of the principle speaker.
In order to azimuthally position video camera 130, control logic 175 monitors the bias state of each conductive pad 151 and is thus able assertain the azimuthal position of circular platform 115, and hence of video camera 130. If video camera 130 needs to be repositioned so as to point to a newly identified principle speaker, control logic 175 will activate either the rotate clockwise output 185 or the rotate counter clockwise output 186 as required. The rotate clockwise output 185 and the rotate counter clockwise output 186 are both inputs to the platform motor drive circuitry 176 wherein platform motor drive circuitry 176 supplies the correct bias voltage to platform motor 141 so as to bring about a clockwise or a counter clockwise rotation of circular platform 115. If both rotate clockwise output 176 and rotate counter clockwise output 177 are inactive, then platform motor 141 and hence circular platform 115 remain stationary.
As platform motor 141 is made to rotate, rotational position indicator 143 is correspondingly made to swipe across the radial array of conductive pads 151. Normally, each conductive pad 151 is biased through a respective biasing resistor 152 to an electric potential that is higher than electric ground. Each respective biasing resistor 152 is connected between a respective conductive pad 151 and a positive non-ground electric potential Vplus 153. As rotational position indicator 143 makes contact with a specific conductive pad 151 the electrical potential of that particular conductive pad 151 will be forced to electrical ground, and this change in the electrical bias of the particular conductive pad 151 is then sensed by control logic 175. During the period wherein video camera 130 is azimuthally repositioned, control logic 175 will continue to request a rotation of platform motor 141 until an electrical ground potential is sensed on the specific conductive pad 151 which correlates to the desired azimuthal positioning of video camera 130.
The other electronic circuitry and components which are located onto circuit board 150 are the power supply and the composite audio output circuitry 177. The power supply circuitry is neccessary for providing the necessary electric operating voltages to the various electronic components which have been described, but for conciseness, this power supply circuitry is not indicated by FIG. 6. Indicated by FIG. 6 however is composite audio circuitry 177, which is responsible for generating an output composite audio signal 187 which is the summation of the each audio signal 180 from each respective audio microphone 120. As previously described, composite audio signal 187 represents the captured audio information from the local group video conferencing environment which is then subsequently provided to duplex unit 15.
The logical control algorithm which is programmed into control logic 175 is shown by FIG. 7. First control algorithm 190 is shown by FIG. 7 in a general block diagram format and is presented herein as a possible algorithm which may be implemented for controlling the functioning of first video conferencing apparatus 100. It should be apparent to those skilled in the art that differing control algorithms may also be implemented, within the context of the components described by this specification, which would achieve similar results; and conversely, it would also be possible to apply control algorithm 190 to an implementation of components which differs from the particular implementation that comprises first video conferencing apparatus 100.
As FIG. 7 indicates, first control algorithm 190 begins with power-on reset step 191 wherein first video conferencing apparatus 100 is first powered on and activated. At this point in the operation of first video conferencing apparatus 100, a current principle speaker, CPS, is undetermined and is therefore unassigned. It is in new speaker detection step 192, that a new principle speaker, NPS, is identified. In first video conferencing apparatus 100 a detection of the NPS is made by way of a particular level comparator 174. When a particular digital signal 184 changes state from a logical `0` to a logical `1`, this indicates that an audio signal characteristic to that of a principle speaker has been captured by the audio microphone 120 that is respective to the particular digital signal 184 that is at a logical `1`. Following new speaker detection step 192 is new speaker affirmation step 193, wherein first control algorithm 190 insures that the CPS is inactive prior to designating the NPS to be the new CPS. In first video conferencing apparatus 100, the CPS is determined to be inactive if the particular digital signal 184 respective to the particular audio microphone 120 which corresponds to the azimuthal location of the CPS has returned to a logical `0` level. If the CPS is indeed inactive, then first control algorithm 190 continues to new speaker assignment step 194 wherein the NPS is assigned to be the new CPS. Else first control algorithm returns again to new speaker detection step 192. Once a new CPS assignment has been made, first control algorithm 190 will then adjust the azimuthal position of video camera 130 through the workings of camera positionong assembly 140 so that the video image of the newly designated CPS is captured by video camera 130.
FIG. 8 now shows a top plan view of a group video conferencing environment wherein a second embodiment of the invention is employed. As FIG. 8 indicates, second video conferencing apparatus 200, like first video conferencing apparatus 100, is also placed onto and approximately at the center of meeting table 18. Again, each local participant 19 in the video conference is seated about meeting table 18 so as to face second video conferencing apparatus 200 directly.
Second video conferencing apparatus 200 shares many features in common with first video conferencing apparatus 100. Like first video conferencing apparatus 100, second video conferencing apparatus 200 is similarly comprised of a generally circular base unit 110 into which there is mounted a radial array of audio microphones 120 each having a directional audio signal capture characteristic. The audio information that is captured by each audio microphone 120 is electronically processed so as to identify the azimuthal orientation of the principle speaker from among the group of participants. Once the principle speaker has been identified, the electronic circuitry that is located within second video conferencing apparatus 200 will act to capture the video image of the principle speaker by activating the appropriate video camera from among the group of four video cameras 130 which are mounted in a circular array onto circular platform 115.
Second video conferencing apparatus 200 principally differs from first video conferencing apparatus 100 by employing a number of four video cameras 130 that are mounted onto second video conferencing apparatus 200 in a circular array. The video cameras are arranged so as to capture video information from the full azimuthal span about second video conferencing apparatus 200. Once a new principle speaker has been identified by second video conferencing apparatus 200, the appropriate video camera is activiated so as to capture the image of the new principle speaker. In addition, second video conferencing appartus 200 then also has the ability to center the the image of the new principle speaker into the field of view the activated video camera by subsequently mechanically adjusting the azimuthal orientation of the activated video camera.
Like first video conferencing apparatus 100, the video signal output from second video conferencing apparatus 200 is also connected to video conferencing management unit 16 by way of video camera cable 21. Similarly, the audio output and the audio input of second video conferencing apparatus 100 are connected to duplex unit 15 by way of audio speaker cable 24 and audio microphone cable 22 respectively, with duplex unit 15 then being connected to video conferencing management unit 16 by way of duplex cable 25. Video display 13 connects into video conferencing management unit 16 by way of video display cable 23, and video conferencing managment unit manages the exchange of the various audio and video information necessary for the video conference over electronic communication network 17.
The use of multiple video cameras for the capture of video information from the local video conferencing environment offers second video conferencing apparatus 200 the advantage of being able to capture the video image of a newly identified principle speaker more quickly and with less electromechanical activity and noise than would be possible with a single video camera approach. However, the advantages provided by a multiple video camera approach are realized, of course, at the expense of the greater component cost associated with having a multiple number of video cameras. Thus, it is envisioned that first video conferencing apparatus 100 will be preferred when a low cost implementation of the video conferencing functions is desired, and second video conferencing apparatus 200 will be preferred when a higher degree of functionality is required.
FIG. 9 now shows a detailed cross sectional view of second video conferencing apparatus 200 along line 9--9 that is shown in FIG. 8. Again many of the functional elements found in first video conferencing apparatus 100 are also found in second video conferencing apparatus 200. Second video conferencing apparatus 200 is similarly comprised of base unit 110, a multiple number of audio microphones 120, video camera positioning assembly 140, audio speaker 160, and a second electronic circuit board 250. Each video camera 130 is electrically connected to second electronic circuit board 250 with a respective video wire assembly 135. Each video wire assembly 135 enters into the interior of second video conferencing apparatus 200 through a first hole 118 formed into the structure of circular platform 115, and, for best operation it is preferable to incorporate sufficient slack into each video wire assembly 135 such that the circular platform 115 is free to rotate unimpeded.
The video signal originating from the currently active video camera is electronically multiplexed onto second video wire assembly 235 which connects the multiplexed video image from second electronic circuit board 250 to video output connector 136. Video output connector 136 then feeds the multiplexed video signal out to the exterior of second video conferencing apparatus 200 and connects the video signal into video camera cable 21.
Microphone wire assembly 122 is used to connect the composite microphone signal generated by second circuit board assembly 250 to microphone output connector 123. The composite microphone signal is the summed and amplified output of each audio microphone 120 that is located onto second circuit board assembly 250 and is the audio input signal that second video conferencing apparatus 200 provides to duplex unit 15. The composite microphone signal is connected to duplex unit 15 by way of audio microphone cable 22. Similarly, the audio output generated by duplex unit 15 is connected to second video conferencing apparatus 200 by audio speaker cable 23. Audio speaker cable 23 connects to second video conferencing apparatus 200 at speaker input connector 161, and speaker wire assembly 162 is then used to connect the audio output signal from speaker input connector 161 to audio speaker 160.
The basic theory of operation which governs the functioning first video conferencing apparatus 100 also applies to the the functioning of second video conferencing apparatus 200 with the exception that second video conferencing apparatus 200 makes use of the added capability provided by having an array of four video cameras 130. As in the first embodiment, audio information that is captured by the circular array of audio microphones 120 is processed by electronic circuitry that is located onto second electronic circuit board 250 and the azimuthal location of the principle speaker is identified based on a preset algorithm. Once the azimuthal location of the principle speaker is identified, the appropriate video camera 130 in the radial array of video cameras is activated so as to capture the image of the recently identified speaker within the field of view of the activated camera. The image of the principle speaker is then centered within the field of view of the activated video camera by rotating video platform 115 in a clockwise or in a counter clockwise direction by activating a corresponding clockwise or counterclockwise rotation of platform motor 141. Platform motor 141 is deactivated once rotational position indicator 140 indicates that a rotation to the desired azymuthal orientation has been acheived. In this manner then, second video conferencing apparatus 200 is able to capture the image of a newly identified principle speaker more quickly and with less mechanical activity and noise than is possible using first video conferencing apparatus 100.
The electronic circuitry for amplifying the audio signals from each audio microphone 120, for identifying a principle speaker, and for controlling and positioning platform motor 120 is located onto second electronic circuit board 250. FIG. 10 shows a top plan view of second electronic circuit board 250 wherein the radial array of audio microphones 120, and a pair of conductive pads 151 are indicated. As in the design of first electronic circuit board 150, conductive pads 151 are positioned onto the top surface of second electronic circuit board 250 so as to make contact with the lower tip of rotational position indicator 143 as it swipes across the top surface of second electronic circuit board 250. Unlike first video conferencing apparatus 100, second video conferencing apparatus 200 requires only two conductive pads 151 to control the electromechanical azimuthal positioning of circular platform 115. The reponsibility for capturing video images in the complete 360 azimuthal span in second video conferencing apparatus 200 is shared among the four video cameras 130, and each video camera 130 is assigned a coverage which spans a designated azimuthal range of 90 degrees. The radial array of eight audio microphones 120 provide the capability to accurately identify the azimuthal position of a principle speaker with an azimuthal accuracy of 45 degrees using simple audio signal processing techniques. Thus, when a particular video camera 130 is activated, an image that is within the specific 90 degree azimuthal span that is designated to the activated camera can be captured, and the image can then be centered into into the field of view of the activated camera to an azymuthal accuracy of 45 degrees through the electromechanical positioning capability that is provided by rotational positioning system 140 and the pair of conductive pads 151 . As with first video conferencing apparatus 100, the radial array of audio microphones 120 allows for an azimuthal positioning accuracy of 45 degrees if a simple non-interpolative algorithm is used to process the audio information that is captured by each audio microphone 120. If an interpolative algorithm were employed to process the audio information captured by each audio microphne 120 so as to thereby achieve an azimuthal positioning accuracy greater than 45 degrees, then an appropriately greater number of conductive pads 151 or video cameras 130 would be required to properly support this greater degree of accuracy.
The various other electronic components that are also located onto second electronic circuit board 250 are not shown by FIG. 10. As with first video conferencing apparatus 100, the detailed design of this electronic circuitry is arbitrary and there are numerous possible approaches for the detailed implementation of the necessary functions. However, as it is the object of this specification to teach the correct working of second video conferencing apparatus 200, a broad description of the necessary electronic functions is subsequently provided for completeness.
FIG. 11 shows a block diagram depiction of the second electronic circuitry 270 that implements the required electronic functions for second video conferencing apparatus 200. The purpose and functionings of each audio signal amplifier 171, each audio signal rectifier 172, each audio signal integrator 173, and each level comparator 174 is the same as the implementation for electronic circuit 170 and so will not be described again repetitiously for second electronic circuitry 270. The functioning of second control logic 275 however differs from the functioning of control logic 175 due to the added complexity of having to control the activation of one of four seperate video cameras 130. Like control logic 175, second control logic 275 will arbitrate the outputs of each level comparator 174 to determine which audio channel is currently active so as to determine the azimuthal position of the current principle speaker. Once second control logic 275 identifies the azimuthal position of the current speaker based on a pre-programmed algorithm, the appropriate video camera 130 is activated so as to capture the image of the principle speaker, and the video signal from the activated video camera is multiplexed onto second video wire assembly 235 by video multiplexer 276. Second control logic 275 also monitors the bias state of each conductive pad 151 and thus can assertain the current azimuthal position of circular platform 115. If the image of the principle speaker then needs to be centered within the field of view of the active video camera, second control logic 275 will activate either the rotate clockwise output 185 or the rotate counter clockwise output 186 as appropriate. Second control logic 275 will continue to reqeust a rotation of platform motor 141 until an electrical ground potential is sensed on the desired conductive pad 151 which correlates to the desired azimuthal positioning of circular platform 115.
The azimuthal positioning algorithm which is programmed into control logic 275 is shown by FIG. 12. Second positioning algorithm 290 is presented by FIG. 12 in a general block diagram form and is presented herein as a possible algorithm which may be implemented for controlling the functionings of second video conferencing apparatus 200. It should be apparent to those skilled in the art that differing algorithms may also be implemented, within the context of the components described by this specification, which would achieve similar results; and conversely it would also be possible to apply algorithm 290 to an implementation of components differing from the particular implementation of second video conferencing apparatus 200.
As FIG. 12 indicates, first control algorithm 290 begins with power-on reset step 291 wherein second video conferencing apparatus 200 is first powered on and activated. At this point in the operation of second video conferencing apparatus 200, a current principle speaker, CPS, is undetermined and is therefore unassigned. It is in new speaker detection step 292, that a new principle speaker, NPS, is identified. In second video conferencing apparatus 200 a detection of the NPS is made by way of a particular level comparator 174. When a particular digital signal 184 changes state from a logical `0` to a logical `1`, this indicates that an audio signal characteristic to that of a principle speaker has been captured by the audio microphone 120 that is respective to the particular digital signal 184 that is at a logical `1`. Following new speaker detection step 292 is new speaker affirmation step 293, wherein second control algorithm 290 insures that the CPS is inactive prior to designating the NPS to be the new CPS. In second video conferencing apparatus 200, the CPS is determined to be inactive if the particular digital signal 184 respective to the particular audio microphone 120 which corresponds to the azimuthal location of the CPS has returned to a logical `0` level. If the CPS is indeed inactive, then second control algorithm 290 continues to new speaker assignment step 294 wherein the NPS is assigned to be the new CPS. Else first control algorithm returns again to new speaker detection step 292. Once a new CPS assignment has been made, second control algorithm 290 will then to camera activation step 295 wherein the appropriate video camera 130 is activated to capture the image of the newly designated CPS. Second control algorithm 290 then advances to a subsequent step, camera positioning step 296 wherein the azimuthal position of the activated video camera 130 is appropriately adjusted through the workings of camera positioning assembly 140.
Although the preceding description contains various specificities, these should not be construed as limiting the scope of the invention but as merely providing an example of the preferred embodiments of this invention. Many modifications, alterations and changes will become apparent to those skilled in the art to which this invention pertains. For example, it is a relatively simple procedure to integrate the functionality of duplex unit 15 into either first video conferencing appartus 100 or second video conferencing apparatus 200 so as to realize a self contained speaker phone and video conferencing apparatus. Similarly, it is possible to design alternate embodiments of the invention having an alternate number of audio microphones or a differing number of video cameras than are shown by the embodiments described by this specification. Also it is possible to implement alternate algorithms and associated componentry for the identification of the azymuthal position of the principle speaker. For example, a digital microprocessor based method could concievably be employed to digitally process a digitized representation of the audio information that is captured by each audio microphone 110, and so determine the azymuthal position of the principle speaker through fully digital means, as opposed to the combined analog and digital means that are described by this specification. Similarly, it is also possible to implement alternate methods for the azymuthal positioning of video camera 120 which differ from rotational positioning apparatus 140 that is described by this specification. For example, it is also possible using well known techniques to sense the azymuthal position of video camera 120 using optical azymuthal position sensors and their associated componentry, or using magnetic azymuthal position sensors and their associated componentry in lieu of the methods which have been described by this specification.
Therefore as many alterations, modifications and changes will become apparent to those skilled in the art to which this invention pertains, the scope of this invention should be determined by the appended claims and their legal equivalents, rather than by the embodiment described herein.

Claims (12)

The inventors claim:
1. An apparatus for the purpose of determining the azimuthal position of a speaker from within a group of a first number of participants, said participants being azimuthally positioned about said apparatus, said apparatus comprising:
a means for the determination of the azimuthal position of said speaker from within said group of participants, wherein said determination of said azimuthal position of said speaker is realized through the electronic processing of audio signals generated by said group of participants, wherein said means for said determination of the azimuthal position of said speaker is comprised of a generally circular array of a second number of audio microphones, wherein said second number of said audio microphones is not correlated to said first number of said participants, said audio microphones being affixed to a common base element, said audio microphones being unattached from said participants, said audio microphones being located so as to capture said audio signals generated by said participants, wherein the low level electronic signal produced by each of said audio microphones is electronically amplified to produce a multiple number of corresponding first electronic signals, wherein the absolute value of each said first electronic signal is averaged over time through electronic means to produce a corresponding number of second electronic signals, wherein the magnitude of each said second electronic signal is converted to a multiple number of discrete digital electronic signals, wherein said multiple number of discrete digital electronic signals are processed by electronic computational logic, wherein said electronic computational logic computes the azimuthal position of said speaker through a pre-programmed algorithm.
2. A group video conferencing apparatus for the purpose of facilitating a group video conference involving a group of a first number of participants, said participants being azimuthally positioned about said apparatus, said apparatus comprising:
a means for the determination of the azimuthal position of said speaker from within said group of participants, wherein said determination of said azimuthal position of said speaker is realized through the electronic processing of audio signals generated by said group of participants, wherein said means for said determination of the azimuthal position of said speaker is comprised of a generally circular array of a second number of audio microphones, wherein said second number of said audio microphones is not correlated to said first number of said participants, said audio microphones being affixed to a common base element, said audio microphones being unattached from said participants, said audio microphones being located so as to capture said audio signals generated by said participants, wherein the low level electronic signal produced by each of said audio microphones is electronically amplified to produce a multiple number of corresponding first electronic signals, wherein the absolute value of each said first electronic signal is averaged over time through electronic means to produce a corresponding number of second electronic signals, wherein the magnitude of each said second electronic signal is converted to a multiple number of discrete digital electronic signals, wherein said multiple number of discrete digital electronic signals are processed by electronic computational logic, wherein said electronic computational logic computes the azimuthal position of said speaker through a pre-programmed algorithm,
a means to effect the azimuthal positioning of at least a single video camera, wherein said azimuthal positioning of said video camera is correlated to said azimuthal position of said speaker, wherein said azimuthal position of said speaker is determined by said electronic processing of said electronic signals generated by said group of participants.
3. The invention of claim 2 wherein said azimuthal positioning of said video camera is effected through electromechanical means, said electromechanical means comprising a motor means to effect said azimuthal positioning of said video camera, said electromechanical means also comprising electronic circuitry, said electronic circuitry acting to effect the actuation of said motor means.
4. The invention of claim 2 also comprising a monitoring means for the monitoring of said azimuthal positioning of said video camera, said monitoring means comprising an electrically conductive element wherein the azimuthal position of said electrically conductive element is in correlation with the azimuthal position of said video camera, said monitoring means comprising a group of electrically conductive pads wherein said electrically conductive pads are rigidly located into a generally circular arrangement, said monitoring means comprising electronic monitoring circuitry, said electrically conductive element being capable of contacting a particular said electrically conductive pad once the azimuthal position of said electrically conductive element is coincident with the azimuthal location of a particular said electrically conductive pad, said electrically conductive element being biased to a first electric potential, said electrically conductive pads being biased to a second electric potential, said electrically conductive element being capable of impressing said first electric potential onto a contacted said electrically conductive pad, said electronic monitoring circuitry being capable of sensing the electric potential of each said electrically conductive pad, said electronic monitoring circuitry ascertaining the azimuthal position of said video camera by sensing the electric potential of each said electrically conductive pad.
5. The invention of claim 2 wherein said generally circular array of said audio microphones and wherein said video camera are both affixed to a common structure for the purposes of comprising a constituent apparatus.
6. The invention of claim 2 wherein said generally circular array of said audio microphones and wherein said video camera are both affixed to a common structure for the purposes of comprising a constituent apparatus, wherein said video camera is centrally located to said generally circular array of said audio microphones.
7. The invention of claim 2 wherein said video conferencing apparatus also comprises an audio speaker means, said audio speaker means capable of converting an electronic signal into an audible audio signal, said audio signal being appropriate for the purposes of implementing a speakerphone.
8. A group video conferencing apparatus for the purpose of facilitating a group video conference involving a group of a first number of participants, said participants being azimuthally positioned about said apparatus, said apparatus comprising:
a means for the determination of the azimuthal position of said speaker from within said group of participants, wherein said determination of said azimuthal position of said speaker is realized through the electronic processing of audio signals generated by said group of participants, wherein said means for said determination of the azimuthal position of said speaker is comprised of a generally circular array of a second number of audio microphones, wherein said second number of said audio microphones is not correlated to said first number of said participants, said audio microphones being affixed to a common base element, said audio microphones being unattached from said participants, said audio microphones being located so as to capture said audio signals generated by said participants, wherein the low level electronic signal produced by each of said audio microphones is electronically amplified to produce a multiple number of corresponding first electronic signals, wherein the absolute value of each said first electronic signal is averaged over time through electronic means to produce a corresponding number of second electronic signals, wherein the magnitude of each said second electronic signal is converted to a multiple number of discrete digital electronic signals, wherein said multiple number of discrete digital electronic signals are processed by electronic computational logic, wherein said electronic computational logic computes the azimuthal position of said speaker through a pre-programmed algorithm,
a means to activate a video image captured by at least a single video camera, wherein the azimuthal field of view of the activated said video camera is correlated to said azimuthal position of said speaker, wherein said azimuthal position of said speaker is determined by said electronic processing of said audio signals generated by said group of participants.
9. The invention of claim 8 wherein said generally circular array of said audio microphones and wherein said video camera are both affixed to a common structure for the purposes of comprising a constituent apparatus.
10. The invention of claim 8 wherein said generally circular array of said audio microphones and wherein said video camera are both affixed to a common structure for the purposes of comprising a constituent apparatus, wherein said video camera is centrally located to said generally circular array of said audio microphones.
11. The invention of claim 8 wherein said video conferencing apparatus also comprises an audio speaker means, said audio speaker means capable of converting an electronic signal into an audible audio signal, said audio signal being appropriate for the purposes of implementing a speakerphone.
12. A group video conferencing apparatus for the purpose of facilitating a group video conference involving a group of a first number of participants, said participants being azimuthally positioned about said apparatus, said apparatus comprising:
a means for the determination of the azimuthal position of said speaker from within said group of participants, wherein said determination of said azimuthal position of said speaker is realized through the electronic processing of audio signals generated by said group of participants, wherein said means for said determination of the azimuthal position of said speaker is comprised of a generally circular array of a second number of audio microphones, wherein said second number of said audio microphones is not correlated to said first number of said participants, said audio microphones being affixed to a common base element, said audio microphones being unattached from said participants, said audio microphones being located so as to capture said audio signals generated by said participants, wherein the low level electronic signal produced by each of said audio microphones is processed by electronic circuitry so as to determine said azimuthal position of said speaker, said electronic circuitry effecting said determination of said azimuthal position of said speaker,
a means to effect the azimuthal positioning of at least a single video camera, wherein said azimuthal positioning of said video camera is correlated to said azimuthal position of said speaker, wherein said azimuthal position of said speaker is determined by said electronic processing of said audio signals generated by said group of participants,
a means for the monitoring of said azimuthal positioning of said video camera, said monitoring means comprising an electrically conductive element wherein the azimuthal position of said electrically conductive element is in correlation with the azimuthal positioning of said video camera, said monitoring means comprising a group of electrically conductive pads wherein said electrically conductive pads are rigidly located into a generally circular arrangement, said monitoring means comprising electronic monitoring circuitry, said electrically conductive element being capable of contacting a particular said electrically conductive pad once the azimuthal position of said electrically conductive element is coincident with the azimuthal location of a particular said electrically conductive pad, said electrically conductive element being biased to a first electric potential, said electrically conductive pads being biased to a second electric potential, said electrically conductive element being capable of impressing said first electric potential onto a contacted said electrically conductive pad, said electronic monitoring circuitry being capable of sensing the electric potential of each said electrically conductive pad, said electronic monitoring circuitry ascertaining the azimuthal position of said video camera by sensing the electric potential of each said electrically conductive pad.
US08/868,798 1997-06-04 1997-06-04 Video conferencing apparatus for group video conferencing Expired - Fee Related US6072522A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US08/868,798 US6072522A (en) 1997-06-04 1997-06-04 Video conferencing apparatus for group video conferencing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/868,798 US6072522A (en) 1997-06-04 1997-06-04 Video conferencing apparatus for group video conferencing

Publications (1)

Publication Number Publication Date
US6072522A true US6072522A (en) 2000-06-06

Family

ID=25352340

Family Applications (1)

Application Number Title Priority Date Filing Date
US08/868,798 Expired - Fee Related US6072522A (en) 1997-06-04 1997-06-04 Video conferencing apparatus for group video conferencing

Country Status (1)

Country Link
US (1) US6072522A (en)

Cited By (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6397275B1 (en) 1994-09-07 2002-05-28 Viseon, Inc. Peripheral video conferencing system
US20020085738A1 (en) * 2000-12-28 2002-07-04 Peters Geoffrey W. Controlling a processor-based system by detecting flesh colors
US20020191071A1 (en) * 2001-06-14 2002-12-19 Yong Rui Automated online broadcasting system and method using an omni-directional camera system for viewing meetings over a computer network
US20020196327A1 (en) * 2001-06-14 2002-12-26 Yong Rui Automated video production system and method using expert video production rules for online publishing of lectures
US6516066B2 (en) * 2000-04-11 2003-02-04 Nec Corporation Apparatus for detecting direction of sound source and turning microphone toward sound source
US20030220971A1 (en) * 2002-05-23 2003-11-27 International Business Machines Corporation Method and apparatus for video conferencing with audio redirection within a 360 degree view
US20040012669A1 (en) * 2002-03-25 2004-01-22 David Drell Conferencing system with integrated audio driver and network interface device
US6766035B1 (en) * 2000-05-03 2004-07-20 Koninklijke Philips Electronics N.V. Method and apparatus for adaptive position determination video conferencing and other applications
WO2004100546A1 (en) * 2003-05-08 2004-11-18 Tandberg Telecom As An arrangement and method for audio source tracking
US20040263636A1 (en) * 2003-06-26 2004-12-30 Microsoft Corporation System and method for distributed meetings
US20050015286A1 (en) * 2001-09-06 2005-01-20 Nice System Ltd Advanced quality management and recording solutions for walk-in environments
US20060012671A1 (en) * 2004-07-16 2006-01-19 Alain Nimri Natural pan tilt zoom camera motion to preset camera positions
US20060083389A1 (en) * 2004-10-15 2006-04-20 Oxford William V Speakerphone self calibration and beam forming
US20060082655A1 (en) * 2004-10-15 2006-04-20 Vanderwilt Patrick D High definition pan tilt zoom camera with embedded microphones and thin cable for data and power
US7035418B1 (en) * 1999-06-11 2006-04-25 Japan Science And Technology Agency Method and apparatus for determining sound source
US20060088308A1 (en) * 2004-10-15 2006-04-27 Kenoyer Michael L Camera support mechanism
US20060093128A1 (en) * 2004-10-15 2006-05-04 Oxford William V Speakerphone
US20060104633A1 (en) * 2004-10-15 2006-05-18 Kenoyer Michael L High definition camera pan tilt mechanism
US20060104458A1 (en) * 2004-10-15 2006-05-18 Kenoyer Michael L Video and audio conferencing system with spatial audio
US7057636B1 (en) * 1998-12-22 2006-06-06 Koninklijke Philips Electronics N.V. Conferencing system and method for the automatic determination of preset positions corresponding to participants in video-mediated communications
US20060132595A1 (en) * 2004-10-15 2006-06-22 Kenoyer Michael L Speakerphone supporting video and audio features
US7089285B1 (en) * 1999-10-05 2006-08-08 Polycom, Inc. Videoconferencing apparatus having integrated multi-point conference capabilities
US20060239477A1 (en) * 2004-10-15 2006-10-26 Oxford William V Microphone orientation and size in a speakerphone
US20060239443A1 (en) * 2004-10-15 2006-10-26 Oxford William V Videoconferencing echo cancellers
US20060256974A1 (en) * 2005-04-29 2006-11-16 Oxford William V Tracking talkers using virtual broadside scan and directed beams
US20060256991A1 (en) * 2005-04-29 2006-11-16 Oxford William V Microphone and speaker arrangement in speakerphone
US20060256983A1 (en) * 2004-10-15 2006-11-16 Kenoyer Michael L Audio based on speaker position and/or conference location
US20060262943A1 (en) * 2005-04-29 2006-11-23 Oxford William V Forming beams with nulls directed at noise sources
US20060262942A1 (en) * 2004-10-15 2006-11-23 Oxford William V Updating modeling information based on online data gathering
US7143182B1 (en) 2000-08-08 2006-11-28 Cisco Technology, Inc. Smart secretary for routing call objects in a telephony network
US20060269080A1 (en) * 2004-10-15 2006-11-30 Lifesize Communications, Inc. Hybrid beamforming
US20060269278A1 (en) * 2004-10-15 2006-11-30 Kenoyer Michael L Coordinated camera pan tilt mechanism
US20060269074A1 (en) * 2004-10-15 2006-11-30 Oxford William V Updating modeling information based on offline calibration experiments
US20070030984A1 (en) * 2005-08-02 2007-02-08 Gotfried Bradley L Conference system
US7246145B1 (en) 2000-08-08 2007-07-17 Cisco Technology, Inc. Fully distributed, scalable infrastructure, communication system
US20070195532A1 (en) * 2006-02-21 2007-08-23 Cml Innovative Technologies, Inc. LED lamp module
US20080030611A1 (en) * 2006-08-01 2008-02-07 Jenkins Michael V Dual Sensor Video Camera
US7415516B1 (en) 2000-08-08 2008-08-19 Cisco Technology, Inc. Net lurkers
US20080255840A1 (en) * 2007-04-16 2008-10-16 Microsoft Corporation Video Nametags
US20090003678A1 (en) * 2007-06-29 2009-01-01 Microsoft Corporation Automatic gain and exposure control using region of interest detection
US20090002477A1 (en) * 2007-06-29 2009-01-01 Microsoft Corporation Capture device movement compensation for speaker indexing
US20090002476A1 (en) * 2007-06-28 2009-01-01 Microsoft Corporation Microphone array for a camera speakerphone
US20090261597A1 (en) * 2005-04-14 2009-10-22 Natural Forces, Llc Reduced Friction Wind Turbine Apparatus and Method
US7852369B2 (en) * 2002-06-27 2010-12-14 Microsoft Corp. Integrated design for omni-directional camera and microphone array
WO2012031524A1 (en) * 2010-09-10 2012-03-15 中兴通讯股份有限公司 Microphone array device, conferencing system and smart terminal
US20120081504A1 (en) * 2010-09-30 2012-04-05 Alcatel-Lucent Usa, Incorporated Audio source locator and tracker, a method of directing a camera to view an audio source and a video conferencing terminal
US8457614B2 (en) 2005-04-07 2013-06-04 Clearone Communications, Inc. Wireless multi-unit conference phone
US8749612B1 (en) 2011-12-01 2014-06-10 Google Inc. Reduced bandwidth usage in video conferencing
US8791982B1 (en) 2012-06-27 2014-07-29 Google Inc. Video multicast engine
US8896656B2 (en) 2007-10-12 2014-11-25 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US8917309B1 (en) 2012-03-08 2014-12-23 Google, Inc. Key frame distribution in video conferencing
US9008487B2 (en) 2011-12-06 2015-04-14 Alcatel Lucent Spatial bookmarking
US9055332B2 (en) 2010-10-26 2015-06-09 Google Inc. Lip synchronization in a video conference
US9210302B1 (en) 2011-08-10 2015-12-08 Google Inc. System, method and apparatus for multipoint video transmission
US20150358585A1 (en) * 2013-07-17 2015-12-10 Ebay Inc. Methods, systems, and apparatus for providing video communications
US9294716B2 (en) 2010-04-30 2016-03-22 Alcatel Lucent Method and system for controlling an imaging system
US9465524B2 (en) 2008-10-13 2016-10-11 Steelcase Inc. Control apparatus and method for sharing information in a collaborative workspace
US9609275B2 (en) 2015-07-08 2017-03-28 Google Inc. Single-stream transmission method for multi-user video conferencing
USD788725S1 (en) * 2015-09-11 2017-06-06 Polycom, Inc. Videoconferencing unit
US9955209B2 (en) 2010-04-14 2018-04-24 Alcatel-Lucent Usa Inc. Immersive viewer, a method of providing scenes on a display and an immersive viewing system
US10051353B2 (en) * 2016-12-13 2018-08-14 Cisco Technology, Inc. Telecommunications audio endpoints
US10264213B1 (en) 2016-12-15 2019-04-16 Steelcase Inc. Content amplification system and method
US10631632B2 (en) 2008-10-13 2020-04-28 Steelcase Inc. Egalitarian control apparatus and method for sharing information in a collaborative workspace
US10863035B2 (en) 2017-11-30 2020-12-08 Cisco Technology, Inc. Microphone assembly for echo rejection in audio endpoints
US10884607B1 (en) 2009-05-29 2021-01-05 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US10904657B1 (en) * 2019-10-11 2021-01-26 Plantronics, Inc. Second-order gradient microphone system with baffles for teleconferencing
US10951859B2 (en) 2018-05-30 2021-03-16 Microsoft Technology Licensing, Llc Videoconferencing device and method
US20210334070A1 (en) * 2017-11-06 2021-10-28 Google Llc Methods and systems for attending to a presenting user
EP4002836A1 (en) * 2020-11-13 2022-05-25 Honda Motor Co., Ltd. Remote display system, robot, and display terminal

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3958084A (en) * 1974-09-30 1976-05-18 Rockwell International Corporation Conferencing apparatus
US4054908A (en) * 1975-05-27 1977-10-18 Poirier Alain M Videotelephone conference system
US4267593A (en) * 1979-06-15 1981-05-12 Wescom Switching, Inc. Method and means for digital conferencing
US4449238A (en) * 1982-03-25 1984-05-15 Bell Telephone Laboratories, Incorporated Voice-actuated switching system
US5003532A (en) * 1989-06-02 1991-03-26 Fujitsu Limited Multi-point conference system
US5117285A (en) * 1991-01-15 1992-05-26 Bell Communications Research Eye contact apparatus for video conferencing
US5206721A (en) * 1990-03-08 1993-04-27 Fujitsu Limited Television conference system
US5347306A (en) * 1993-12-17 1994-09-13 Mitsubishi Electric Research Laboratories, Inc. Animated electronic meeting place
JPH07264569A (en) * 1994-03-16 1995-10-13 Sumitomo Electric Ind Ltd Remote conference device
US5473367A (en) * 1993-06-30 1995-12-05 At&T Corp. Video view selection by a chairperson
US5479203A (en) * 1992-04-20 1995-12-26 Canon Kabushiki Kaisha Video camera apparatus with zoom control based on the pan or tilt operation

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3958084A (en) * 1974-09-30 1976-05-18 Rockwell International Corporation Conferencing apparatus
US4054908A (en) * 1975-05-27 1977-10-18 Poirier Alain M Videotelephone conference system
US4267593A (en) * 1979-06-15 1981-05-12 Wescom Switching, Inc. Method and means for digital conferencing
US4449238A (en) * 1982-03-25 1984-05-15 Bell Telephone Laboratories, Incorporated Voice-actuated switching system
US5003532A (en) * 1989-06-02 1991-03-26 Fujitsu Limited Multi-point conference system
US5206721A (en) * 1990-03-08 1993-04-27 Fujitsu Limited Television conference system
US5117285A (en) * 1991-01-15 1992-05-26 Bell Communications Research Eye contact apparatus for video conferencing
US5479203A (en) * 1992-04-20 1995-12-26 Canon Kabushiki Kaisha Video camera apparatus with zoom control based on the pan or tilt operation
US5473367A (en) * 1993-06-30 1995-12-05 At&T Corp. Video view selection by a chairperson
US5347306A (en) * 1993-12-17 1994-09-13 Mitsubishi Electric Research Laboratories, Inc. Animated electronic meeting place
JPH07264569A (en) * 1994-03-16 1995-10-13 Sumitomo Electric Ind Ltd Remote conference device

Cited By (138)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6519662B2 (en) 1994-09-07 2003-02-11 Rsi Systems, Inc. Peripheral video conferencing system
US6654825B2 (en) 1994-09-07 2003-11-25 Rsi Systems, Inc. Peripheral video conferencing system with control unit for adjusting the transmission bandwidth of the communication channel
US6397275B1 (en) 1994-09-07 2002-05-28 Viseon, Inc. Peripheral video conferencing system
US7057636B1 (en) * 1998-12-22 2006-06-06 Koninklijke Philips Electronics N.V. Conferencing system and method for the automatic determination of preset positions corresponding to participants in video-mediated communications
US7035418B1 (en) * 1999-06-11 2006-04-25 Japan Science And Technology Agency Method and apparatus for determining sound source
US7089285B1 (en) * 1999-10-05 2006-08-08 Polycom, Inc. Videoconferencing apparatus having integrated multi-point conference capabilities
US6516066B2 (en) * 2000-04-11 2003-02-04 Nec Corporation Apparatus for detecting direction of sound source and turning microphone toward sound source
US6766035B1 (en) * 2000-05-03 2004-07-20 Koninklijke Philips Electronics N.V. Method and apparatus for adaptive position determination video conferencing and other applications
US8495140B2 (en) 2000-08-08 2013-07-23 Cisco Technology, Inc. Fully distributed, scalable infrastructure, communication system
US7246145B1 (en) 2000-08-08 2007-07-17 Cisco Technology, Inc. Fully distributed, scalable infrastructure, communication system
US20080031242A1 (en) * 2000-08-08 2008-02-07 Cisco Technology, Inc. Fully distributed, scalable infrastructure, communication system
US7415516B1 (en) 2000-08-08 2008-08-19 Cisco Technology, Inc. Net lurkers
US7143182B1 (en) 2000-08-08 2006-11-28 Cisco Technology, Inc. Smart secretary for routing call objects in a telephony network
US20020085738A1 (en) * 2000-12-28 2002-07-04 Peters Geoffrey W. Controlling a processor-based system by detecting flesh colors
US7349005B2 (en) * 2001-06-14 2008-03-25 Microsoft Corporation Automated video production system and method using expert video production rules for online publishing of lectures
US6937266B2 (en) * 2001-06-14 2005-08-30 Microsoft Corporation Automated online broadcasting system and method using an omni-directional camera system for viewing meetings over a computer network
US20020196327A1 (en) * 2001-06-14 2002-12-26 Yong Rui Automated video production system and method using expert video production rules for online publishing of lectures
US20020191071A1 (en) * 2001-06-14 2002-12-19 Yong Rui Automated online broadcasting system and method using an omni-directional camera system for viewing meetings over a computer network
US20050015286A1 (en) * 2001-09-06 2005-01-20 Nice System Ltd Advanced quality management and recording solutions for walk-in environments
US7728870B2 (en) * 2001-09-06 2010-06-01 Nice Systems Ltd Advanced quality management and recording solutions for walk-in environments
US20040012669A1 (en) * 2002-03-25 2004-01-22 David Drell Conferencing system with integrated audio driver and network interface device
US7450149B2 (en) * 2002-03-25 2008-11-11 Polycom, Inc. Conferencing system with integrated audio driver and network interface device
US20030220971A1 (en) * 2002-05-23 2003-11-27 International Business Machines Corporation Method and apparatus for video conferencing with audio redirection within a 360 degree view
US7852369B2 (en) * 2002-06-27 2010-12-14 Microsoft Corp. Integrated design for omni-directional camera and microphone array
US7586513B2 (en) 2003-05-08 2009-09-08 Tandberg Telecom As Arrangement and method for audio source tracking
WO2004100546A1 (en) * 2003-05-08 2004-11-18 Tandberg Telecom As An arrangement and method for audio source tracking
US8111282B2 (en) * 2003-06-26 2012-02-07 Microsoft Corp. System and method for distributed meetings
US20090046139A1 (en) * 2003-06-26 2009-02-19 Microsoft Corporation system and method for distributed meetings
US7428000B2 (en) * 2003-06-26 2008-09-23 Microsoft Corp. System and method for distributed meetings
US20040263636A1 (en) * 2003-06-26 2004-12-30 Microsoft Corporation System and method for distributed meetings
US7623156B2 (en) * 2004-07-16 2009-11-24 Polycom, Inc. Natural pan tilt zoom camera motion to preset camera positions
US20060012671A1 (en) * 2004-07-16 2006-01-19 Alain Nimri Natural pan tilt zoom camera motion to preset camera positions
US7903137B2 (en) 2004-10-15 2011-03-08 Lifesize Communications, Inc. Videoconferencing echo cancellers
US7970151B2 (en) 2004-10-15 2011-06-28 Lifesize Communications, Inc. Hybrid beamforming
US20060269278A1 (en) * 2004-10-15 2006-11-30 Kenoyer Michael L Coordinated camera pan tilt mechanism
US20060269074A1 (en) * 2004-10-15 2006-11-30 Oxford William V Updating modeling information based on offline calibration experiments
US20060083389A1 (en) * 2004-10-15 2006-04-20 Oxford William V Speakerphone self calibration and beam forming
US20060262942A1 (en) * 2004-10-15 2006-11-23 Oxford William V Updating modeling information based on online data gathering
US7720232B2 (en) 2004-10-15 2010-05-18 Lifesize Communications, Inc. Speakerphone
US8237770B2 (en) * 2004-10-15 2012-08-07 Lifesize Communications, Inc. Audio based on speaker position and/or conference location
US8878891B2 (en) 2004-10-15 2014-11-04 Lifesize Communications, Inc. Providing audio playback during a conference based on conference system source
US20060256983A1 (en) * 2004-10-15 2006-11-16 Kenoyer Michael L Audio based on speaker position and/or conference location
US20060269080A1 (en) * 2004-10-15 2006-11-30 Lifesize Communications, Inc. Hybrid beamforming
US8116500B2 (en) 2004-10-15 2012-02-14 Lifesize Communications, Inc. Microphone orientation and size in a speakerphone
US20060082655A1 (en) * 2004-10-15 2006-04-20 Vanderwilt Patrick D High definition pan tilt zoom camera with embedded microphones and thin cable for data and power
US20060239443A1 (en) * 2004-10-15 2006-10-26 Oxford William V Videoconferencing echo cancellers
US8054336B2 (en) 2004-10-15 2011-11-08 Lifesize Communications, Inc. High definition pan tilt zoom camera with embedded microphones and thin cable for data and power
US20060104633A1 (en) * 2004-10-15 2006-05-18 Kenoyer Michael L High definition camera pan tilt mechanism
US20060088308A1 (en) * 2004-10-15 2006-04-27 Kenoyer Michael L Camera support mechanism
US7473040B2 (en) 2004-10-15 2009-01-06 Lifesize Communications, Inc. High definition camera pan tilt mechanism
US20060093128A1 (en) * 2004-10-15 2006-05-04 Oxford William V Speakerphone
US20060239477A1 (en) * 2004-10-15 2006-10-26 Oxford William V Microphone orientation and size in a speakerphone
US7572073B2 (en) 2004-10-15 2009-08-11 Lifesize Communications, Inc. Camera support mechanism
US20060132595A1 (en) * 2004-10-15 2006-06-22 Kenoyer Michael L Speakerphone supporting video and audio features
US7717629B2 (en) 2004-10-15 2010-05-18 Lifesize Communications, Inc. Coordinated camera pan tilt mechanism
US7720236B2 (en) 2004-10-15 2010-05-18 Lifesize Communications, Inc. Updating modeling information based on offline calibration experiments
US20060104458A1 (en) * 2004-10-15 2006-05-18 Kenoyer Michael L Video and audio conferencing system with spatial audio
US7826624B2 (en) 2004-10-15 2010-11-02 Lifesize Communications, Inc. Speakerphone self calibration and beam forming
US7667728B2 (en) 2004-10-15 2010-02-23 Lifesize Communications, Inc. Video and audio conferencing system with spatial audio
US7760887B2 (en) 2004-10-15 2010-07-20 Lifesize Communications, Inc. Updating modeling information based on online data gathering
US8457614B2 (en) 2005-04-07 2013-06-04 Clearone Communications, Inc. Wireless multi-unit conference phone
US20090261597A1 (en) * 2005-04-14 2009-10-22 Natural Forces, Llc Reduced Friction Wind Turbine Apparatus and Method
US7847428B2 (en) 2005-04-14 2010-12-07 Natural Forces, Llc Reduced friction wind turbine apparatus and method
US7907745B2 (en) 2005-04-29 2011-03-15 Lifesize Communications, Inc. Speakerphone including a plurality of microphones mounted by microphone supports
US20060256991A1 (en) * 2005-04-29 2006-11-16 Oxford William V Microphone and speaker arrangement in speakerphone
US20100008529A1 (en) * 2005-04-29 2010-01-14 Oxford William V Speakerphone Including a Plurality of Microphones Mounted by Microphone Supports
US7593539B2 (en) 2005-04-29 2009-09-22 Lifesize Communications, Inc. Microphone and speaker arrangement in speakerphone
US7970150B2 (en) 2005-04-29 2011-06-28 Lifesize Communications, Inc. Tracking talkers using virtual broadside scan and directed beams
US7991167B2 (en) 2005-04-29 2011-08-02 Lifesize Communications, Inc. Forming beams with nulls directed at noise sources
US20060262943A1 (en) * 2005-04-29 2006-11-23 Oxford William V Forming beams with nulls directed at noise sources
US20060256974A1 (en) * 2005-04-29 2006-11-16 Oxford William V Tracking talkers using virtual broadside scan and directed beams
US20070030984A1 (en) * 2005-08-02 2007-02-08 Gotfried Bradley L Conference system
US7488097B2 (en) * 2006-02-21 2009-02-10 Cml Innovative Technologies, Inc. LED lamp module
US20070195532A1 (en) * 2006-02-21 2007-08-23 Cml Innovative Technologies, Inc. LED lamp module
US20080030611A1 (en) * 2006-08-01 2008-02-07 Jenkins Michael V Dual Sensor Video Camera
US7667762B2 (en) 2006-08-01 2010-02-23 Lifesize Communications, Inc. Dual sensor video camera
US20080255840A1 (en) * 2007-04-16 2008-10-16 Microsoft Corporation Video Nametags
US20090002476A1 (en) * 2007-06-28 2009-01-01 Microsoft Corporation Microphone array for a camera speakerphone
US8526632B2 (en) * 2007-06-28 2013-09-03 Microsoft Corporation Microphone array for a camera speakerphone
US20090003678A1 (en) * 2007-06-29 2009-01-01 Microsoft Corporation Automatic gain and exposure control using region of interest detection
US8165416B2 (en) 2007-06-29 2012-04-24 Microsoft Corporation Automatic gain and exposure control using region of interest detection
US20090002477A1 (en) * 2007-06-29 2009-01-01 Microsoft Corporation Capture device movement compensation for speaker indexing
US8330787B2 (en) 2007-06-29 2012-12-11 Microsoft Corporation Capture device movement compensation for speaker indexing
US8749650B2 (en) 2007-06-29 2014-06-10 Microsoft Corporation Capture device movement compensation for speaker indexing
US9871978B1 (en) 2007-10-12 2018-01-16 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9883740B2 (en) 2007-10-12 2018-02-06 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US11743425B2 (en) 2007-10-12 2023-08-29 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US11337518B2 (en) 2007-10-12 2022-05-24 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workplace
US11202501B1 (en) 2007-10-12 2021-12-21 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US8896656B2 (en) 2007-10-12 2014-11-25 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US10925388B2 (en) 2007-10-12 2021-02-23 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9699408B1 (en) 2007-10-12 2017-07-04 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9510672B2 (en) 2007-10-12 2016-12-06 Steelcase Inc. Control apparatus and method for sharing information in a collaborative workspace
US9492008B2 (en) 2007-10-12 2016-11-15 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9462882B2 (en) 2007-10-12 2016-10-11 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9254035B2 (en) 2007-10-12 2016-02-09 Steelcase Inc. Control apparatus and method for sharing information in a collaborative workspace
US9462883B2 (en) 2007-10-12 2016-10-11 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9339106B2 (en) 2007-10-12 2016-05-17 Steelcase Inc. Control apparatus and method for sharing information in a collaborative workspace
US9456686B2 (en) 2007-10-12 2016-10-04 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9420880B2 (en) 2007-10-12 2016-08-23 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9456687B2 (en) 2007-10-12 2016-10-04 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9465524B2 (en) 2008-10-13 2016-10-11 Steelcase Inc. Control apparatus and method for sharing information in a collaborative workspace
US10631632B2 (en) 2008-10-13 2020-04-28 Steelcase Inc. Egalitarian control apparatus and method for sharing information in a collaborative workspace
US10884607B1 (en) 2009-05-29 2021-01-05 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US11112949B2 (en) 2009-05-29 2021-09-07 Steelcase Inc. Personal control apparatus and method for sharing information in a collaborative workspace
US9955209B2 (en) 2010-04-14 2018-04-24 Alcatel-Lucent Usa Inc. Immersive viewer, a method of providing scenes on a display and an immersive viewing system
US9294716B2 (en) 2010-04-30 2016-03-22 Alcatel Lucent Method and system for controlling an imaging system
WO2012031524A1 (en) * 2010-09-10 2012-03-15 中兴通讯股份有限公司 Microphone array device, conferencing system and smart terminal
US20120081504A1 (en) * 2010-09-30 2012-04-05 Alcatel-Lucent Usa, Incorporated Audio source locator and tracker, a method of directing a camera to view an audio source and a video conferencing terminal
US8754925B2 (en) * 2010-09-30 2014-06-17 Alcatel Lucent Audio source locator and tracker, a method of directing a camera to view an audio source and a video conferencing terminal
US9055332B2 (en) 2010-10-26 2015-06-09 Google Inc. Lip synchronization in a video conference
US9210302B1 (en) 2011-08-10 2015-12-08 Google Inc. System, method and apparatus for multipoint video transmission
US8749612B1 (en) 2011-12-01 2014-06-10 Google Inc. Reduced bandwidth usage in video conferencing
US9008487B2 (en) 2011-12-06 2015-04-14 Alcatel Lucent Spatial bookmarking
US8917309B1 (en) 2012-03-08 2014-12-23 Google, Inc. Key frame distribution in video conferencing
US8791982B1 (en) 2012-06-27 2014-07-29 Google Inc. Video multicast engine
US9386273B1 (en) 2012-06-27 2016-07-05 Google Inc. Video multicast engine
US10536669B2 (en) 2013-07-17 2020-01-14 Ebay Inc. Methods, systems, and apparatus for providing video communications
US11683442B2 (en) 2013-07-17 2023-06-20 Ebay Inc. Methods, systems and apparatus for providing video communications
US9681100B2 (en) * 2013-07-17 2017-06-13 Ebay Inc. Methods, systems, and apparatus for providing video communications
US20150358585A1 (en) * 2013-07-17 2015-12-10 Ebay Inc. Methods, systems, and apparatus for providing video communications
US10951860B2 (en) 2013-07-17 2021-03-16 Ebay, Inc. Methods, systems, and apparatus for providing video communications
US9609275B2 (en) 2015-07-08 2017-03-28 Google Inc. Single-stream transmission method for multi-user video conferencing
USD788725S1 (en) * 2015-09-11 2017-06-06 Polycom, Inc. Videoconferencing unit
US10051353B2 (en) * 2016-12-13 2018-08-14 Cisco Technology, Inc. Telecommunications audio endpoints
US11190731B1 (en) 2016-12-15 2021-11-30 Steelcase Inc. Content amplification system and method
US10897598B1 (en) 2016-12-15 2021-01-19 Steelcase Inc. Content amplification system and method
US11652957B1 (en) 2016-12-15 2023-05-16 Steelcase Inc. Content amplification system and method
US10638090B1 (en) 2016-12-15 2020-04-28 Steelcase Inc. Content amplification system and method
US10264213B1 (en) 2016-12-15 2019-04-16 Steelcase Inc. Content amplification system and method
US20210334070A1 (en) * 2017-11-06 2021-10-28 Google Llc Methods and systems for attending to a presenting user
US11789697B2 (en) * 2017-11-06 2023-10-17 Google Llc Methods and systems for attending to a presenting user
US10863035B2 (en) 2017-11-30 2020-12-08 Cisco Technology, Inc. Microphone assembly for echo rejection in audio endpoints
US10951859B2 (en) 2018-05-30 2021-03-16 Microsoft Technology Licensing, Llc Videoconferencing device and method
US10904657B1 (en) * 2019-10-11 2021-01-26 Plantronics, Inc. Second-order gradient microphone system with baffles for teleconferencing
US11750968B2 (en) 2019-10-11 2023-09-05 Plantronics, Inc. Second-order gradient microphone system with baffles for teleconferencing
EP4002836A1 (en) * 2020-11-13 2022-05-25 Honda Motor Co., Ltd. Remote display system, robot, and display terminal
US11637952B2 (en) 2020-11-13 2023-04-25 Honda Motor Co., Ltd. Remote display system, robot, and display terminal

Similar Documents

Publication Publication Date Title
US6072522A (en) Video conferencing apparatus for group video conferencing
EP1377041B1 (en) Integrated design for omni-directional camera and microphone array
US7038709B1 (en) System and method for tracking a subject
US6147701A (en) Image sensing apparatus
US9417433B2 (en) Camera arrangement
US6972787B1 (en) System and method for tracking an object with multiple cameras
US9952485B1 (en) Video surveillance camera having a separable and removable gimbal
US7023499B2 (en) Television receiver with motion sensor
US8711201B2 (en) Controlling a video window position relative to a video camera position
US5604551A (en) Magnetic recording/reproducing apparatus with video camera, suited for photorecording without attending camera operator
CN116208885A (en) Device with enhanced audio
JPS6359637B2 (en)
JP2003532348A (en) Method and apparatus for tracking moving objects using combined video and audio information in video conferencing and other applications
WO2008075726A1 (en) Video conferencing device
KR102077079B1 (en) Direction changeable microphone device
US5436654A (en) Lens tilt mechanism for video teleconferencing unit
EP0765084A2 (en) Automatic video tracking system
US7507920B2 (en) Hearing aid with a control element
Fiala et al. A panoramic video and acoustic beamforming sensor for videoconferencing
JPH0993471A (en) Panorama television camera and video monitor
CN111343413A (en) Video conference system and display method thereof
US6270239B1 (en) Fader wheel for lighting control console
JP2632149B2 (en) Video camera
TWI780450B (en) Pickup system and pickup device
JP2003529060A (en) Spatial sonic steering system

Legal Events

Date Code Title Description
LAPS Lapse for failure to pay maintenance fees
FP Lapsed due to failure to pay maintenance fee

Effective date: 20040606

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362