US 20080143819 A1
A conference link between devices in teleconference system at one site is disclosed. The linked devices include video conference units, speakerphones or stand-alone loudspeakers. Audio data can be transmitted through the conference link between devices. Audio data processing can be performed in either a video conference unit or a speakerphone. The microphones and loudspeakers in the video conference unit may be eliminated. The microphones and loudspeakers in the speakerphone are used. Other data, for example directories of dialing information, may also be transmitted through the conference link and the data processing may be allocated among processors in devices connected by conference links. The conference link may be wired or wireless, analog or digital. The devices may be linked by conference link in parallel or series. A dialing program can adapt the dialing stream with the locations of the near end site and the dialed far end site. The dialing program can automatically select a mutually supported network or protocol to establish a connection between two sites.
1. A method for combining a near end video conference unit and a near end speakerphone at a near end for a teleconference with at least one far end, the method comprising:
connecting the near end video conference unit and the near end speakerphone with a conference link;
picking up audio signals at the near end with microphones of the near end speakerphone;
deriving near end audio signals for transmission to a first far end; and
exchanging audio signals between the near end speakerphone and the near end video conference unit through the conference link.
2. The method of
connecting the near end video conference unit to the first far end;
the near end video conference unit receiving first far end audio signals;
the near end video conference unit transmitting the first far end audio signals to the near end speakerphone through the conference link; and
reproducing the first far end audio signals through loudspeakers of the near end speakerphone.
3. The method of
disabling microphones and loudspeakers of the near end video conference unit.
4. The method of
connecting the near end speakerphone to the first far end;
receiving first far end audio signals by the near end speakerphone;
transmitting the first far end audio signals to the near end video conference unit through the conference link; and
reproducing the first far end audio signals through loudspeakers of the near end speakerphone.
5. The method of
the near end video conference unit generating a bass audio signal from the first far end audio signals;
connecting a subwoofer to the near end video conference unit; and
reproducing the bass audio signal through the subwoofer.
6. The method of
connecting the near end video conference unit to the first far end;
the near end video conference unit receiving first far end audio signals;
connecting the near end speakerphone to a second far end;
the near end speakerphone receiving second far end audio signals;
transmitting the first far end audio signals and the second far end audio signals through the conference link;
mixing the first far end audio signals and the second far end audio signals to form a mixed far end signal; and
reproducing the mixed far end audio signals through loudspeakers of the near end speakerphone.
7. The method of
mixing the second far end audio signals with the near end audio signals to form a second-far-near audio signal;
the near end video conference unit transmitting the second-far-near audio signal to the first far end;
mixing the first far end audio signals with the near end audio signals to form a first-far-near audio signal; and
the near end speakerphone transmitting the first-far-near audio signal to the second far end.
8. The method of
9. The method of
10. The method of
11. The method of
12. The method of
13. The method of
14. The method of
15. The method of
16. The method of
17. The method of
18. The method of
connecting the near end video conference unit to one or more second near end speakerphones with second conference links in parallel; and
exchanging audio signals between one of the second near end speakerphones and the near end video conference unit through the second conference links.
19. The method of
connecting the near end video conference unit to a subwoofer with a third conference link;
the near end video conference unit generating a bass sound signal;
the near end video conference unit transmitting the bass sound signal to the subwoofer through the third conference link; and
reproducing the bass sound signal in the subwoofer.
20. The method of
connecting the near end speakerphone unit to one or more third near end speakerphones with fourth conference links in series; and
exchanging audio signals between one of the third near end speakerphones and the near end video conference unit through the near end speakerphone, the fourth conference links, and the first conference link.
21. The method of
transmitting all audio signals via the conference link to the near end video conference unit; and
processing the audio signals by a processor in the near end video conference unit.
22. The method of
transmitting all audio signals via the conference link to the near end speakerphone; and
processing the audio signals by a processor in the near end speakerphone.
23. The method of
transmitting data via the conference link to the near end video conference unit; and
processing the transmitted data by a processor in the near end video conference unit.
24. The method of
transmitting data via the conference link to the near end speakerphone; and
processing the transmitted data by a processor in the near end speakerphone.
25. The method of
connecting a second speakerphone via a second conference link to the near end speakerphone or the near end video conference unit;
transmitting audio signals via the second conference link to a second processor in the second speakerphone; and
processing the audio signals by the processor in the second speakerphone.
26. The method of
transmitting video signals via the conference link to the near end speakerphone; and
processing the video signals by a processor in the near end speakerphone.
This patent application is a continuation of co-pending and commonly assigned U.S. application Ser. No. 10/897,318, filed on Jul. 21, 2004 and entitled “Conference Link Between a Speakerphone and a Video Conference Unit,” which is a Non-Provisional of Application Ser. No. 60/562,782, filed on Apr. 16, 2004 and entitled “A Speakerphone with a Cellular Phone Connection,” assigned to the same assignee. The benefit of priority under 35 U.S.C. §§ 119-120 is hereby claimed.
This patent application is related to another patent application by Jed Wilson, Kate Nogarede and Greg Rousch, assigned to the same assignee, entitled “Method and Apparatus for Videoconference Interaction with Bluetooth-enabled Cellular Telephone,” attorney docket number 199-0225US.
1. Field of the Invention
This invention relates to conference equipment including a video conference unit and a speakerphone, more specifically to enhance and expand the features and functions of a combination of existing and future videoconference units and speakerphones.
2. Description of the Related Art
Teleconferencing has long been an essential tool for communication in business, government and educational institutions. There are many types of teleconferencing equipment based on many characterizations. One type of teleconferencing unit is a video conference unit, which transmits real-time video images as well as real-time audio signals. A video conferencing unit typically comprises a video processing component and an audio processing component. The video processing component may include a camera to pick up live images of conference participants and a video display for showing real-time video images of conference participants or images of documents. The audio portion of a video conferencing unit typically includes one or more microphones to pick up voice signals of conference participants, and loudspeakers to reproduce voices of the participants at the far end. There are many ways to connect video conferencing units. At the low end the link may be an analog plain old telephone service (POTS) line. It may be a digital service line such as an integrated service digital network (ISDN) line or a digital interface to PBX which may use a T1 or PRI line. More recently video conference units and speakerphones may be linked by digital networks using the Internet Protocol.
Video signals in a video conferencing unit are typically very different compared to an audio signal. Video signals are more complicated and bandwidth demanding than audio signals.
Another type of teleconference unit is a speakerphone, which is typically a speakerphone that includes at least a loudspeaker and a microphone. Similar to a video conference unit, a speakerphone may also have various connections to another speakerphone. The connection may be an analog POTS line, a digital service line such as an ISDN line or an IP connection.
Although video conferencing units and speakerphones have many overlapping features and functionalities, they do not usually work very well with each other. Typically, in a business or other entities, there is a video conferencing unit and a speakerphone in the same conference room. When a video conference is desired or required, the video conferencing unit is used. If only an audio conferencing is needed or available, the speakerphone is used.
As indicated above, the video conference unit and speakerphone have many features and functions overlapping. As a consequence, there is duplicate equipment for each conference unit. For example, there are microphones for the video conference unit and there are microphones for the speakerphone. There are both loudspeakers for the video conferencing unit and for the speakerphone. There are also wires connecting all these pieces. It is desirable to reduce the redundant equipment and un-clutter a typical conference room. It is desirable to have the video conference unit and the speakerphones share common components or to expand the capability and functions with redundant components.
The sound quality and features in a good speakerphone are typically better than the sound quality of the audio component in a video conference unit. The control on a speakerphone is simpler and easier to work with than a videoconference unit. It is desirable to upgrade and extend the sound quality of a video conference unit using new or existing speakerphones.
It is desirable to have a method and an apparatus with improved teleconferencing capabilities.
The present invention uses a conference link between a video conferencing unit and a speakerphone. With this link, audio signals may be transmitted between the video conferencing unit and the speakerphone. The connected video conferencing unit and the speakerphone can work as a single unit to take advantage of the components within the two units. In one embodiment, the redundant equipment in the video conferencing unit such as loudspeakers and microphones can be eliminated from a typical conference room. In another embodiment, all audio signal processing is performed by one of the audio signal processors in either the video conference unit or the speakerphone such that the best audio processing algorithm can be used. The conference link can connect multiple video conference units with multiple speakerphones in serial or parallel. In systems with multiple video conference units or speakerphones, the audio processing may be allocated in one or more processors, either in a video conference unit or a speakerphone. The conference link may be an analog link or a digital link, wired or wireless. Similarly, other data may also be transmitted through the conference link. Other data processing may be allocated to one or more processors. In addition to sharing microphones and loudspeakers, the speakerphone and the video conference unit may also share directories in each device. A dialing program can adapt the dialing stream with the locations of the near end and the dialed far end. The dialing program can automatically select a mutually supported network or protocol to establish a connection between two sites.
A better understanding of the invention can be had when the following detailed description of the preferred embodiments is considered in conjunction with the following drawings, in which:
A typical speakerphone is shown in
A block diagram of a speakerphone according to an embodiment of the current invention is shown in
As one can see from
Alternatively, if the audio components in the video conference unit 100 are retained, then the audio components in the speakerphone 200 can expand the capability of the video conferencing unit regarding the audio pickup and reproduction. The microphones and loudspeakers in the speakerphone 200 can provide wider coverage in a large conference room.
In another embodiment of the current invention, as shown in
When a digital connection is used, various data packets can be transmitted between the video unit 100 and the speakerphone 200. These data may include multiple channels of digitized audio data between the two units.
The data transmitted between the units are in data packets. Each packet may include several 16-bit words, typically two to eight words. Each word may represent the digitized data for one audio channel, one control command, one response or the like. In one embodiment, the digital link is implemented in a master/slave protocol, for example, a video unit is a master and all connected speakerphone are slaves. The communication between them is asymmetric.
Once the connection between a video conference unit and one or more speakerphones is setup, the audio data are transmitted between them. The video unit may be a master and the audio unit may be a slave. The audio unit is collecting audio data from its internal, external and auxiliary microphones at the local conference room, possibly in many distinct audio channels. The connection can be in parallel as shown in
The combined video conference unit and a speakerphone can be used to make various conference calls, e.g. an audio only conference call, a video conference call or a three-party mixed video and audio conference call.
When the speakerphone alone is making an audio only call, the speakerphone can be used as a normal speakerphone, except that part of the audio signal may be sent to the video unit for processing and reproduction. For example, the audio data from the far end is sent to the video unit via the conference link. The bass sound is produced in the subwoofer. The microphones in the video units are disabled.
When the video conference unit is making a video conference call, it can be used normally, except that the near end audio input is generated from the microphones in the speakerphone.
When a video conference unit and a speakerphone are both used in a three-site conference call as illustrated in
At the near end site, the audio portion may be processed as shown in
In the above examples where the master/slave protocol is used, the speakerphones perform only minimum data processing. The speakerphone is used primarily as an interface to the POTS network, as external microphones and as external loudspeakers. Therefore, a “dumb” and typically cheaper speakerphone may be installed in a conference room without degrading the audio conference capability in that conference room.
Alternatively, the data processing may be distributed differently, for example, by allocating all video data processing in the video conference unit and allocating all audio data processing in the speakerphone. In this embodiment, regardless of the types of conference calls, all video data are collected and processed by the processor in the video unit; all audio data are collected from various far end sites or near end site are sent to the speakerphone and processed in the speakerphone.
In yet another embodiment, the data processing is allocated among various components on an as-needed/as-available basis such that processing power in either the video conference unit or the speakerphone is fully utilized and balanced. In some state of the art video conference units or speakerphones, the processors are general purpose processors and very powerful, for example the processors in the Polycom VSX7000 video conference units or VTX1000 speakerphones have up to 1000 MIPS capabilities (1 MIPS=1 Million Instructions Processed per Second). As long as an appropriate software program is loaded to a processor, either a video data processing program or an audio data processing program, the processor can perform the processing task as dictated by the program. This way, each component, the video conference unit or the speakerphone does not run out of processing power until the combined units run out of processing power. Another benefit of this embodiment is making the combined video/speakerphone very scalable, i.e. the unit's processing power can grow gradually rather than replacing the old unit with a new more powerful one every time when the demands exceed the current capacity. For example, still referring to the system shown in
To simplify the process to establish a conference call, either a video conference call or an audio conference call, an auto dialing program may be installed. The auto dialing program may be installed in one of the processors in the devices linked by the conference links. It can keep track of calling information of itself and other parties. The calling information may include the POTS phone number, ISDN phone number, IP address etc. Each type of number may have a default mode of conference call, either a video call or an audio call. From its own calling information and that of the called party, the processor can determine which type of call will take place and what prefix, if any, is needed to be added in front of the dialing stream. All of the dialing information may be stored in a directory on each device. When a user wants to make a call, he can manually input the dialing information as usual, or he may select the other party from the directory list. When the user selects an entry from the directory, the dialing program determines the type of the call and the necessary prefix. For example, if both parties are internal to a same company, then only the four-digit extension 4567 is dialed, where the called party's phone number is 1-832-123-4567. The phone number includes the country code 1, area code 832, phone number 123-4567. If parties are in different countries, then appropriate country code, area code plus the access code will be added to the dialing stream. For example, when a speakerphone in Houston, Tex., USA dials a speakerphone in London, England, the dialing stream may be 9-011-44-20-1234-5678. The added prefix includes an access number 9 to reach an external telephone network and international phone call access number 011. But when the speakerphone in London dials the speakerphone in Houston, the dialing stream is 00-1-832-123-4567, where the international access number changes to 00 and no external access number is needed when the speakerphone is connected to the public telephone network directly.
Entries in a directory in a device may be entered or collected by various ways. They may be entered by a user manually, or downloaded from other speakerphones or video conference units linked by conference link, or captured during a conference call. During the process of establishing a conference call, the video conference units or speakerphones involved exchange dialing information. Such information may be stored in the directory maintained by the speakerphone or video conference units for later use.
The auto dialing program is aware of the different dialing numbers and their associated networks or protocols. When a user select an entry to establish a conference call, the auto dialing program selects a mutually supported network and protocol between the near end device and the far end device for the selected type of call. The selection of the networks or protocols is transparent to the user. In one embodiment, the available types of conference between the near end and a far end entry in the directory are indicated in the directory. So a user knows the types of conference calls available between the two parties before trying to establish a conference call. For example, a local video device may is capable of video calls through IP, ISDN or other network, but a far end only supports an ISDN video call. When a user initiates a video conference call, he can simply select the far end from the entry in the directory which may indicate that video conference capability is available at the far end. The auto dialing program selects the ISDN network and the ISDN number of the far end party to establish the video conference call. The user does not need to know the detail of what type of video call is established.
In addition to sharing components such as microphones and loudspeakers between linked speakerphones and video conference units via conference links, more functions and resources may be shared among them. For example, a directory on one device may be accessed by another device through the conference link.
The conference link may be an analog link or a digital link as described above. These examples are just some of many ways of implementing the current invention. When the audio signals are digital signals, the conference link may be a regular Ethernet link, a USB link or other packet network. The digital signal processor in the speakerphone can process the digital signals, performing D/A and A/D conversions. The processor in the videoconference unit can separate or combine the audio data with the video data. The combined digital video and audio data are exchanged through the digital network with the video conference unit on the far end. Many digital video conference protocols may be used, for example, the ITU H.32x family of recommendations that provides multimedia communication over a variety of networks. The video data and audio data under these recommendations are processed by different codecs or components. The processes are allocated to different logical components and can be easily allocated to different physical components. According to one embodiment of the current invention discussed above, the video processing is allocated to a video conference unit and the audio processing is allocated to the speakerphone. This way, more processing power in the video conference unit can be dedicated to the video processing. Alternatively, the processor in the video unit may control all the signal processing in a master/slave arrangement as discussed in the above examples.
The audio link between a speakerphone and a video conference unit can be wired as discussed above, or it can alternatively be wireless. Using a wireless connection can avoid the many problems associated with many different wires, such as limitation of the relative locations between the speakerphone and the video conferencing unit, the unsightly wires around the conference room and table, and the trip hazards for conference participants. In the example shown in
As discussed above, the embodiments of the current invention combine video conference units with speakerphones to make them work together seamlessly using conference links. With conference links, various speakerphone functions or video conference functions may be allocated among the two. The embodiments of the current invention improve and expand functionalities and features of videoconference units and speakerphones or allow cost reductions in the units. In either case, certain redundant hardware, particularly microphones and loudspeakers can be eliminated.
“Audio signals” as used in the current application can be either analog signals for audio channels in a teleconference unit, or digital signals for audio channels in a digital system. “Audio data” as used in the current application refers to digitized audio signals. “Audio data” are typically used in digital signal processors.
While illustrative embodiments of the invention have been illustrated and described, it will be appreciated that various changes can be made therein without departing from the spirit and scope of the invention.