WO2012104297A1

WO2012104297A1 - Generation of user-adapted signal processing parameters

Info

Publication number: WO2012104297A1
Application number: PCT/EP2012/051554
Authority: WO
Inventors: Bernhard Grill
Original assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
Priority date: 2011-02-01
Filing date: 2012-01-31
Publication date: 2012-08-09
Also published as: DE102011003450A1

Abstract

A mobile communication device includes an interface for receiving individual hearing system information of a user of the mobile communication device, a parameter adapter for providing at least one signal processing parameter adapted to the user with regard to the individual hearing system information, an audio data source for providing audio data, and an audio signal processor for receiving the audio data from the audio data source and the at least one signal processing parameter adapted to the user. The audio signal processor is provided for processing the audio data according to the at least one signal processing parameter and for providing an output signal resulting therefrom giving the user a spatial hearing impression.

Description

GENERATION OF USER-ADAPTED SIGNAL

PROCESSING PARAMETERS CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority from German Patent Application No. 10 201 1 003 450.1 - 31 , which was filed on February 1, 201 1 , and is incorporated herein in its entirety by reference. This application claims priority from US Provisional Patent Application No. 61/438,531 , which was fi!ed February 1 , 201 1 , and is incorporated herein in its entirety by reference.

BACKGROUND OF THE INVENTION The present invention relates to mobile telephones or mobile communication devices with the possibility of reproducing surround sound and, in particular, to a user-related adaptation of the mobile telephone/communication device for the reproduction o surround sound. The present invention relates further to a server which interacts with the mobile telephone/communication device for a user-related adaptation, to a method for generating an audio signal on a mobile telephone/communication device, a method for providing signal processing parameters and a computer program having a program code for executing at least one of the methods.

For several decades, pieces of music and other audio presentations which offered stereo sound or were recorded in stereo, at least for the general public, were the technology closest to real surround sound. At the end of the 90s, the first SACDs (super audio compact disks) and DVD-A (digital versatile disk-audio) were placed on the market, which represented a clear development also with respect to stereo recordings even with respect to surround sound. However, neither the SACD nor the DVD-A was a commercial success, which, according to many persons skilled in the art, is to be attributed to the fact that, on the one hand, special players were needed and, on the other hand, a short time later other formats for marketing music or audio representations were developed, in particular in the combination of highly compressed audio files and a distribution of these audio files via suitable communication networks like the internet.

On the other hand, films with surround sound are gaining increasing popularity. In the USA at the end of 2008 already 50% of households were equipped with a home cinema usually also including a loudspeaker set with five or more loudspeakers. These loudspeaker sets, depending on their equipment, are known as 5.0. 5.1 or 7.1 loudspeaker systems. Modem data media like DVD or Blu-Ray disks frequently contain corresponding audio data which may be reproduced with such loudspeaker equipment in order to enable a realistic listening experience.

In the past few years, the applicant has carried out research work with the aim of providing a realistic surround sound experience already with relatively low data transmission rates or a low memory requirement. The results from this research work regarding audio encoders, audio decoders and further products are summarized and referred to as "MP3 Surround" (see "An Introduction to MP3 Surround" by JOrgen Herre et al., Fraunhofer-Institut Integrierte Schaltungen IIS, 91058 Erlangen, Germany, and by Christof Faller, Agere Systems, Allentown, PA 18109, USA).

The human capacity for spatial hearing is based on a series of phenomena, such as phase and runtime differences between the two ears, shadowing effects by the head and body, reflection effects, for example at the shoulders, other body parts and objects in the environment, directional characteristic of the auricle and a location dependency of the frequency response of human hearing (among other things caused by the structure of the auricle). These influences have an effect on a spreading process, via which sounds from a sound source reach the hearing of the listener. This spreading process, to which possibly a component directly reflected from the sound source to the ear belongs, may be modeled as a filter. The Fourier transform of this filter is referred to as a "head-related transfer function (HRTF)". By means of HRTF, a spatial sound experience may be simulated for a listener who is wearing a headset and has sound transferred to him via the same. Although the form of an HRTF is roughly similar for most people, especially in the field of higher frequencies there are clear differences from one individual to the other. Among other things, different head shapes and different shapes of the auricle are decisive in this respect. As the brain of a listener adapts to the individual HRTFs of the listener over time, the capability for spatial hearing is reduced if for this listener a generic HRTF or the HRTF of another individual is used. For achieving a spatial sound experience which is as optimal as possible, it is thus needed in particular with the use of headsets, to use an HRTF (or several HRTFs) which come as close as possible to the natural HRTF of the corresponding individual. For this purpose, the natural HRTF of the individual is usually measured, which is relatively complex and needs special measurement equipment.

The article "Limitations of 3D Audio to Improve Auditory Cues in Aircraft Cockpits" by C.W. Johnson and W. Dell; Dept of Computing Science, Univ. of Glasgow, Scotland describes the results of a test for spatial hearing which was carried out using generic HRTFs. The authors assume that individualized HRTFs would have led to an improvement in the test results. For application in aviation, they propose the use of biometric data of the flight crew to load the corresponding individual HRTFs before the flight. Nevertheless, this requests the individual HRTF for each pilot to be measured once in advance. For a higher number of individuals, like, e.g., mobile telephone users or home cinema users, a correspondingly high number of individual measurements would be needed, which would mean high effort and high costs.

In the article "3D Audio for Mobile Devices via Java" by Matti Paavola et al., Audio Engineering Society, Convention Paper 6472, AES 118^th Convention, Barcelona, Spain, 2005 May 28-31, the authors provide an introduction to a new standard interface known as "Advanced Multimedia Supplements" for accessing specific features of new mobile devices from the Java programming language. The article states that the mobile terminal (telephone) is rapidly evolving from its origins as a basic device for voice communication into being an advanced multimedia computer able to handle demanding signal processing tasks in real time. The new interface disclosed in the article "3D Audio for Mobile Devices via Java" augments the existing mobile media specification with mechanisms to control audio effect processing in real time, including 3D positional audio and reverberation, all of which can be synthesized using standard stereo headphones or stereo microspeakers.

F. Amadu and J.-M. Raczinski describe a 3D audio engine in their article "An Efficient Implementation of 3D Audio Engine for Mobile Devices", AES 35^th International Conference, London, UK, 2009 February 11-13. This paper presents a generic and customizable 3D audio engine which has been specially designed for gaming on low-end mobile devices. The engine is based on source and listener 3D positioning for headphone playback. Distance attenuation, Doppler effect and reverberation can be added to fit JSR-234 (i.e., the Advanced Multimedia Supplements) specifications. In order to address platform diversity, the authors of the article "An Efficient Implementation of 3D Audio Engine for Mobile Devices" have developed a PC application to easily design the best 3D audio engine in accordance with processor capabilities. Standard HRTF based processes have been simplified to obtain a limited number of fixed point IIR filters, which have been successfully implemented on several platforms.

The article "Using Computer Vision to Generate Customized Spatial Audio" by A. Mohan, et al., IEEE, ICME 2003, pages 111-57 to 111-60, describes a set of algorithms to synthesize a virtual audio environment that includes real-time tracking of head position and orientation using an inexpensive setup. The article states that the creation of high quality virtual spatial audio over headphones requires real-time head tracking, personalized head- related transfer functions (HRTFs) and customized room response models. While there are expensive solutions to address these issues based on costly head trackers, measured personalized HRTFs and room responses, these are not suitable for widespread or easy deployment and use, according to the article "Using Computer Vision to Generate Customized Spatial Audio". The article reports on the development of a system that uses computer vision to produce customizable models for both the HRTF and the room response, and to achieve head-tracking. The system uses relatively inexpensive cameras and widely available personal computers. Computer-vision based anthropometric measurements of the head, torso, and the external ears are used for HRTF customization.

SUMMARY

According to an embodiment, a mobile communication device may have an interface for receiving individual hearing system information of a user of the mobile communication device; a parameter adapter for providing at least one signal processing parameter (or an estimation of a head-related transfer function) adapted to the user with regard to the individual hearing system information; an audio data source for providing audio data; an audio signal processor for receiving the audio data from the audio data source and the at least one signal processing parameter (or the estimation of the head-related transfer function) adapted to the user, for processing the audio data according to the at least one signal processing parameter (or the estimation of the head-related transfer function) and for providing an audio signal resulting therefrom, giving the user a spatial hearing impression.

According to another embodiment, a parameterization server may have a parameter generator for providing a signal processing parameter (or an estimation of a head-related transfer function) adapted to a user on the basis of individual hearing system information; a mobile radio network interface for receiving the individual hearing system information from a mobile communication device which relate to the user of the mobile communication device, and for transferring the signal processing parameter (or the estimation of the head-related transfer function) adapted to the user to the mobile communication device for processing audio data by the mobile communication device according to the at least one signal processing parameter (or the estimation of the head- related transfer function) and for providing an output signal resulting therefrom which gives the user a spatial hearing impression.

According to another embodiment, a method for generating an audio signal output by a mobile communication device may have the steps of receiving individual hearing system information of the user; determining at least one signal processing parameter (or an estimation of a head-related transfer function) adapted to the user at least with regard to the individual hearing system information; providing audio data; processing the audio data according to the at least one signal processing parameter (or the estimation of the head- related transfer function); outputting the resulting audio signal output from the mobile communication device which gives a spatial hearing impression to the user.

According to another embodiment, a method for providing at least one signal processing parameter (or an estimation of a head-related transfer function) which is adapted to a user so that an audio signal processed by means of the at least one signal processing parameter (or the estimation of the head-related transfer function) gives the user a spatial hearing impression may have the steps of receiving individual hearing system information from a mobile communication device via a mobile radio network, wherein the individual hearing system information relates to the user; generating the at least one signal processing parameter (or the estimation of the head-related transfer function) on the basis of the individual hearing system information; transferring the signal processing parameter (or the estimation of the head-related transfer function) adapted to the user via the mobile radio network to the mobile communication device.

According to the teachings disclosed herein, a mobile communication device includes an interface for receiving individual hearing system information of a user of the mobile communication device, a parameter adaptation means for providing at least a signal processing parameter (or an estimation of a head-related transfer function) adapted to the user with respect to the individual hearing system information, an audio data source for providing audio data, and an audio signal processor for receiving the audio data from the audio data source and the at least one signal processing parameter (or the estimation of the head-related transfer function) adapted to the user from the parameter adaptation means. The audio signal processor is further provided for processing the audio data according to the at least one signal processing parameter (or the estimation of the head-related transfer function) and for providing an output signal resulting therefrom which gives a spatial hearing impression to the user.

The at least one signal processing parameter (or the estimation of the head-related transfer function) may, for example, reproduce a characteristic of the HRTF, such as a frequency response. The individual hearing system information may be data of different types which characterize the hearing system of the mobile communication device user, in particular with respect to a spatial sound perception capability of the mobile communication device user. The mobile communication device saves the user the execution of a complex measurement for determining a complete HRTF by determining the at least one signal processing parameter (or the estimation of the head-related transfer function) from individual hearing system information which is easier to determine. This determination may be executed on the basis of empirical knowledge. In particular, two individuals may not have identical, but at least similar HRTFs, which may be determined via a suitable selection of the individual hearing system information. There is thus a correlation between suitably selected individual hearing system information and the at least one signal processing parameter (or the estimation of the head-related transfer function) or the complete HRTF. According to embodiments, the mobile communication device may further include a radio interface which is implemented to receive the individual hearing system information from the interface for the individual hearing system information and transfer the same to a server. The radio interface may further be implemented to receive the at least one signal processing parameter (or the estimation of the head-related transfer function) adapted to the user from the server and pass the same on to the audio signal processor. In this way, an actual allocation of the at least one signal processing parameter (or the estimation of the head-related transfer function) to the individual hearing system information is shifted from the mobile communication device to the server. When allocating the individual hearing system information to the at least one signal processing parameter (or the estimation of the head-related transfer function), a matching or alignment of the individual hearing system information to a relatively extensive database may be needed. Such a matching to the extensive database could cause high requirements on a storage capacity and a computing power of the mobile communication device. Mobile communication devices today, however, frequently offer both high storage capacity and also high computing power.

In implementations, the mobile communication device may further include an image recording device. The image recording device may be implemented to provide at least a part of the individual hearing system information in the form of an image at least partially showing the user for the parameter adaptation means. The image may, in this respect, show the hearing system of the mobile communication device user or parts of the same (e.g. auricle). The image recording device may be in connection with the interface for receiving individual hearing system information and/or the parameter adaptation means. Image processing for extracting individual hearing system information from the image which may further be used may be provided in the parameter adaptation means or as an independent Image processing device. Such image processing may, for example, provide numerical values like dimensions or classification results (for example with respect to the shape of the auricle). According to embodiments of the teaching disclosed herein, the audio data source may be implemented to provide sample audio signals for a reproduction for the user. The interface for receiving the individual hearing system information may be implemented to receive an evaluation of the sample audio signals by the user with respect to the spatial hearing impression. The evaluation of the sample audio signals may be part of the individual hearing system information or also the complete individual hearing system information. By a suitable sample audio signals it may be tested with which sample audio signals the mobile communication device user, for example, observes an especially clear spatial sound perception and with which other sample audio signals no remarkable spatial sound perception is offered. By a suitable structuring of the sample audio signals, for example as a (binary) tree, for the mobile communication device user, sample audio signals may be reproduced which are successively better tuned to the same, so that a relatively quick collection of the individual hearing system information is enabled. The reproduction of the sample audio signals and the evaluation of the sample audio signals by the user may insofar be interactive. The sample audio signals may in particular be specially prepared stereo audio signals which are reproduced conventionally by the mobile communication device via a headset. It is not needed for the audio signal processor to be involved in the reproduction of the sample audio signals, although this is possible. If the audio signal processor is involved in the reproduction of the sample audio signals, apart from the sample audio signals also sample signal processing parameters (or the sample estimations of the head-related transfer function) may be provided for being used by the audio signal processor. By the reproduction and the evaluation of the sample audio signals, by means of a search from coarse to fine an HRTF may be determined for the mobile communication device user, which is relatively similar to the natural HRTF of the mobile communication device user and enables a sufficiently realistic spatial auditory perception.

In embodiments of the mobile communication device disclosed herein, the audio signal processor may be implemented to process at least two audio data streams of different origins such that the user perceives the at least two audio data streams as being spaced apart from each other according to the spatial hearing impression. In this way, the user can typically distinguish the audio data streams better from each other, which frequently increases the comprehensibility of the audio data streams. For example, the user may be in a telephone conference with two communication partners and the audio signals coming in from the two communication partners are processed by the audio signal processor such that the user gains the impression that communication partner No. 1 is located left in front of him, whereas communication partner No. 2 is, rather, located on the right in front of him. The audio data streams may be of different origins, in particular telephone connections, music reproduction, radio or television transmissions of, for example, sports events, speech synthesis computers or podcasts.

In a similar way, in embodiments of the teaching disclosed herein, the audio data source may include at least one of the following: a mobile radio receive section, an internal storage of the mobile communication device, a removable medium, a computer connected via an online connection and a streaming media connection.

A parameterization server according to the teaching disclosed herein includes a parameter generator and a mobile radio interface. The parameter generator serves for providing a signal processing parameter (or an estimation of a head-related transfer function) adapted to a user on the basis of individual hearing system information. The mobile radio interface serves for receiving the individual hearing system information from a mobile communication device which relate to the user of the mobile communication device or his hearing system. The mobile radio network interface also serves for transferring the signal processing parameter (or the estimation of the head-related transfer function) adapted to the user to the mobile communication device for processing audio data by the mobile communication device according to the at least one signal processing parameter (or the estimation of the head-related transfer function) and for providing an output signal resulting therefrom which gives the user a spatial hearing impression.

By the parameterization server, the determination of the signal processing parameter (or the estimation of the head-related transfer function) adapted to the user which may need a lot of computation and storage may be relocated on the basis of the individual hearing system information from the mobile communication device to hardware which is better suited for this purpose. Advantage is taken here of the fact that a mobile communication device is suitable for data communication with a remote computer, like the parameterization server via a mobile radio network. In embodiments of the parameterization server disclosed herein, the parameter generation may include a database with data sets which include predetermined individual hearing system information and the at least one signal processing parameter (or the estimation of the head-related transfer function) which is allocated to the predetermined individual hearing system information of the respective data set. The parameter generation may be implemented to select at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio interface. Further, the parameter generator may be implemented to transfer the at least one associated signal processing parameter (or the estimation of the head-related transfer function) via the mobile radio interface to the mobile communication device.

The parameterization server may further include an image processing means which is implemented for receiving an image from the mobile radio interface which at least partially shows the user. The image processing means is implemented to analyze the image with respect to at least a part of the individual hearing system information of the user and for providing at least that part of the individual hearing system information to the parameter generator.

In embodiments of the teaching disclosed herein, the parameterization server may further include a sample audio signal collection with sample audio signals. The mobile radio interface may then be implemented to transfer at least one of the sample audio signals to the mobile communication device and to receive a user assessment of the at least one sample audio signal from the mobile communication device with respect to the spatial hearing impression of the at least one part of the individual hearing system information. The sample audio signals may be associated with H TFs of individuals for which the sample audio signals have provided a spatial hearing impression that is virtually true to the original. When a mobile communication device user observes a similarly strong spatial hearing impression when playing one of the sample audio signals, this may be used as an indication that the mobile communication device user has an HRTF which is similar to the HRTF of the individual to which this sample audio signal is allocated.

In embodiments of the parameterization server according to the teaching disclosed herein, the parameter generator may be implemented to evaluate a function into which at least a part of the individual hearing system information received by the mobile radio interface is introduced as an input quantity or variable and which provides the at least one signal processing parameter (or the estimation of the head-related transfer function) as an output quantity or variable.

According to embodiments of the teaching disclosed herein, the parameterization server may further include an accounting or billing unit which is implemented for generating billing information with respect to an executed generation or transfer of the at least one signal processing parameter (or the estimation of the head-related transfer function) to the mobile communication device. The generation of the billing information is typically linked to the executed generation or determination of the at least one signal processing parameter (or the estimation of the head-related transfer function) to the mobile communication device, so that a reliable billing for the use of the parameterization server is possible. The term parameterization server is here not restricted to one single computer in the sense of information and communication technology, but may also relate to a server complex or a computational center or also to the complete infrastructure of a corresponding service provider,

A method for generating an audio signal output by a mobile communication device according to the teaching disclosed herein includes the following: receiving individual hearing system information of the user; determining at least one signal processing parameter (or an estimation of a head-related transfer function) adapted to the user at least with respect to the individual hearing system information; providing audio data; processing the audio data according to the at least one signal processing parameter (or the estimation of the head-related transfer function); and outputting the resulting audio signal to be output by the mobile communication device which gives a spatial hearing impression to the user,

Processing the audio data according to the at least one signal processing parameter (or the estimation of the head-related transfer function) leads to the resulting audio signal which is output by the mobile communication device, possibly after further signal processing steps like a digital-to-analog conversion or an amplification, which, however, do not substantially change the audio signal.

The method may further include transferring the individual hearing system information via a radio connection to a server and receiving the at least one signal processing parameter (or the estimation of the head-related transfer function) adapted to the user via the radio connection from the server.

In embodiments of the method, this may further include recording an image at least partially showing the user which forms at least a part of the individual hearing system information.

In further embodiments of the method according to the teaching disclosed herein, the method may further include the following: reproducing sample audio signals for the user; receiving an evaluation of the sample audio signals by the user with respect to the spatial hearing impression; and adding the evaluation to the individual hearing system information.

In embodiments of the method, processing the audio data may be executed according to a first spatial sound source position. The method may further include a processing of further audio data, i.e. according to the at least one signal processing parameter (or the estimation of the head-related transfer function) adapted to the user, and a second spatial sound source position, so that the user perceives the further data as being spaced apart from the (first) audio data with respect to the spatial hearing impression.

By such a spatial separation in the hearing impression of the user of the (first) audio data and the further audio data, the user is typically supported in the theoretical separation or processing of the (first) audio data and the further audio data, as he can allocate the audio data or the further audio data to their respective source more easily.

A method for providing at least one signal processing parameter (or an estimation of a head-related transfer function) which is adapted to a user so that an output signal processed by means of the at least one signal processing parameter (or the estimation of the head- related transfer function) gives a spatial hearing impression to the user includes the following: receiving individual hearing system information from a mobile communication device via a mobile radio network, wherein the individual hearing system information relates to the user; generating the at least one signal processing parameter (or the estimation of the head-related transfer function) on the basis of the individual hearing system information; and conveying the signal processing parameter (or the estimation of the head-related transfer function) adapted to the user to the mobile communication device via the mobile radio network.

In embodiments according to the teachings disclosed herein, the method may further include the following: querying a database including data sets which include predetermined individual hearing system information and at least one allocated signal processing parameter (or an estimation of a head-related transfer function); selecting at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio network; and transferring the at least one allocated signal processing parameter (or the estimation of the head-related transfer function) to the mobile communication device.

In embodiments of the teaching disclosed herein, the method may further include evaluating a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input variable and which then provides the at least one signal processing parameter (or the estimation of the head-related transfer function) as an output variable. In embodiments according to the teaching disclosed herein, the method may further include the following: receiving an image at least partially showing the user via the mobile radio network; analyzing the image with respect to at least one part of the individual hearing system information; and using at least that part of the individual hearing system information for generating the at least one signal processing parameter (or the estimation of the head-related transfer function).

The method may further include transferring at least one sample audio signal via the mobile radio network to the mobile communication device and receiving a user evaluation of the at least one sample audio signal with respect to the spatial hearing impression as at least one part of the individual hearing system information.

The method may further include generating billing information with respect to a completed generation of the at least one signal processing parameter (or the estimation of the head- related transfer function) or the transfer of the same to the mobile communication device.

BRIEF DESCRIPTION OF THE DRAWINGS

In the following, embodiments of the present invention will be explained in more detail with reference to the accompanying drawings, in which:

Fig. 1. shows a schematical block diagram of a mobile communication device of an embodiment of the disclosed teaching with a focus on some components of the mobile communication device relevant in this respect;

Fig. 2 shows a schematical block diagram of a mobile communication device of a further embodiment of the disclosed teaching;

Fig. 3 shows a schematical block diagram of a mobile communication device according to a further embodiment of the disclosed teaching;

Fig. 4 shows a schematical block diagram of a mobile communication device according to a further embodiment of the disclosed teaching; Fig. 5 shows a schematical representation of a spatial sound impression perceived by a user, for example during a telephone conference; Fig, 6 shows a schematical block diagram with components of a mobile communication device suitable for implementing the concept illustrated in Fig. 5; Fig. 7 shows a schematical block diagram of a parameterization server according to an embodiment of the disclosed teaching;

Fig. 8 shows a schematical block diagram of a parameterization server according to a further embodiment;

Fig. 9 shows a schematical flowchart of a method for generating an audio signal output by a mobile communication device according to the disclosed teaching;

Fig. 10 shows a schematical flowchart of a method for providing at least one signal processing parameter by the parameterization server;

Fig. 11 shows an illustration of a loudspeaker arrangement for acquiring so-called surround sound; and Fig. 12 shows a schematical block diagram of a user-adapted conversion of a 5-channel audio signal into a 2-channel signal or a headset signal.

DETAILED DESCRIPTION OF THE INVENTION Fig. 1 shows a schematical block diagram of a mobile communication device or mobile telephone, wherein basically such components of the mobile telephone are illustrated which are useful for understanding the teaching disclosed herein. A user of the mobile telephone 100 inputs individual hearing system information via an interface 1 12. The individual hearing system information may include general information on the user, like age, sex, height and/or weight. Alternatively or additionally, the individual hearing system information may contain more detailed indications with respect to the hearing system of the user, like, for example, the head circumference measured at the ears, the distance of the ears or dimensions of the auricle(s). These indications directed to the hearing system of the user may request the user to measure himself, for example using a measuring tape or a ruler, or have another person help him in doing so. These measurements may be relatively easy to do and, apart from the mentioned conventional measuring devices, need no further measuring equipment. From the interface 112, the individual hearing system information is transferred to a parameter adaptation means 1 14. The parameter adaptation means 1 14 uses the individual hearing system information to determine at least one signal processing parameter, typically, however, one data set of several signal processing parameters which matches the user as well as possible. Typically, a signal processing parameter set determined in this way represents an estimate of an HRTF of the user. The signal processing parameter set may also include several estimates for several HRTFs of the user. The several HRTFs may, for example, be provided for one space or spatial sector each of a user-centered coordinate system, so that the user may execute a spatial allocation of an audio signal according to the respective spatial sector as far as the signal processing parameter set comprises a sufficient correspondence to the natural HRTF of the user.

The signal processing parameter/s are transferred from the parameter adaptation means 114 to an audio signal processor 118. The audio signal processor 118 also receives audio data from an audio data source 116. The audio data provided by the audio data source 116 may either be a pure mono signal, a conventional stereo signal or an audio signal containing surround sound information, like, e.g., a 5.0 signal, a 5.1 signal, a 7.1 signal or an MP3 Surround signal. When the audio data is a mono signal, the audio signal processor 1 18 may give a fictitious standard sound source position which may be used for a so-called binaural rendering. For example, this standard sound source position may be assumed directly in front of the user at a distance of approx. 2 m, In a similar way, for the left and the right channel of a stereo signal standard positions may be assumed. For these standard positions, special HRTFs may be provided, so that corresponding mono or stereo signals may be rendered binaurally as faithfully as possible to the original for these standard positions. If the audio data is present in a format which already contains extensive surround sound information, like, e.g., a 5.1 signal, then the individual channels are basically allocated to different loudspeakers positioned in a certain arrangement around the user. The positions of the individual loudspeakers are described in different standards of the ITU (International Telecommunication Union). The reproduction of a 5.1 signal via loudspeakers according to the ITU standard is mainly provided for home cinema applications, however, and is hardly practicable in connection with mobile telephones or other portable devices. The user of the mobile telephone will thus typically fall back on a headset 122. In connection with the reproduction of, for example, 5.1 audio data, it is thus needed to reproduce the five or six channels via the headset 122 such that the user basically perceives the same hearing impression as if the 5.1 audio signal were, for example, reproduced by a suitable home cinema loudspeaker device. Typically, for each of the loudspeaker positions given according to the ITU standard, at least one HRTF is generated by the parameter adaptation means 1 14 as far as a reproduction of the 5.1 signal which is as faithful to the original as possible is desired over the headset 21.

Fig. 2 shows a schematical block diagram of a mobile telephone 100 of a further embodiment. The interface 1 12, the audio data source 1 16 and the audio signal processor 1 18 are already known from Fig. 1 and basically fulfill the same functions as described in connection with Fig. 1. The parameter adaptation means 214 of Fig. 2 is different from the parameter adaptation means 1 14 in Fig. 1. The parameter adaptation means 214 illustrated in Fig. 2 is connected to a radio interface 215. The parameter adaptation means 214 may pass on the individual hearing system information received from the interface 1 12 to the radio interface 215. In the opposite direction, the parameter adaptation means 214 may receive the at least one signal processing parameter from the radio interface 215 and pass the same on to the audio signal processor 118. The radio interface 215 is in turn connected to a mobile radio network 216 via an antenna. A server 700 is also connected to the mobile radio network 216. The radio interface 215 transmits the individual hearing system information via the mobile radio network 216 to the server 700. The server 700 thereupon executes a determination of the at least one signal processing parameter which matches the individual hearing system information. The at least one signal processing parameter is transmitted from the server 700 via the mobile radio network 216 to the radio interface 215 of the mobile telephone 100. In particular when the generation of the at least one signal processing parameter includes the query of a database, in which the data sets best matching the individual hearing system information are searched, it is best to hold this database available on a central server as a transfer of extensive data amounts via the mobile radio network 216 or another medium may thus be prevented. Instead, only small amounts of information are transmitted via the mobile radio network 216 in the form of the individual hearing system information and the signal processing parameter adapted to the user. Further, the necessity is omitted to store the database on the mobile telephone 100. The parameter adaptation means 214 typically mainly serves as a connection between the interface 1 12, the radio interface 215 and the audio signal processor 1 18. The parameter adaptation means 214 may, however, also execute tasks going beyond that, like, e.g., a user profile management for the signal processing parameters if the mobile telephone 100 is used by several different users. By a user profile management or administration, with a change of users, the already generated signal processing parameters for this user may be loaded without the somewhat more extensive renewed determination of the signal processing parameters being needed. Also the signal processing means 1 14 illustrated in Fig. 1 may comprise a corresponding functionality. Apart from that, still further tasks are possible which the parameter adaptation means 1 14, 214 may execute. Fig. 3 shows a schematical block diagram of a mobile telephone 100 according to a further embodiment. The individual hearing system information may partially also be gained from images which represent the user or his hearing system. For example, in particular the shape and the size of the auricles may provide hints as to how an individual perceives sounds, as the three-dimensional shape of the auricle acts like an acoustic filter on incoming sound waves which attenuates certain frequencies and lets others pass unchanged. Further, also from a picture of the user, conclusions may be drawn about the hearing system, like, e.g., the distance of the ears and their positioning with respect to the head. To be able to use the individual hearing system information contained in an image recording, the mobile telephone may include an image recording device 31 1, using which an image of the user or of his ears may be recorded. Alternatively, the image recording device may also be present externally with respect to the mobile telephone 100, as is indicated by the dashed line. An image 301 recorded by the image recording device 311 is passed on to the interface 312. Possibly, still further individual hearing system information is added, as was described with respect to Fig. 1. The interface 312 collectively passes on the individual hearing system information, i.e. the image 301 and the further individual hearing system information, to the parameter adaptation means 314. The parameter adaptation means 314 passes on the image 301 to an image processing means 317. The image processing means 317 does not necessarily have to be part of the mobile telephone 100, but may, for example, be installed on a universal computer of the user with corresponding software. The image processing means 317 analyzes the image 301 with respect to features contained therein, which may be used as individual hearing system information. The individual hearing system information extracted in this way from the image 301 is given back to the parameter adaptation means 314.

When the image 301 is recorded by the image recording device 31 1 , the user may be instructed to position a comparison measurement element within the image recording area in such a way that the image 301 also contains an imaging of the comparison measurement element. As a comparison measurement, a ruler may be used or also certain standardized objects of everyday life, like, for example, certain coins. Using the picture of the comparison measurement element contained in the image 301, the image processing means 317 may standardize the image independent of an image recording geometry and thus determine relatively exact measurement values for dimensions illustrated in the image 301. Accordingly, the image processing means 317 may contain a size standardization module. The method for generating an audio signal output by a mobile telephone may contain a step, according to which the picture of the comparison measurement element contained in the image 301 is detected and measured so that a subsequent step of standardizing the image 301 may be executed. Deviating from the arrangement of parameter adaptation means 314 and image processing means 317 illustrated in Fig. 3, the image processing means 317 may also be connected to other illustrated components. For example, the image processing means 317 may be directly connected to the image recording device 311 and the result of the image processing may be passed on to the interface 312.

The further processing of the signal processing parameters provided by the parameter adaptation means 314 basically corresponds to that of Fig. 1.

Fig. 4 shows a block diagram of a mobile telephone 100 according to a further implementation of the teaching disclosed herein. For determining the signal processing parameters adapted to the user, sample audio signals are played to the user which are to be evaluated by the user with respect to a spatial sound impression. The sample audio signals are stored, for example, in a database or a memory area 425, from where they are transferred to the audio data source 416. The audio data source 416 may now either directly reproduce the sample audio signals, for example, as stereo signals via the headset 122. In this case, the sample audio signals stored in the memory 425 are already mixed or rendered as a stereo signal such that for some listeners a strong spatial sound impression results. In this case, the audio data source 416 outputs the sample audio signal directly to the headset 122, i.e. bypassing the audio signal processors 118. Another possibility is to reproduce generic sample audio signals in connection with prepared signal processing parameters using the audio signal processor 1 18 (dashed arrows in Fig. 4). The prepared signal processing parameters originating from the database 425 may be varied so that, for the user when reproducing the same generic sample audio signal, different spatial sound impressions result.

Independent of the method used for reproducing the sample audio signals, the user may input an assessment of the respective spatial sound impressions via the interface 1 12, from where they are transferred to the parameter adaptation means 414. The parameter adaptation means 414 also receives an identification number (ID) of the last sample audio signal which was reproduced or a set of prepared signal processing parameters from the database of the sample audio signals 425. The assessments of the sample audio signals together with the associated identification numbers form the individual hearing system information, using which the user-adapted signal processing parameter(s) may be determined. In the case In which the sample audio signals are formatted as stereo signals which are mixed or rendered such that the spatial sound impression (at least for some listeners) is inherently contained in the sample audio signal, the sample audio signals may be obtained as follows. For the execution of a parameter adaptation based on sample audio signals, typically reference data from individuals are needed, for which a conventional measurement, for example of the HRTF, was executed. The sample audio signals may now be obtained by equipping the reference individual with microphones, advantageously miniature microphones, which are placed at the inputs of the respective auditory canals of the reference individual. The microphones at the inputs of the auditory canals of the reference individual then record sound signals as they are also substantially perceived by the reference individual. Thus, different sample audio signals may be generated by playing different sounds from different relative positions to the reference individual. The thus generated sample audio signals are stored with respect to the HRTF (possibly also several HRTFs) belonging to the reference individual, so that an allocation is possible when a user assesses or evaluates the sample audio signals as being especially expressive with regard to a spatial sound impression.

Fig. 5 illustrates a possible application of a mobile telephone according to the teaching disclosed herein. Let us assume that the user is participating in a telephone conference with four participants. A typical problem with telephone conferences is that it is often difficult for the individual participants to allocate which of the other participants has been talking as the signals all arrive at the user as mono audio signals. The teaching disclosed herein enables allocating a special spatial position to each of the other participants 1 to 3 for the purpose of audio signal processing by the audio signal processor 118. Thus, participant 1 is located at a position 1 which is at +45° in front of the user. Participant 2 is located at a position 2 which lies directly in front of the user (0°). The participant 3 is located in a position 3 which is at an angle of -45°. The user may thereupon better differentiate the participants 1 to 3 from each other based on the direction from which the user acoustically perceives the respective participant.

If the audio signals of the participants 1 to 3 arrive at the user via a common channel, for example from a telephone conference provider, additional inaudible signals may be added to the audio signals which give information on the origin of the audio signals. Fig. 6 shows a schematica! block diagram, using which the functionality illustrated in Fig. 5 may be implemented. The mobile telephone includes a mobile radio module 610 which is implemented to support several speech connections simultaneously (in the illustrated example, three speech connections). The three signals are transferred to the audio signal processor 618. Apart from that, the mobile radio module 610 also transfers information via the speech connections to the parameter adaptation means 614, such as, e.g., the number of currently existing speech connections. The parameter adaptation means 614 thereupon generates a corresponding number of position parameter sets also transferred to the audio signal processor 618. The audio signal processor 618 executes a binaural rendering of the three signals using the three position parameter sets so that the output signal sounds for the user as if it were coming from a direction of +45°, whereas the output signals 2 and 3 seem to be coming from the directions 0° or -45°. Fig. 7 shows a schematical block diagram of a parameterization server 700 according to the teaching disclosed herein. The server 700 is connected to a mobile telephone 100 via a mobile radio network 216. For this purpose, the server includes a mobile radio interface 715. The mobile radio interface 715 is not necessarily to be regarded as hardware which is needed for mobile radio communication (in the sense of transmit/receive part, antenna, etc.). Rather, the mobile radio interface 715 represents a general connection to the mobile radio network 216 which may, however, possibly include several intermediate stations. In particular, the server may be connected to one or several base stations of the mobile radio network 216 which establish the radio connection to the mobile telephone. The mobile telephone 100 transmits individual hearing system information via the mobile radio network 216 to the server 700. The individual hearing system information arrive at the mobile radio interface 715 and are passed on to a parameter generator 714 which is comparable to the parameter adaptation means 1 14, 314, 414 with regard to its function. The parameter generator 71 determines signal processing parameters adapted to the user based on the individual hearing system information. The user-adapted signal processing parameters are transmitted back to the mobile telephone 100 again via the mobile radio interface 715 and the mobile radio network 216. The mobile telephone 100 uses the transmitted signal processing parameters in order to process audio signals for the respective user such that for this user a spatial sound impression results that is as realistic as possible.

Fig. 8 shows a schematical block diagram of another embodiment of the parameterization server 800. The parameter generator 814 now includes a database 840 and a function evaluation 850. The database 840 and the function evaluation 850 do not have to exist simultaneously, however. Further, the parameterization server 800 includes a database for sample audio signals 825 and an image processing means 817. Also here it is the case that the database of the sample audio signals 825 and the image processing means 817 may be present optionally, but do not have to be present. In so far, Fig. 8 shows an embodiment in which several optional components are combined without this representing the only possible combination.

As was described with reference to Fig. 7, the parameterization server 800 receives individual hearing system information from the mobile telephone 100 via the mobile radio network 216. The individual hearing system information arrives at the mobile radio interface 815 of the parameterization server 800 and is passed on to the parameter generator 814. The database 840 of the parameter generator 814 typically contains data sets which contain hearing system information of reference individuals as well as signal processing parameters determined for these reference individuals. By a query of the database 840 by means of the individual hearing system information, data sets may be extracted from the database 840 in which the hearing system information is similar to the individual hearing system information transmitted by the mobile telephone 100. As far as the database 840 is sufficiently densely and uniformly filled with data sets of different reference individuals, generally a data set may be determined whose signal processing parameters get sufficiently close to the HRTF of the mobile telephone user.

Additionally or alternatively to the query or request of the database 840, also a function may be evaluated which is done with the help of the function evaluation means 850. Only for the purpose of a more simple representation, the function is represented which maps a two-dimensional input variable to a one-dimensional output quantity. In general, the function is a mapping from R" into R"¹. The function may, for example, be based on a mathematical model or interpolate between empirically determined bases or support points, or a combination of both.

A further possibility for determining the signal processing parameters adapted to the user is to let the user evaluate sample audio signals, as was explained in connection with Fig. 4. In contrast to the embodiment according to Fig. 4, here the sample audio signals are stored in the database for sample audio signals 825 of the parameterization server 800. Thus, the sample audio signals are transmitted via the mobile radio interface 815 and the mobile radio network 216 to the mobile telephone 100 in order to be reproduced there for the user.

The evaluations of the sample audio signals by the user are transmitted from the mobile telephone 100 back to the parameterization server 800. From this point, the determination of the signal processing parameters basically corresponds to that explained with reference to Fig. 4. The image processing means 817 of the parameterization server 800 may be used when at least part of the individual hearing system information is present in the form of an image of the user or his hearing system. The image is transmitted from the mobile telephone 100 to the parameterization server 800 via the mobile radio network 216. Within the parameterization server 800, the image is supplied to the image processing means 817 which extracts features from the image. These features may then be used by the parameter generator 814 to determine signal processing parameters which match the user as well as possible. Basically, also here the processing of the image corresponds to that explained with reference to Fig. 3.

The database 840, the function evaluation means 850, the database of the sample audio signals 825 and the image processing means 817 typically need access to a sufficiently large database to fulfill their tasks. Transmitting a database via the mobile radio network 216 to the mobile telephone 100 is typically not desired due to the large amount of data to be transmitted. Apart from that, such a database is relatively valuable, as many cost and time consuming HRTF determinations with reference individuals were needed for generating the same. For this reason, it will be a prerogative to protect the data base of the database 840, the function evaluation means 850, the sample audio signals 825 and the image processing means 817 as well as possible against third-party unauthorized access.

The parameterization server 800 further includes an optional billing unit 830 connected to the mobile radio interface 815 and/or the parameter generator 814 and acquiring information from the same on whether a generation of a user-adapted signal processing parameter has taken place. For this purpose, an identification number of the mobile telephone 100 or the SIM card used therein may be used. The billing information may then be transmitted to a mobile radio service provider where the mobile telephone 100 or the SIM card is registered,

Fig. 9 shows a schematical flowchart of the method for generating an audio signal output by a mobile telephone according to the teaching disclosed herein. After the beginning of the method, at 902 first of all individual hearing system information of the user is received. The individual hearing system information of the user may be of a different kind, as described above with reference, for example, to Figs. 1 to 4. In the action designated by 904, at least one signal processing parameter is determined adapted to the user with respect to the individual hearing system information. This may in many cases be done by a comparison with reference data. The actions designated by 902 and 904 are typically executed once for each user of the mobile telephone 100. Then, the signal processing parameters adapted to the user are generally available permanently for the mobile telephone and are stored by the mobile telephone for later use. The subsequently described actions 906, 908 and 910 may be executed every time when the user of the mobile telephone 100 wants a surround sound-true reproduction of audio data. In this respect, first of all the audio data is provided at 906. At 908 the audio data are then processed according to the user-adapted signal processing parameters. The resulting output signal is then output to the user within the scope of the action 910, that is typically via a headset.

Fig. 10 shows a schematical flowchart for the method for providing at least one signal processing parameter according to the teaching disclosed herein. After starting the method, at 1002 individual hearing system information of the user is received via a mobile radio network. At 1004, at least one signal processing parameter adapted to the user is generated. Typically, a whole set of user-adapted signal processing parameters is generated which enable an audio signal processor of the mobile telephone to process an audio signal such that, for the user to which the signal processing parameters are adapted, a spatial sound impression results which is as realistic as possible. The at least one user-adapted signal processing parameter is transferred back to the user or the mobile telephone 100 via the mobile radio network, as is illustrated at 1006 of the flowchart illustrated in Fig. 10. Thereupon, the method of providing the at least one signal processing parameter ends. Apart from the actions 1002, 1004 and 1006 represented in Fig. 10, the method may include further actions like, e.g., executing image processing, providing sample audio data or also generating billing information.

Fig. 11 shows a loudspeaker arrangement with respect to a user according to the standard designated by 5.1. The 5.1 standard is used for a realistic reproduction of surround sound audio signals in cinemas and home cinemas. Apart from that, it is also applied in the reproduction of music recordings encoded in the format needed for this purpose. According to the 5.1 standard, five equal loudspeakers are placed on a circle around the listener. A left loudspeaker L, a central loudspeaker C and a right loudspeaker R are placed in front of the listener. A left surround loudspeaker LS and a right surround loudspeaker RS are positioned on the left and right side of the listener and somewhat offset to the back. A sub-woofer (SUB) completes the 5.1 loudspeaker system. The sub-woofer or LFE loudspeaker (low frequency effect) is mainly provided for low-frequency portions of the audio signal which can hardly be located by human beings, or not at all. Thus, the sub- woofer may be positioned in a freely selectable position in the vicinity of the listener. If the sub-woofer is omitted, a 5.0 loudspeaker system is obtained, wherein the low- frequency audio signal portions are reproduced via the five main loudspeakers. Using a loudspeaker system as is illustrated in Fig. 1 1 , realistic surround sound hearing impressions may be acquired for the listener in the planned position. The DVDs or BlucRay discs used in home cinema applications frequently have 5.0 or 5.1 -capable soundtracks which are provided for reproducing a loudspeaker system as illustrated in Fig. 1 1 and for achieving the desired surround sound effect.

In combination with a mobile telephone, however, frequently only reproduction either via an internal loudspeaker of the mobile telephone or a headset is possible. In a reproduction via the internal loudspeaker of the mobile telephone, usually rather substantial compromises have to be made with respect to sound quality and surround sound. Frequently, via the internal loudspeaker of the mobile telephone, only a mono reproduction is possible. In a reproduction via a headset, however, a good or very good sound quality and also a spatial sound perception may be achieved for the listener. The latter requests, however, that the signals be rendered for the left headset and the right headset, so that they simulate the HRTF of the respective user. When, for example, an audio signal present in the 5.0 format or the 5.1 format is to be rendered for a reproduction via the headset 122, this requests an audio signal portion normally reflected or radiated by the left loudspeaker L to be converted into a left headset signal component and a right headset signal component. In this respect it has to be considered that sound propagates differently from the left loudspeaker L to the left ear of the listener than to the right ear of the listener (see Fig. 1 1). The effects occurring in the sound propagation from the left loudspeaker L to the two ears have to be considered when a 5.0 or 5.1 audio signal is to be converted into a headset signal. Fig. 12 schematically illustrates a block diagram enabling such a conversion. L, R, C, LS and RS designate the individual sub-channels of a 5.0 signal in Fig. 12 which are to be converted for reproduction via headset. This is explained using the signal L provided for the left main loudspeaker. The signal L is transferred as an input variable to a functional block 1202. The functional block 1202 calculates an output signal which results on the input signal L and the user-adapted HRTF_L, wherein the HRTF_L is the head-related transmission function between the left loudspeaker L and the ears of the listener (see Fig. 1 1). The functional block 1202 has two outputs, one for the left headset and one for the right headset. The output signal of the functional block 1202 for the left headset is transferred to a signal adder or combiner 1222. The output signal of the functional block 1202 for the right headset is transferred to a signal adder or combiner 1224. At the signal adders or combiners 1222 and 1224, also those output signals of the functional blocks 1204, 1206, 1208 and 1210 for the respective headset (left or right) are transferred so that, at the outputs of the signal adders 1222 and 1224, an overlaying signal results for each of the left and the right headsets.

The functional blocks 1202 to 1210 are provided with the respectively applying hcad- related transmission functions (HRTF_L, HRTFR, HRTF_C, HRTF_ls and HRTF_RS). These user-adapted HRTFs are transferred from the parameter adaptation means 114 of the mobile telephone 100 to the functional blocks 1202 to 1210.

Although some aspects have been described in connection with a device, it is obvious that these aspects also represent a description of the corresponding method, so that a block or a component of a device may also be understood as a corresponding method step or as a feature of a method step. Analog to that, aspects that were described in connection with a or as a method step, also represent a description of a corresponding block or detail or feature of a corresponding device. Some or all of the method steps may be executed by a hardware apparatus (or using a hardware apparatus) like, for example, a microprocessor, a programmable computer or an electronic circuit. In some embodiments, some or several of the most important method steps may be executed by such an apparatus.

According to a further aspect of the teachings disclosed herein, a mobile telephone as described above may further comprise a radio interface implemented to receive the individual hearing system information from the interface for the individual hearing system information and to transfer the same to a server, and which is further implemented to receive the at least one estimation of the head-related transfer function (HRTF) adapted to the user from the server and pass the same on to the audio signal processor.

With respect to the parameterization server, and according to a further aspect of the teachings disclosed herein, the parameter generator may comprise a database with data sets which comprise predetermined individual hearing system information and at least one associated estimation of the head-related transfer function, and wherein the parameter generator may be implemented to select at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio interface, and to transfer the at least one associated estimation of the head-related transfer function to the mobile telephone. Furthermore, the parameter generator may be implemented to evaluate a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input quantity and which provides the at least one estimation of the head- related transfer function as an output quantity. According to a further aspect, the parameterization server may further comprise a billing unit which is implemented for generating billing information with respect to a completed generation or transfer of the at least one estimation of the head-related transfer function to the mobile telephone.

With respect to a method for generating an audio signal output, the method may further comprise: transferring the individual hearing system information via a radio connection to a server, and receiving the at least one estimation of the head-related transfer function adapted to the user via the radio connection from the server. Still regarding the method, the action of processing the audio data may executed according to a first spatial sound source position, in which case the method may further comprise: processing further audio data according to the at least one estimation of the head-related transfer function adapted to the user and a second spatial sound source position so that the user perceives the further audio data as being spaced apart from the audio data with respect to the spatial hearing impression.

Regarding a method for providing at least one estimation of the head-related transfer function, the method may further comprise: querying a database comprising data sets comprising predetermined individual hearing system information and at least one allocated estimation of the head-related transfer function; selecting at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio network; and transferring the at least one allocated estimation of the head-related transfer function to the mobile telephone. The method may further comprise: evaluating a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input quantity and which provides the at least one estimation of the head- related transfer function as an output quantity. The method may further comprise: receiving the image at least partially showing the user via the mobile radio network; analyzing the image with respect to at least one part of the individual hearing system information; and using at least that part of the individual hearing system information for generating the at least one estimation of the head-related transfer function. Moreover, the method may further comprise: generating billing information with respect to a completed generation or transfer of the at least one estimation of the head-related transfer function to the mobile telephone.

According to a further possible aspect of the disclosed teachings, a mobile telephone, comprises an interface for receiving individual hearing system information of a user of the mobile telephone; a parameter adapter for providing at least one signal processing parameter adapted to the user with regard to the individual hearing system information; an audio data source for providing audio data; an audio signal processor for receiving the audio data from the audio data source and the at least one signal processing parameter adapted to the user, for processing the audio data according to the at least one signal processing parameter and for providing an audio signal resulting therefrom giving the user a spatial hearing impression.

The mobile telephone may further comprise a radio interface implemented to receive the individual hearing system information from the interface for the individual hearing system information and to transfer the same to a server, and which is further implemented to receive the at least one signal processing parameter adapted to the user from the server and pass the same on to the audio signal processor.

The mobile telephone may further comprise an image recording device implemented to provide at least a part of the individual hearing system information in the form of an image at least partially showing the user for the parameter adapter.

The audio data source may be implemented or configured to provide sample audio signals for a reproduction for the user, and wherein the interface for receiving the individual hearing system information is implemented or configured to receive an evaluation of the sample audio signals by the user with respect to the spatial hearing impression as at least one part of the individual hearing system information.

The audio signal processor may be implemented or configured to process at least two audio data streams of different origin such that the user perceives the at least two audio data streams according to the spatial hearing impression as being spaced apart from each other.

The audio data source may comprise at least one of the following: a mobile radio receive section, an internal storage of the mobile telephone, a removable medium, a computer connected via an online connection and a streaming media connection.

According to a further possible aspect of the disclosed teachings a parameterization server comprises a parameter generator for providing a signal processing parameter adapted to a user on the basis of individual hearing system information; a mobile radio network interface for receiving the individual hearing system information from a mobile telephone which relate to the user of the mobile telephone, and for transferring the signal processing parameter adapted to the user to the mobile telephone for processing audio data by the mobile telephone according to the at least one signal processing parameter and for providing an output signal resulting therefrom which gives the user a spatial hearing impression.

The parameter generator may comprise a database with data sets which comprise predetermined individual hearing system information and at least one associated signal processing parameter, and wherein the parameter generator is implemented or configured to select at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio interface, and to transfer the at least one associated signal processing parameter to the mobile telephone.

The parameter generator may be implemented or configured to evaluate a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input quantity and which provides the at least one signal processing parameter as an output quantity.

The parameterization server may further comprise an image processor which is implemented or configured for receiving an image at least partially illustrating the user from the mobile radio interface and for analyzing the image with respect to at least one part of the individual hearing system information and for providing at least that part of the individual hearing system information to the parameter generator.

The parameterization server may further comprise a sample audio signal collection with sample audio signals; wherein the mobile radio interface is implemented or configured to transfer at least one of the sample audio signals to the mobile telephone and to receive, from the mobile telephone, a user evaluation of the at least one sample audio signal with respect to the spatial hearing impression as being at least a part of the individual hearing system information. The parameterization server may further comprise a billing unit which is implemented or configured for generating billing information with respect to a completed generation or transfer of the at least one signal processing parameter to the mobile telephone.

According to a further possible aspect of the disclosed teachings a method for generating an audio signal output by a mobile telephone comprises receiving individual hearing system information of the user; determining at least one signal processing parameter adapted to the user at least with regard to the individual hearing system information; providing audio data; processing the audio data according to the at least one signal processing parameter; outputting the resulting audio signal output from the mobile telephone which gives a spatial hearing impression to the user.

The method may further comprise transferring the individual hearing system information via a radio connection to a server; receiving the at least one signal processing parameter adapted to the user via the radio connection from the server.

The method may further comprise recording an image at least partially showing the user which forms at least part of the individual hearing system information.

The method may further comprise reproducing sample audio signals for the user; receiving an evaluation of the sample audio signals by the user with respect to the spatial hearing impression; adding the evaluation to the individual hearing system information. Processing the audio data may be executed according to a first spatial sound source position, and the method may further comprise processing further audio data according to the at least one signal processing parameter adapted to the user and a second spatial sound source position so that the user perceives the further audio data as being spaced apart from the audio data with respect to the spatial hearing impression.

A further possible aspect of the teachings disclosed herein proposes a method for providing at least one signal processing parameter which is adapted to a user so that an audio signal processed by means of the at least one signal processing parameter gives the user a spatial hearing impression, the method comprising: receiving individual hearing system information from a mobile telephone via a mobile radio network, wherein the individual hearing system information relates to the user; generating the at least one signal processing parameter on the basis of the individual hearing system information; transferring the signal processing parameter adapted to the user via the mobile radio network to the mobile telephone.

The method may further comprise querying a database comprising data sets comprising predetermined individual hearing system information and at least one allocated signal processing parameter, selecting at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio network; transferring the at least one allocated signal processing parameter to the mobile telephone. The method may further comprise evaluating a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input quantity and which provides the at least one signal processing parameter as an output quantity.

The method may further comprise receiving the image at least partially showing the user via the mobile radio network; analyzing the image with respect to at least one part of the individual hearing system information, using at least that part of the individual hearing system information for generating the at least one signal processing parameter.

The method may further comprise transferring at least one sample audio signal via the mobile radio network to the mobile telephone; receiving a user evaluation of the at least one sample audio signal with respect to the spatial hearing impression being at least a part of the individual hearing system information.

The method may further comprise generating billing information with respect to a completed generation or transfer of the at least one signal processing parameter to the mobile telephone. According to a further possible aspect of the teachings disclosed herein, a mobile telephone comprises an interface for receiving individual hearing system information of a user of the mobile telephone; a parameter adapter for providing at least one estimation of a head- related transfer function adapted to the user with regard to the individual hearing system information; an audio data source for providing audio data; an audio signal processor for receiving the audio data from the audio data source and the at least one estimation of the head-related transfer function adapted to the user, for processing the audio data according to the at least one estimation of the head-related transfer function and for providing an audio signal resulting therefrom giving the user a spatial hearing impression. According to a further possible aspect of the disclosed teachings, a parameterization server, comprises a parameter generator for providing an estimation of a head-related transfer function adapted to a user on the basis of individual hearing system information; a mobile radio network interface for receiving the individual hearing system information from a mobile telephone which relate to the user of the mobile telephone, and for transferring the estimation of the head-related transfer function adapted to the user to the mobile telephone for processing audio data by the mobile telephone according to the at least one estimation of the head-related transfer function and for providing an output signal resulting therefrom which gives the user a spatial hearing impression. According to a further possible aspect of the disclosed teachings, a method for generating an audio signal output by a mobile telephone comprises receiving individual hearing system information of the user; determining at least one estimation of a head-related transfer function adapted to the user at least with regard to the individual hearing system information; providing audio data; processing the audio data according to the at least one estimation of the head-related transfer function; outputting the resulting audio signal output from the mobile telephone which gives a spatial hearing impression to the user. According to a further possible aspect of the disclosed teachings, a method for providing at least one estimation of a head-related transfer function which is adapted to a user so that an audio signal processed by means of the at least one estimation of the head-related transfer function gives the user a spatial hearing impression is proposed, the method comprising: receiving individual hearing system information from a mobile telephone via a mobile radio network, wherein the individual hearing system information relates to the user; generating the at least one estimation of the head-related transfer function on the basis of the individual hearing system information; transferring the estimation of the head-related transfer function adapted to the user via the mobile radio network to the mobile telephone. A signal, like, for example, an audio signal, a video signal or a data signal may be stored on a digital storage medium or may be transmitted on a transmission medium like, for example, a wireless transmission medium or a wire-bonded transmission medium, like, for example, the internet. Depending on the respective implementation requirements, embodiments of the invention may be implemented in hardware or in software. The implementation may be using a digital storage medium, for example a floppy disc, a DVD, a Blu-Ray disc, a CD, an ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, a hard disc or another magnetic or optical memory on which electronically readable control signals are stored which may cooperate with a programmable computer system or do cooperate with the same such that the respective method is executed. Thus, the digital storage medium may be computer- readable.

Some embodiments according to the invention thus include a data carrier comprising electronically readable control signals which are able to cooperate with a programmable computer system such that one of the methods described herein is executed. In general, embodiments of the present invention may be implemented as a computer program product having a program code, wherein the program code is effective in order to execute one of the methods when the computer program product is executed on a computer,

The program code may, for example, be also stored on a machine-readable carrier.

Other embodiments include the computer program for executing one of the methods described herein, wherein the computer program is stored on a machine-readable carrier.

In other words, an embodiment of the inventive method is thus a computer program comprising a program code for executing one of the methods described herein, when the computer program is executed on a computer. A further embodiment of the inventive method is thus a data carrier (or a digital storage medium or a computer-readable medium) on which the computer program for executing one of the methods described herein is recorded.

A further embodiment of the inventive method is thus a data stream or a sequence of signals representing the computer program for executing one of the methods described herein. The data stream or the sequence of signals may, for example, be configured in order to be transferred via a data communication connection, like, for example, via the internet. A further embodiment includes a processing means, for example a computer or a programmable logics device which is configured or adapted to execute one of the methods described herein.

A further embodiment includes a computer on which the computer program for executing one of the methods described herein is installed.

A further embodiment according to the invention includes a device or a system which is, for example, implemented to transfer a computer program for executing at least one of the methods described herein to a receiver. The transfer may, for example, be executed electronically or optically. The receiver may, for example, be a computer, a mobile device, a storage device or a similar device. The device or the system may, for example, be a file server for transferring the computer programmer to the receiver. In some embodiments, a programmable logic device (for example a field-programmable gate array, an FPGA) may be used to execute some or all functionalities of the methods described herein. In some embodiments, a field-programmable gate array may cooperate with a microprocessor to execute one of the methods described herein. In general, the methods are in some embodiments executed by any hardware device. This may be a universally usable hardware like a computer processor (CPU) or hardware-specific for the method, like, for example, an ASIC.

The above-described embodiments merely represent an illustration of the principles of the present invention. It is obvious that modifications and variations of the arrangements and details described herein will be obvious for other persons skilled in the art. It is thus intended to restrict the invention only by the scope of the subsequent patent claims and not by the specific details presented by the description and the explanation of the embodiments herein.

While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations and equivalents as fall within the true spirit and scope of the present invention.

Claims

1. A mobile communication device (100), including: an interface (112) for receiving individual hearing system information of a user of the mobile communication device, the individual hearing system information characterizing a hearing system of the user; a parameter adaptation means (1 14) for providing at least one estimation of an head related transfer function (H TF) adapted to the user with regard to the individual hearing system information; an audio data source for providing audio data; an audio signal processor (118) for receiving the audio data from the audio data source (1 16) and the at least one estimation of the head related transfer function (HRTF) adapted to the user, for processing the audio data according to the at least one estimation of the head related transfer function (HRTF) and for providing an audio signal resulting therefrom giving the user a spatial hearing impression; and a radio interface implemented to receive the individual hearing system information from the interface (1 12) for the individual hearing system information and to transfer the same to a server, and which is further implemented to receive the at least one estimation of the head related transfer function (HRTF) adapted to the user from the server and pass the same on to the audio signal processor (1 18).

2. The mobile communication device (100) according to claim 1 , further including: an image recording device (31 1) implemented to provide at least a part of the individual hearing system information in the form of an image at least partially showing the user for the parameter adaptation means (114).

3. The mobile communication device (100) according claims 1 or 2, wherein the audio data source (116) is implemented to provide sample audio signals for a reproduction for the user, and wherein the interface (1 12) for receiving the individual hearing system information is implemented to receive an evaluation of the sample audio signals by the user with respect to the spatial hearing impression as at least one part of the individual hearing system information.

4. The mobile communication device (100) according to one of claims 1 to 3, wherein the audio signal processor (1 18) is implemented to process at least two audio data streams of different origin such that the user perceives the at least two audio data streams according to the spatial hearing impression as being spaced apart from each other.

5. The mobile communication device (100) according to one of claims 1 to 4, wherein the audio data source (1 16) includes at least one of the following: a mobile radio receive section, an internal storage of the mobile communication device, a removable medium, a computer connected via an online connection and a streaming media connection.

6. A parameterization server (700, 800), including: a parameter generator (714) for providing an estimation of an head related transfer function (HRTF) adapted to a user on the basis of individual hearing system information, the individual hearing system information characterizing a hearing system of the user; a mobile radio network interface (715) for receiving the individual hearing system information from a mobile communication device (100) which relate to the user of the mobile communication device, and for transferring the estimation of the head related transfer function (HRTF) adapted to the user to the mobile communication device (100) for processing audio data by the mobile communication device (100) according to the at least one estimation of the head related transfer function (HRTF) and for providing an output signal resulting therefrom which gives the user a spatial hearing impression.

7. The parameterization server according to claim 6, wherein the parameter generator (714) includes a database (840) with data sets which include predetermined individual hearing system information and at least one estimation of the head related transfer function (HRTF), and wherein the parameter generator (714) is implemented to select at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio interface, and to transfer the at least one associated estimation of the head related transfer function (HRTF) to the mobile communication device (100).

8. The parameterization server according to claims 6 or 7, wherein the parameter generator (714) is implemented to evaluate a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input quantity and which provides the at least one estimation of the head related transfer function (HRTF) as an output quantity.

9. The parameterization server according to one of claims 6 to 8, further including: an image processing means (817) which is implemented for receiving an image at least partially illustrating the user from the mobile radio interface and for analyzing the image with respect to at least one part of the individual hearing system information and for providing at least that part of the individual hearing system information to the parameter generator (714).

10. The parameterization server according to one of claims 6 to 9, further including: a sample audio signal collection (825) with sample audio signals; wherein the mobile radio interface is implemented to transfer at least one of the sample audio signals to the mobile communication device (100) and to receive, from the mobile communication device (100), a user evaluation of the at least one sample audio signal with respect to the spatial hearing impression as being at least a part of the individual hearing system information.

11 The parameterization server according to one of claims 6 to 10, further including a billing unit (830) which is implemented for generating billing information with respect to a completed generation or transfer of the at least one estimation of the head related transfer function (HRTF) to the mobile communication device (100).

12. A method for generating an audio signal output by a mobile communication device (100), including: receiving individual hearing system information o the user, the individual hearing system information characterizing a hearing system of the user; transferring the individual hearing system information via a radio connection to a server (700, 800); determining at least one estimation of an head related transfer function (HRTF) adapted to the user at least with regard to the individual hearing system information; receiving the at least one estimation of the head related transfer function (HRTF) adapted to the user via the radio connection from the server providing audio data; processing the audio data according to the at least one estimation of the head related transfer function (HRTF); and outputting the resulting audio signal output from the mobile communication device (100) which gives a spatial hearing impression to the user.

The method according to claim 12, further including: recording an image at least partially showing the user which forms at least part of the individual hearing system information.

The method according to claims 12 or 13, further including: reproducing sample audio signals for the user; receiving an evaluation of the sample audio signals by the user with respect to the spatial hearing impression; adding the evaluation to the individual hearing system information.

The method according to one of claims 12 to 14, wherein processing the audio data is executed according to a first spatial sound source position, the method further including: processing further audio data according to the at least one estimation of the head related transfer function (HRTF) adapted to the user and a second spatial sound source position so that the user perceives the further audio data as being spaced apart from the audio data with respect to the spatial hearing impression.

16. A method for providing at least one estimation of an head related transfer function (HRTF) which is adapted to a user so that an audio signal processed by means of the at least one estimation of the head related transfer function (HRTF) gives the user a spatial hearing impression, the method including: receiving individual hearing system information from a mobile communication device (100) via a mobile radio network, wherein the individual hearing system information relates to the user and characterizes a hearing system of the user; generating the at least one estimation of the head related transfer function (HRTF) on the basis of the individual hearing system information; transferring the estimation of the head related transfer function (HRTF) adapted to the user via the mobile radio network to the mobile communication device (100).

17. The method according to claim 16, further including: querying a database including data sets including predetermined individual hearing system information and at least one allocated estimation of the head related transfer function (HRTF), selecting at least one data set whose individual hearing system information is similar or identical to the individual hearing system information received via the mobile radio network; transferring the at least one allocated estimation of the head related transfer function (HRTF) to the mobile communication device (100).

18. The method according to claims 16 or 17, further including: evaluating a function into which at least a part of the individual hearing system information received from the mobile radio interface is introduced as an input quantity and which provides the at least one estimation of the head related transfer function (HRTF) as an output quantity.

19. The method according to one of claims 16 to 18, further including: receiving the image at least partially showing the user via the mobile radio network; analyzing the image with respect to at least one part of the individual hearing system information, using at least that part of the individual hearing system information for generating the at least one estimation of the head related transfer function (HRTF).

20. The method according to one of claims 16 to 19, further including: transferring at least one sample audio signal via the mobile radio network to the mobile communication device (100); receiving a user evaluation of the at least one sample audio signal with respect to the spatial hearing impression being at least a part of the individual hearing system information.

21. The method according to one of claims 16 to 20, further including: generating billing information with respect to a completed generation or transfer of the at least one estimation of the head related transfer function (HRTF) to the mobile communication device (100).