Short introduction to the FAM class and
Bark-FAMlet functions
Unto
K. Laine
Purpose of the program
This program generates a set of complex FAM functions defined in the frequency
domain. Their phase characteristic follows the Hz-to-Bark mapping. The
associated time domain FAMlet functions are then produced by inverse Fourier
transforming the FAM functions.
The generated Bark-FAMlet functions are useful tools for auditory based
signal processing and for studies of human auditory perception.
What FAM stands for
Shortly, FAM stands for Frequency-Amplitude Modulated complex exponentials.
They were found by the author of this notebook around 1983. More serious
research on this topic started in the autumn of 1989. The first publication
of these studies was made in 1990 ICASSP meeting [1]. Later the method
was applied to auditory based speech analysis [2-4, 6]. Also related FAM
and FAMlet transforms were studied [5, 7]. Closely related developments
are Warped Linear Prediction [8] (WLP) and Generalized Linear Prediction
[9] (GLP), where the classical theory of LP was extended to analytic signals
and more general predictor structures.
The terminology for the FAM functions was chosen by thinking their general
shape especially when defined in the time domain. The main idea behind
these functions is that we first warp the complex exponential functions
thus introducing a type of frequency modulation ("FM", mathematically:
by replacing the x variable by an inner function g(x)), and then amplitude
modulate them in order to make them orthogonal or even orthonormal ("AM",
mathematically: by using a proper weighting function).
Definition for FAM functions:
FAM[a ,x] = am(x) Exp[a I fm(x)] = Sqrt[dg(x)/dx] Exp[a I g(x)],
where a is the order of the function and g(x) defines an inner function,
or the warping. The "AM"-part is formed from the first derivative
of the g(x).
The equation above shows that the FAM function consists of two parts: an
"AM"-part and an "FM"-part, which are interdependent
according to the FAM idea. This terminology describes the corresponding
features of the functions when they are defined in the time domain. However,
when the FAM functions are defined in the frequency domain the "AM"-function
is equal to the magnitude spectrum of all the FAM functions, and the "FM"-function
(or the inner function) now defines the phase characteristic of the FAM
of order one. The order zero FAM is a zero-phase function (magnitude only)
and the higher order ones have a phase multiple of that of the first order
one (multiple of the phase defined by the"FM"-function).
FAMily!
FAM is a class, or a family of orthonormal functions. This class has many
known members, like Fourier functions ("AM" equals one, no "FM"-property
, i.e., g(x)=x), Laguerre (or slightly modified, when "FM" equals
to the arctan(x)), Chebyshev ("FM" equals to the arccos(x)), Legendre,
etc. FAM sets can form a kernel in orthogonal transforms. Many known transforms
can be generated: Fourier-, Scale- etc.
Up to now the most interesting results are found when the FAM functions
are defined in the frequency domain. The corresponding time domain functions
called FAMlets are finally produced by inverse Fourier transforming the
frequency domain FAM functions. The name FAMlet was chosen to indicate
that they have similar use as the wavelets in certain areas of time-frequency
analysis and perceptual audio processing.
Bark-FAMlets shortly
1. They form an orthonormal set of functions (signals).
2. They have an identical power spectrum defined by the term dg/df on the
Hz-scale.
3. They have equal energy over each critical band, thus forming a set of
clicks with an unform masking property.
4. They are compact clicks in time, the length increasing linearly with
order a.
5. The zeroth order FAMlet is a symmetric pulse. The higher order ones are
similar to frequency chirps, the chirp rate decreasing with increasing order
a.
6. The average rate of the frequency chirp of the Bark-FAMlets is uniform
over the middle part of the Bark-scale (over the Bark channels 5-15). The
chirp rate decreases linearly with the increasing order of the functions.
Thus the higher order FAMlets have longer duration in time.
Present status and the future of the FAMlet research
Bark-FAMlets have recently been used to study the human auditory phase resolution.
In a preliminary study made subjects were able to perceive quality differences
in Bark-FAMlet clicks of less than 1ms in duration when played in a normal
or time-reversed direction. The first publication of the listening experiment
will be presented in the coming ICASSP-96 conference [10].
Bark-FAMlets have been also used as a stimulus in measuring the human auditory
brain stem responses. This study has been going on about two years now
together with Turku University Central Hospital (Dr. Altti Salmivalli).
Interesting new results have been achieved and the first publication will
be released soon [11].
In the summer -95 a new audio coding project started in the HUT Acoustics
Laboratory. In this project the main goal is to study how the FAMlets can
be applied to the broad band perceptual audio coding.
On the basic research side there are many lines going on. One of them is
related to the cochlea modeling. So called WBK approximation has been used
to solve the wave equation for the cochlea. FAM functions seem to give
a new approximative solution.
Context of the program
In the following we first define the Hz-to-Bark warping, derive a function
for the "AM"- and "FM"-part of the FAMs and then generate
the corresponding FAM functions. The related FAMlets are finaly produced
by inverse Fourier transform.
The first part of this notebook describes the derivation of the functions
in detail. At the end of the notebook a compact function for FAMlet generation
is given. NOTE that the notebook must be first evaluated in order to get
the function working (it needs other functions like the Hz-to-Bark mapping).
The notebook includes also some sounds made of Bark-FAMlets.
How to download the program
The above described Mathematica notebook program can be downloaded here (Mathematica Notebook File, filename: FAMlet_gen.m, size 123274 bytes).
For questions concerning the FAMlet software, please contact Dr. Unto K. Laine.
Summary of FAM and FAMlet publications and related topics
(From the very begining up to the summer -95)
[1] Laine U. K., Altosaar T.: An Orthogonal Set of Frequency and Amplitude
Modulated (FAM) Functions for Variable Resolution Signal Analysis. Proc.
of ICASSP-90, Vol. 3, pp. 1615-1618, Albuquerque, New Mexico, April 3-6,
1990.
[2] Laine U. K.: A new high resolution time-Bark analysis method for speech.
Proc. of the XIIth Int. Conference of Phonetic Sciences, Vol. 2, pp. 402-405,
Aix-en-Provence, France 1991.
[3] Laine U. K., Karjalainen M. and Altosaar T.: Time-frequency and multiple-resolution
representations in auditory modeling. Paper summaries of 1991 IEEE ASSP
Workshop on Applications of Signal Processing to Audio and Acoustics, Paper
8. Session 1., New Paltz, USA 1991.
[4] Laine U. K.: Analysis of short fragments of speech using complex orthogonal
auditory transform (COAT). ESCA Workshop "Comparing Speech Signal Representations",
Sheffield, England April 7-9 1992.
[5] Laine U. K.: Famlet, to be or not to be a wavelet. IEEE-SP International
Symposium on Time-Frequency and Time-Scale Analysis, Victoria, British Columbia,
Canada, Oct. 4-6, pp. 335-338, 1992.
[6] Laine U. K.: Speech analysis using complex orthogonal auditory transform
(COAT). Proc. of 1992 Int. Conf. on Spoken Language Processing, Banff, Alberta,
Canada, Oct. 12-16 1992.
[7] Laine U. K.: MSE filter design and spectrum parametrization by orthogonal
FAM Transform. Proc. of the ISCAS-93, Chicago, Illinois, I pp. 148-151,
1993.
[8] Laine U. K., Karjalainen M., Altosaan T., Warped linear prediction (WLP)
in speech and audio processing. Proc. ICASSP-94, Adelaide, South Australia,
III pp. 349-352, 1994.
[9] Laine U. K., Generalized linear prediction based on analytic signals.
Proc. ICASSP-95, Detroit, MI, pp. 1701-1704 , 1995.
[10] Laine U. K. and Huotilainen M.: A study on auditory resolution using
Bark-FAMlet clicks. To be published at ICASSP-96, Atlanta, GA, 1996.
[11] Kähkönen E., Salmivalli A., Laine U. K., Uusipakka E. and
Johansson R.: FAMlet - a new stimulus for BRA (to be published), 1995.