Psychoacoustics Literature:
The following are purely on psychoacoustics, written by the leading researchers
in the field:
-
E. Zwicker and U.T. Zwicker, ``Audio engineering and psychoacoustics: Matching
signals to the final receiver, the human auditory system,'' Journal
of the Audio Engineering Society, vol. 39, no. 3, pp. 115-126, Mar.
1991.
-
B. Scharf, ``Critical bands,'' Chapter 5 of Foundations of Modern Auditory
Theory (J. Tobias, ed.), New York: Academic, 1970.
-
E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models,
Berlin: Springer, 1990.
These attempt to quantify audio quality:
-
J.D. Johnston, ``Estimation of perceptual entropy using noise masking criteria,''
Proceedings of the IEEE Conference on Acoustics, Speech, and Signal
Processing, vol. 5, pp. 2524-2527, 1988.
-
J.G. Beerends and J.A. Stemerdink, ``A perceptual audio quality measure
based on a psychoacoustic sound representation,'' Journal of the Audio
Engineering Society, vol. 40, no. 12, pp. 963-978, Dec. 1992.
Perceptual Coding Literature:
Here is an overview of the currently popular perceptual coding schemes
(including MPEG):
K. Brandenburg and M. Bosi, ``Overview of MPEG audio: Current
and future standards for low-bit-rate coding,'' Journal of the Audio
Engineering Society, vol. 45, no. 1/2, pp. 4-21, Jan/Feb. 1997.
These are historically important (pre/non-MPEG) schemes:
-
M.R. Schroeder, B.S. Atal, and J.L. Hall, ``Optimizing digital speech coders
by exploiting masking properties of the human ear,'' Journal of the
Acoustical Society of America, vol. 66, no. 6, pp. 1647-1651, Dec.
1979.
-
J.D. Johnston, ``Transform coding of audio signals using perceptual noise
criteria,'' IEEE Journal on Selected Areas in Communications,
vol. 6, no. 2, pp. 314-323, Feb. 1988.
-
R.N.J. Veldhuis, M. Breeuwer, and R.G. Van der Waal, ``Subband coding of
digital audio signals,'' Philips Journal of Research, vol. 44, pp.
329-343, 1989.
-
K. Brandenburg, J. Herre, J.D. Johnston, Y. Mahieux, and E.F. Schroeder,
``ASPEC--Adaptive spectral entropy coding of high quality music signals,''
Audio Engineering Society Preprint #3011, May 1991.
-
K. Tsutsui, H. Suzuki, O. Shimoyoshi, M. Sonohara, K. Akagiri, and R.M.
Heddle, ``ATRAC: adaptive transform acoustic coding for MiniDisc,'' Audio
Engineering Society Preprint #3456, Dec. 1992.
-
C.C. Todd, G.A. Davidson, M.F. Davis, L.D. Fielder, B.D. Link, and S. Vernon,
``AC-3: Flexible perceptual coding for audio transmission and storage,''
Audio Engineering Society Preprint #3796, May 1994.
MPEG-Specific Literature:
High-level overviews:
-
K. Brandenburg and G. Stoll, ``ISO-MPEG-1 audio: A generic standard
for coding of high-quality digital audio,'' Journal of the Audio Engineering
Society, vol. 42, no. 10, pp. 780-791, Oct. 1994.
-
M. Bosi, K. Brandenburg, S. Quackenbush, L. Fielder, K. Akagiri, H. Fuchs,
M. Dietz, J. Herre, G. Davidson, and Y. Oikawa, ``ISO/IEC MPEG-2 advanced
audio coding,'' Journal of the Audio Engineering Society, vol. 45,
no. 10, Oct. 1997.
Descriptions of implementation:
-
B.G. Haskell, A. Puri, and A.N. Netravali, ``Audio,'' Chapter 4 of Digital
Video: An Introduction To MPEG-2, New York: Chapman & Hall,
1997.
-
U. Zolzer, ``Data Compression,'' Chapter 9 of Digital Audio Signal Processing,
New York: Wiley, 1997.
The actual standards (MPEG 1,2,4 in order):
-
ISO/IEC, International Standard IS 11172-3 ``Information technology
-- Coding of moving pictures and associated audio for digital storage media
at up to about 1.5 Mbits/s -- Part 3: Audio,'' Nov.1992.
-
ISO/IEC, International Standard IS 13818-3 ``Information technology
-- Generic coding of moving pictures and associated audio information --
Part 3: Audio,'' Apr. 1997.
-
ISO/IEC, International Standard IS 14496-3 ``Information technology
-- Coding of audio-visual objects -- Part 3: Audio,'' Oct. 1998.
EE-597 Home