It’s easiest to describe aliasing in terms of a visual sampling system we all know and love—movies. If you’ve ever watched a western and seen the wheel of a rolling wagon appear to be going backwards, you’ve witnessed aliasing. The movie’s frame rate isn’t adequate to describe the rotational frequency of the wheel, and our eyes are deceived by the misinformation!
The Nyquist Theorem tells us that we can successfully sample and play back frequency components up to one-half the sampling frequency. Aliasing is the term used to describe what happens when we try to record and play back frequencies higher than one-half the sampling rate.
Consider a digital audio system with a sample rate of 48 KHz, recording a steadily rising sine wave tone. At lower frequency, the tone is sampled with many points per cycle. As the tone rises in frequency, the cycles get shorter and fewer and fewer points are available to describe it. At a frequency of 24 KHz, only two sample points are available per cycle, and we are at the limit of what Nyquist says we can do. Still, those two points are adequate, in a theoretical world, to recreate the tone after conversion back to analog and low-pass filtering.
But, if the tone continues to rise, the number of samples per cycle is not adequate to describe the waveform, and the inadequate description is equivalent to one describing a lower frequency tone—this is aliasing.
In fact, the tone seems to reflect around the 24 KHz point. A 25 KHz tone becomes indistinguishable from a 23 KHz tone. A 30 KHz tone becomes an 18 KHz tone.
In music, with its many frequencies and harmonics, aliased components mix with the real frequencies to yield a particularly obnoxious form of distortion. And there’s no way to undo the damage. That’s why we take steps to avoid aliasing from the beginning.
A human ear can hear until a frequency up to 20kHz. So why do we have a bit rate by 44.1 and not 40kHz?
“The Nyquist Theorem tells us [...] one-half the sampling frequency”
Because our lowpass filter will start to roll off the response at 20 kHz, and it will still take a very steep filter to have the response down as far as we’d like by 22.05 kHz.
Because of broadcast television and being a multiple of framerates including drop-frame broadcast standards.
Yes, that’s why it’s the exactly number 44.1 kHz (and similar reasons for 48 kHz for DAT), but I think the question was, in essence, why is it more than 40 kHz. The main criteria for choosing the sample rate was to be high enough to be twice the accepted range of human hearing, plus a “stop band” to give room for the reconstruction filter to roll off adequately, but still be as low as practical within those constraints (to maximize the recording time). The exact number of 44.1 kHz was chosen because it fit those constraints, and gave some potential initial advantage to converting video gear for CD manufacturing. But that is secondary and not critical. They had to decide on a number; that one was in the acceptable range, and had a potential benefit at the time.
Why 44.1kHz
We were asked to answer this question for a client recently!
http://www.cs.columbia.edu/~hgs/audio/44.1.html
Thanks a lot!
thank 4 helping a Nigerian student