1. Power and SNR
We start from a discrete-time signal (after sampling, possibly before or after quantization).
For SNR, we think in terms of power:
-
Average signal power (discrete-time):
Intuition: if behaves like a voltage, then power is proportional to . Any constant factors (like ) cancel when we form ratios.
Quantization introduces an error:
where is the quantized value.
-
Noise power:
Then the signal-to-noise ratio (SNR) is
and in decibels (for power ratios):
Some useful rules of thumb:
- dB power ratio of
- dB power ratio of
- dB power ratio of about
- dB power ratio of about
2. How bit depth controls SNR
Assume a uniform mid-tread quantizer:
- Bit depth: bits quantization levels.
- Range: [-X_\max, X_\max], symmetric.
- Step size: \Delta = \frac{2 X_\max}{L} = \frac{2 X_\max}{2^B}
Quantization error model
Textbook assumptions:
- The input is “busy enough” and not strongly correlated with quantization levels.
- Quantization error is modeled as uniform on .
Then:
- Mean:
- Variance (and thus noise power):
So:
Signal power for a full-scale sine
Take a sinusoid:
The average power is
If the quantizer is used at full scale:
- Set A = X_\max
- Then P_\text{signal} = \dfrac{X_\max^2}{2}
Putting signal and noise together
We already have
\Delta = \frac{2 X_\max}{2^B} \Rightarrow \Delta^2 = \frac{4 X_\max^2}{2^{2B}}So
P_\text{noise} = \frac{\Delta^2}{12} = \frac{4 X_\max^2}{12 \cdot 2^{2B}} = \frac{X_\max^2}{3 \cdot 2^{2B}}Thus
\text{SNR} = \frac{P_\text{signal}}{P_\text{noise}} = \frac{X_\max^2 / 2}{X_\max^2 / (3 \cdot 2^{2B})} = \frac{1/2}{1/(3 \cdot 2^{2B})} = \frac{3 \cdot 2^{2B}}{2}Now convert to dB:
But
and
So
Key conclusions:
- Each extra bit increases SNR by about dB (for a full-scale sine in an ideal uniform quantizer).
- Example:
- bits: dB
- bits: dB
The uniform error model and full-scale sine are idealizations, but this formula is a very common rule of thumb.
3. How sample rate affects the representation
Bit depth and sample rate affect different aspects of the signal representation:
-
Bit depth
- Controls how finely amplitude is quantized.
- Determines quantization step size .
- Directly affects SNR / dynamic range via the dB relation.
-
Sample rate
- Controls how often we sample in time.
- Determines the maximum representable (non-aliased) frequency: roughly the Nyquist frequency .
- Controls time resolution (samples per second).
- Does not directly change the per-sample quantization noise power (for fixed ).
Formally:
- Sampling period:
- If the analog input is band-limited to f_\max and f_s > 2 f_\max, the signal is ideally reconstructible (Nyquist–Shannon sampling theorem).
- Increasing increases the bandwidth of the discrete-time representation (up to ).
In practice:
- Unlike speech, music requires the use of the full frequency spectrum. That means sampling the signal at a higher rate, i.e., the standard sampling rates of music recordings are 44.1kHz or 48 kHz vs. 16 kHz for speech.
- This also means longer sequences.
Noise spectral density intuition
With a fixed bit depth and quantization step :
-
Total quantization noise power over the full Nyquist band is still
-
If you increase , the same total noise power is spread over a wider frequency range
-
Therefore, the noise power per Hz (noise spectral density) decreases.
-
If you later low-pass to a fixed audio band (e.g. – kHz), oversampling plus filtering can improve in-band SNR.
This is the idea behind oversampling converters.
Summary: bit depth vs sample rate
-
Bit depth :
- Controls dynamic range and quantization SNR.
- More bits smaller lower higher SNR.
-
Sample rate :
- Controls bandwidth and time resolution.
- Higher can represent higher frequencies (up to ).
- For fixed and , it does not change the basic dB relationship, but it changes how that noise is distributed over frequency.