dmx.compressor.numerical.observer
Classes
|
Taken from torch.ao.quantization.observer.UniformQuantizationObserverBase |
|
This is a dummy observer that does not do anything |
|
Adapted from torch.ao.quantization.observer.HistogramObserver, still does not support per-channel |
|
Adapted from torch.ao.quantization.observer.MinMaxObserver, supports per-channel |
|
Extending HistogramObserver to allow calculation of percentile from histogram, taken from https://github.com/NVIDIA/TensorRT/blob/master/tools/pytorch-quantization/pytorch_quantization/calib/histogram.py#L287 |
- class dmx.compressor.numerical.observer.DMXObserverBase(dtype: Format, qscheme: qscheme = torch.per_tensor_affine, factory_kwargs: dict | None = None, eps: float = 1.1920928955078125e-07, **kwargs)
Bases:
ObserverBaseTaken from torch.ao.quantization.observer.UniformQuantizationObserverBase
- eps: Tensor
- extra_repr()
Return the extra representation of the module.
To print customized extra information, you should re-implement this method in your own modules. Both single-line and multi-line strings are acceptable.
- class dmx.compressor.numerical.observer.DummyObserver(dtype: Format, ch_axis: int = -1, **kwargs)
Bases:
DMXObserverBaseThis is a dummy observer that does not do anything
- calculate_qparams()
- forward(x)
Define the computation performed at every call.
Should be overridden by all subclasses.
Note
Although the recipe for forward pass needs to be defined within this function, one should call the
Moduleinstance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
- class dmx.compressor.numerical.observer.HistogramObserver(bins: int = 2048, upsample_rate: int = 128, dtype: Format = XP[8, 0](CSN), qscheme: qscheme | None = torch.per_tensor_affine, ch_axis: int = -1, factory_kwargs: dict | None = None, eps=1.1920928955078125e-07, **kwargs)
Bases:
DMXObserverBaseAdapted from torch.ao.quantization.observer.HistogramObserver, still does not support per-channel
- calculate_qparams()
- extra_repr()
Return the extra representation of the module.
To print customized extra information, you should re-implement this method in your own modules. Both single-line and multi-line strings are acceptable.
- forward(x_orig: Tensor) Tensor
Define the computation performed at every call.
Should be overridden by all subclasses.
Note
Although the recipe for forward pass needs to be defined within this function, one should call the
Moduleinstance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
- histogram: Tensor
- max_val: Tensor
- min_val: Tensor
- class dmx.compressor.numerical.observer.MinMaxObserver(dtype: Format = XP[8, 0](CSN), qscheme: qscheme | None = torch.per_tensor_affine, ch_axis: int = -1, factory_kwargs: Dict | None = None, eps: float | None = 1.1920928955078125e-07, **kwargs)
Bases:
DMXObserverBaseAdapted from torch.ao.quantization.observer.MinMaxObserver, supports per-channel
- calculate_qparams()
Calculates the quantization parameters.
- extra_repr()
Return the extra representation of the module.
To print customized extra information, you should re-implement this method in your own modules. Both single-line and multi-line strings are acceptable.
- forward(x_orig)
Define the computation performed at every call.
Should be overridden by all subclasses.
Note
Although the recipe for forward pass needs to be defined within this function, one should call the
Moduleinstance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
- max_val: Tensor
- min_val: Tensor
- reset_min_max_vals()
Resets the min/max values.
- class dmx.compressor.numerical.observer.PercentileObserver(percentile: float = 99.99, **kwargs)
Bases:
HistogramObserverExtending HistogramObserver to allow calculation of percentile from histogram, taken from https://github.com/NVIDIA/TensorRT/blob/master/tools/pytorch-quantization/pytorch_quantization/calib/histogram.py#L287
- calculate_qparams()
- extra_repr()
Return the extra representation of the module.
To print customized extra information, you should re-implement this method in your own modules. Both single-line and multi-line strings are acceptable.