dmx.compressor.advanced_recipe
Classes
Approximation function extra_params tuning hyperparameters with default values |
|
|
Approximation function extra_params tuning recipe |
|
This is an abstract class of ADVANCED mode recipe. |
|
GPTQ recipe |
|
DmxModule GPTQ hyperparameters with default values |
DmxModule boundary cast quantizers calibration hyperparameters with default values |
|
DmxModule SmoothQuant hyperparameters with default values |
|
Fake quantizer (i.e. CastTo) calibration hyperparameters with default values. |
|
|
Fake quantizer calibration recipe |
|
SLaNC hyperparamters with default values |
|
SLaNC norm tuning for LayerNorm|RMSNorm recipe Paper: https://arxiv.org/abs/2410.10553 |
|
SmoothQuant recipe |
- class dmx.compressor.advanced_recipe.DmxApproximationFunctionTuningHyperparams(search_space: Space | None = None)
Bases:
objectApproximation function extra_params tuning hyperparameters with default values
- search_space: Space | None = None
- class dmx.compressor.advanced_recipe.DmxApproximationFunctionTuningRecipe(hp_gen, **kwargs)
Bases:
DmxBaseRecipeApproximation function extra_params tuning recipe
- class dmx.compressor.advanced_recipe.DmxBaseRecipe(hp_gen: Callable, **kwargs)
Bases:
ABCThis is an abstract class of ADVANCED mode recipe.
- applied_to(_model, save_checkpoint_to: str | None = None)
- class dmx.compressor.advanced_recipe.DmxGPTQRecipe(hp_gen, **kwargs)
Bases:
DmxBaseRecipeGPTQ recipe
- class dmx.compressor.advanced_recipe.DmxModuleGPTQHyperparams(microblock_size: int = 1, block_size: int = 128, percdamp: float = 0.01)
Bases:
objectDmxModule GPTQ hyperparameters with default values
- block_size: int = 128
- microblock_size: int = 1
- percdamp: float = 0.01
- class dmx.compressor.advanced_recipe.DmxModuleQuantizerCalibrationHyperparams(inputs: List[DmxQuantizerCalibrationHyperparams] | None = None, outputs: List[DmxQuantizerCalibrationHyperparams] | None = None, weight: DmxQuantizerCalibrationHyperparams | None = None, weight_storage: DmxQuantizerCalibrationHyperparams | None = None)
Bases:
objectDmxModule boundary cast quantizers calibration hyperparameters with default values
- inputs: List[DmxQuantizerCalibrationHyperparams] | None = None
- outputs: List[DmxQuantizerCalibrationHyperparams] | None = None
- weight: DmxQuantizerCalibrationHyperparams | None = None
- weight_storage: DmxQuantizerCalibrationHyperparams | None = None
- class dmx.compressor.advanced_recipe.DmxModuleSmoothQuantHyperparams(migration_strength: float = 0.5, fuse_to_weight: bool = False)
Bases:
objectDmxModule SmoothQuant hyperparameters with default values
- fuse_to_weight: bool = False
- migration_strength: float = 0.5
- class dmx.compressor.advanced_recipe.DmxQuantizerCalibrationHyperparams(observer_cls: ~torch.ao.quantization.observer.ObserverBase = <class 'dmx.compressor.numerical.observer.HistogramObserver'>, qscheme_to_overload: ~torch.qscheme = torch.per_tensor_symmetric, group_size: int | None = None, ch_axis: int | None = None)
Bases:
objectFake quantizer (i.e. CastTo) calibration hyperparameters with default values
- ch_axis: int | None = None
- group_size: int | None = None
- observer_cls
alias of
HistogramObserver
- qscheme_to_overload: qscheme = torch.per_tensor_symmetric
- class dmx.compressor.advanced_recipe.DmxQuantizerCalibrationRecipe(hp_gen, **kwargs)
Bases:
DmxBaseRecipeFake quantizer calibration recipe
- class dmx.compressor.advanced_recipe.DmxSLaNCHyperparams(position: str | None = None, mlp_type: str | None = None, device: device | None = None, prev_ln_weight: Module | None = None, fc1: Module | None = None, fc2: Module | None = None, up_proj: Module | None = None, down_proj: Module | None = None, gate_proj: Module | None = None, v_proj: Module | None = None, o_proj: Module | None = None)
Bases:
objectSLaNC hyperparamters with default values
- device: device | None = None
- down_proj: Module | None = None
- fc1: Module | None = None
- fc2: Module | None = None
- gate_proj: Module | None = None
- mlp_type: str | None = None
- o_proj: Module | None = None
- position: str | None = None
- prev_ln_weight: Module | None = None
- up_proj: Module | None = None
- v_proj: Module | None = None
- class dmx.compressor.advanced_recipe.DmxSLaNCRecipe(hp_gen, **kwargs)
Bases:
DmxBaseRecipeSLaNC norm tuning for LayerNorm|RMSNorm recipe Paper: https://arxiv.org/abs/2410.10553
- class dmx.compressor.advanced_recipe.DmxSmoothQuantRecipe(hp_gen, **kwargs)
Bases:
DmxBaseRecipeSmoothQuant recipe