kvcdn quant
Quantize a.kv artifact to a lower-precision dtype.
Usage
Options
| Option | Required | Description |
|---|---|---|
--input | yes | Path to the source .kv artifact |
--output | yes | Path to write the quantized artifact |
--dtype | yes | Target data type, e.g. F16, BF16, I8 |