This is the official repository for AMQ. AMQ is an automated mixed-precision quantization library for Large Language Models (LLMs). It uses multi-objective optimization to find the optimal balance ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results