AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration | Synapse