Contemporary Model Compression on Large Language Models Inference | Synapse