In this project, we aimed to enhance the computational efficiency and deployment feasibility of neural networks through mixed precision quantization. We implemented two quantization-aware training (QAT) methods. Our results demonstrated significant reductions in model bitwidth assignments while maintaining accuracy comparable to fullprecision models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Omar Lahyani
QRU Quaderns de Recerca en Urbanisme
Building similarity graph...
Analyzing shared references across papers
Loading...
Omar Lahyani (Wed,) studied this question.