AFFINEQUANT: AFFINE TRANSFORMATION QUANTI ZATION FOR LARGE LANGUAGE MODELS阅读
文章目录 Abstract1 INTRODUCTION2 RELATED WORK3 METHODOLOGY3.1 AFFINEQUANT3.2 REVERSIBILITYANDGRADUALMASK3.3 EFFICIENCY 4 EXPERIMENTS4.1 实现细节4.2 评...
文章目录 Abstract1 INTRODUCTION2 RELATED WORK3 METHODOLOGY3.1 AFFINEQUANT3.2 REVERSIBILITYANDGRADUALMASK3.3 EFFICIENCY 4 EXPERIMENTS4.1 实现细节4.2 评...