[논문 리뷰] Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation 5 minute read