# Pure integer inference
Ibert Roberta Large
I-BERT is a pure integer-quantized version of RoBERTa-large, using INT8 to store parameters and integer operations for inference, achieving up to 4x inference acceleration.
Large Language Model
Transformers

I
kssteven
45
0
Ibert Roberta Base
I-BERT is a pure integer quantized version of RoBERTa, storing parameters in INT8 format and using integer operations for inference, significantly improving inference speed.
Large Language Model
Transformers

I
kssteven
2,988
1
Featured Recommended AI Models