B

Bert Base Uncased Squadv1 X2.44 F87.7 D26 Hybrid Filled V1

Developed by madlag
A QA model fine-tuned on SQuAD v1 based on BERT-base uncased and pruned via nn_pruning library, retaining 42% of original weights with 2.44x inference speed improvement
Downloads 17
Release Time : 3/2/2022

Model Overview

This is a BERT model optimized for QA tasks, reducing parameter scale and improving inference efficiency through structured pruning techniques, suitable for English QA scenarios

Model Features

Efficient Inference
Achieves 2.44x faster inference speed than original model through structured pruning
Parameter Optimization
Retains 42% of original weights (only 26% in linear layers), reducing model file size from 420MB to 355MB
Attention Head Pruning
80 out of 144 attention heads removed (55.6%) to optimize computational efficiency

Model Capabilities

English QA
Context Understanding
Text Extraction

Use Cases

Customer Support
Product Knowledge QA
Automatically answer customer inquiries based on product documentation
F1 score 87.71
Education Assistance
Learning Material QA
Extract answers from textbook content
EM score 80.03
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase