Bert Base Uncased Squad1.1 Block Sparse 0.07 V1
This is a BERT-base uncased model fine-tuned on the SQuAD1.1 dataset, featuring a block sparse structure that retains only 28.2% of the original weights, with an evaluation speed 1.92 times faster than the dense network.
Downloads 209
Release Time : 3/2/2022
Model Overview
This model is primarily designed for question-answering tasks, capable of answering relevant questions based on given contexts. It employs dynamic pruning techniques for optimization, improving inference speed while maintaining high accuracy.
Model Features
Block Sparse Structure
Linear layers retain only 7.5% of the original weights, with an overall retention of 28.2%, significantly reducing model size and improving inference speed.
Efficient Inference
Evaluation speed is 1.92 times faster than dense networks while maintaining relatively high accuracy.
Attention Head Optimization
106 out of 144 attention heads (73.6%) were removed, optimizing the model structure.
Model Capabilities
Text-based Question Answering
Context Understanding
Information Extraction
Use Cases
Intelligent Question Answering System
Fact-based Question Answering
Answer specific factual questions based on provided context
EM:71.88, F1:81.36
Educational Applications
Learning Assistance Q&A
Help students quickly find answers to questions from textbook content
Featured Recommended AI Models