Bert Base Uncased Squad1.1 Block Sparse 0.13 V1
This is a question answering system model based on BERT-base-uncased fine-tuned on the SQuAD1.1 dataset, featuring a block sparse structure that retains 32.1% of the original model's weights.
Downloads 25
Release Time : 3/2/2022
Model Overview
The model is primarily used for question answering tasks, capable of answering relevant questions based on provided context. It is case-insensitive and employs dynamic pruning technology to improve evaluation speed.
Model Features
Block Sparse Structure
Linear layers retain only 12.5% of original weights, with overall 32.1% weight retention, achieving 1.65x faster evaluation speed compared to dense networks.
Dynamic Pruning
Utilizes Victor Sanh's improved version of dynamic pruning to optimize model performance.
Attention Head Removal
97 out of 144 attention heads (67.4%) were removed to further optimize the model structure.
Model Capabilities
Question Answering System
Text Understanding
Contextual Answering
Use Cases
Education
Historical Knowledge QA
Answer questions based on historical texts, e.g., 'Where is the Eiffel Tower located?'
Can accurately answer questions within the provided context.
Information Retrieval
Document QA
Extract information from documents and answer related questions.
Can provide accurate answers based on document content.
Featured Recommended AI Models