# bert-base-cased-finetuned-cola
This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the GLUE CoLA dataset. It was trained to compare [google/fnet-base](https://huggingface.co/google/fnet-base) against [bert-base-cased](https://huggingface.co/bert-base-cased) on text classification tasks. On the evaluation set it reaches a Matthews correlation of 0.5957.
## 🚀 Quick Start
This model was trained with the [run_glue](https://github.com/huggingface/transformers/blob/master/examples/pytorch/text-classification/run_glue.py) script. The following command was used for training:
```bash
#!/usr/bin/bash
python ../run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name cola \
  --do_train \
  --do_eval \
  --max_seq_length 512 \
  --per_device_train_batch_size 16 \
  --learning_rate 2e-5 \
  --num_train_epochs 3 \
  --output_dir bert-base-cased-finetuned-cola \
  --push_to_hub \
  --hub_strategy all_checkpoints \
  --logging_strategy epoch \
  --save_strategy epoch \
  --evaluation_strategy epoch
```
## ✨ Features
- A fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the GLUE CoLA dataset.
- Achieves a loss of 0.6747 and a Matthews correlation of 0.5957 on the evaluation set.
- Fine-tuned to compare [google/fnet-base](https://huggingface.co/google/fnet-base) against [bert-base-cased](https://huggingface.co/bert-base-cased).
## 📦 Installation
No installation steps beyond the training script are given in the original card; see the Framework Versions section below for the library versions used.
## 💻 Usage Examples
The original card provides no code examples; the snippet below is a minimal inference sketch.
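A minimal sketch, assuming the checkpoint is loaded from the training `--output_dir` shown above (or from the corresponding Hub repository id), using the 🤗 Transformers `pipeline` API:

```python
from transformers import pipeline

# Load the fine-tuned checkpoint for CoLA acceptability classification.
# "bert-base-cased-finetuned-cola" matches the training --output_dir;
# substitute the Hub repository id if loading remotely.
classifier = pipeline("text-classification", model="bert-base-cased-finetuned-cola")

# CoLA is a binary task: each sentence is judged grammatically acceptable or not.
print(classifier("The book was written by the author."))
# e.g. [{'label': 'LABEL_1', 'score': ...}] (label names depend on the checkpoint config)
```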
## 📚 Documentation
### Model Information
| Property | Details |
|----------|---------|
| Model Name | bert-base-cased-finetuned-cola |
| Base Model | [bert-base-cased](https://huggingface.co/bert-base-cased) |
| Fine-tuned Dataset | GLUE CoLA |
| Comparison Model | [google/fnet-base](https://huggingface.co/google/fnet-base) |
| Paper for Comparison | this paper |
### Evaluation Results
This model achieves the following results on the evaluation set:
- Loss: 0.6747
- Matthews Correlation: 0.5957
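For reference, Matthews correlation is the standard GLUE metric for CoLA. A toy illustration with scikit-learn, not from the original card (run_glue.py computes the same metric internally via the GLUE "cola" metric):

```python
from sklearn.metrics import matthews_corrcoef

# Toy data only, to show what the reported 0.5957 measures.
labels      = [1, 0, 1, 1, 0, 1]   # 1 = acceptable, 0 = unacceptable
predictions = [1, 0, 0, 1, 0, 1]
print(matthews_corrcoef(labels, predictions))  # ~0.7071 on this toy data
```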
### Training Procedure
#### Training Command

The command is identical to the one shown in the Quick Start section above.
#### Training Hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3.0
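For readers configuring the 🤗 `Trainer` directly rather than via `run_glue.py`, the settings above translate roughly into the following `TrainingArguments`; this is a sketch assuming the script's defaults, not part of the original card:

```python
from transformers import TrainingArguments

# Mirrors the run_glue.py flags above (eval batch size 8 and seed 42 are
# script defaults; --max_seq_length is a data argument of run_glue.py,
# not a TrainingArguments field).
args = TrainingArguments(
    output_dir="bert-base-cased-finetuned-cola",
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=8,
    num_train_epochs=3.0,
    seed=42,
    lr_scheduler_type="linear",
    logging_strategy="epoch",
    save_strategy="epoch",
    evaluation_strategy="epoch",
    push_to_hub=True,
    hub_strategy="all_checkpoints",
)
```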
#### Training Results
| Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
|:-------------:|:-----:|:----:|:---------------:|:--------------------:|
| 0.4921        | 1.0   | 535  | 0.5283          | 0.5068               |
| 0.2837        | 2.0   | 1070 | 0.5133          | 0.5521               |
| 0.1775        | 3.0   | 1605 | 0.6747          | 0.5957               |
#### Framework Versions
- Transformers 4.11.0.dev0
- Pytorch 1.9.0
- Datasets 1.12.1
- Tokenizers 0.10.3
## 🔧 Technical Details
The model is fine-tuned on the GLUE CoLA dataset to compare the performance of [google/fnet-base](https://huggingface.co/google/fnet-base) and [bert-base-cased](https://huggingface.co/bert-base-cased). Training uses the [run_glue](https://github.com/huggingface/transformers/blob/master/examples/pytorch/text-classification/run_glue.py) script with the hyperparameters listed above; the final checkpoint reaches a Matthews correlation of 0.5957 on the evaluation set.
## 📄 License
This model is licensed under the Apache-2.0 license.