Open-source text classification model bert-base-cased-finetuned-qqp - Comparing performance differences is super practical

Bert Base Cased Finetuned Qqp

Developed by gchhablani

A text classification model fine-tuned on GLUE QQP dataset based on bert-base-cased, used to compare performance differences between fnet-base and bert-base-cased

Text Classification

Transformers

EnglishOpen Source License:Apache-2.0 #Q&A Pair Classification #High Accuracy BERT #Text Similarity

Downloads 255

Release Time : 3/2/2022

Model Overview

This model is a BERT variant fine-tuned on the GLUE QQP (Quora Question Pairs) dataset, primarily used to determine whether two questions are semantically identical.

Model Features

High-performance Text Matching

Achieves 90.8% accuracy and 87.7% F1 score on QQP dataset

Based on BERT Architecture

Utilizes the proven BERT-base-cased architecture with strong semantic understanding capabilities

Comparative Research Purpose

Specifically designed for performance comparison studies with FNet models

Model Capabilities

Text Classification

Semantic Similarity Judgment

Question Pair Matching

Use Cases

Q&A Systems

Duplicate Question Detection

Identifying duplicate questions on Q&A platforms

90.8% accuracy

Information Retrieval

Query Expansion

Expanding search results by identifying semantically similar questions

🚀 bert-base-cased-finetuned-qqp

This model is a fine - tuned version of [bert - base - cased](https://huggingface.co/bert - base - cased) on the GLUE QQP dataset, aiming to compare with [google/fnet - base](https://huggingface.co/google/fnet - base) and achieve high performance in text classification.

🚀 Quick Start

This model is a fine - tuned version of [bert - base - cased](https://huggingface.co/bert - base - cased) on the GLUE QQP dataset. It achieves the following results on the evaluation set:

Loss: 0.3752
Accuracy: 0.9084
F1: 0.8768
Combined Score: 0.8926

The model was fine - tuned to compare [google/fnet - base](https://huggingface.co/google/fnet - base) as introduced in this paper against [bert - base - cased](https://huggingface.co/bert - base - cased).

📦 Installation

This model is trained using the [run_glue](https://github.com/huggingface/transformers/blob/master/examples/pytorch/text - classification/run_glue.py) script. The following command was used:

#!/usr/bin/bash

python ../run_glue.py \\n  --model_name_or_path bert - base - cased \\n  --task_name qqp \\n  --do_train \\n  --do_eval \\n  --max_seq_length 512 \\n  --per_device_train_batch_size 16 \\n  --learning_rate 2e - 5 \\n  --num_train_epochs 3 \\n  --output_dir bert - base - cased - finetuned - qqp \\n  --push_to_hub \\n  --hub_strategy all_checkpoints \\n  --logging_strategy epoch \\n  --save_strategy epoch \\n  --evaluation_strategy epoch \\n

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e - 05
train_batch_size: 16
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon = 1e - 08
lr_scheduler_type: linear
num_epochs: 3.0

Training results

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1	Combined Score
0.308	1.0	22741	0.2548	0.8925	0.8556	0.8740
0.201	2.0	45482	0.2881	0.9032	0.8698	0.8865
0.1416	3.0	68223	0.3752	0.9084	0.8768	0.8926

Framework versions

Transformers 4.11.0.dev0
Pytorch 1.9.0
Datasets 1.12.1
Tokenizers 0.10.3

📚 Documentation

Model Information

Property	Details
Model Type	Fine - tuned version of [bert - base - cased](https://huggingface.co/bert - base - cased) on the GLUE QQP dataset
Training Data	GLUE QQP dataset
Metrics	Accuracy: 0.9084, F1: 0.8768, Combined Score: 0.8926

📄 License

This model is licensed under the Apache - 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご