Customer-Reviews-Classification Open Source Model - Free Deployment for Precise Classification of Customer Feedback Texts

Customer Reviews Classification

Developed by dnzblgn

A DistilBERT fine-tuned model for customer feedback classification, accurately categorizing text into six predefined categories

Text Classification

Transformers

English

Open Source License: Apache-2.0

#Customer Feedback Classification #High Accuracy #DistilBERT Fine-tuning

Downloads 28

Release Time: 10/4/2024

Model Overview

This model is designed for document classification. It assigns each piece of customer feedback to one of six categories: Shipping and Delivery, Customer Service, Price and Value, Quality and Performance, Use and Design, and Other.

Model Features

Efficient Classification

Uses the DistilBERT architecture to capture syntactic patterns in text for accurate classification

Multi-category Support

Supports classification of customer feedback into six major categories, covering common feedback types

High Accuracy

Achieves 94.7% accuracy on the evaluation dataset

Model Capabilities

Text Classification

Customer Feedback Analysis

Sentiment Orientation Recognition

Use Cases

Customer Service

Automatic Feedback Classification

Automatically classifies customer feedback into predefined categories for subsequent processing

Improves customer service efficiency and reduces manual classification time

Product Improvement Analysis

Uses the classification results to break down customer reviews by aspect

Helps product teams identify areas for improvement

🚀 Customer Reviews Classification Model

This fine-tuned DistilBERT model is designed for document classification: it assigns each piece of customer feedback to one of six predefined categories. It captures the syntactic patterns of the text and classifies each document based on its content, style, and structure.

🚀 Quick Start

Here is an example of how to use this model for inference:

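from transformers import pipeline

# Load the fine-tuned classifier from the Hugging Face Hub (downloaded on first use).
classifier = pipeline("text-classification", model="dnzblgn/Customer-Reviews-Classification")

# Classify one review; the result is a list with one {"label", "score"} dict per input.
result = classifier("The product arrived on time and was exactly as described.")
print(result)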
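Once predictions are available, a common next step is to sort feedback into per-category buckets. The sketch below is only an illustration: `group_by_label` is a hypothetical helper (not part of transformers), the predictions are hand-written stand-ins shaped like the pipeline output, and the label strings are assumed. The names the model actually emits depend on its configuration and may differ.

```python
from collections import defaultdict


def group_by_label(texts, predictions, min_score=0.0):
    """Group texts by predicted label, skipping predictions below min_score."""
    groups = defaultdict(list)
    for text, pred in zip(texts, predictions):
        if pred["score"] >= min_score:
            groups[pred["label"]].append(text)
    return dict(groups)


# Hand-written stand-ins for classifier(texts); label names are assumptions.
texts = [
    "The parcel arrived two days late.",
    "Support answered my question within minutes.",
    "The courier left the box in the rain.",
    "Not sure what to say about this one.",
]
predictions = [
    {"label": "Shipping and Delivery", "score": 0.97},
    {"label": "Customer Service", "score": 0.95},
    {"label": "Shipping and Delivery", "score": 0.91},
    {"label": "Other", "score": 0.42},
]

# Keep only predictions the model is reasonably sure about (0.5 is arbitrary).
grouped = group_by_label(texts, predictions, min_score=0.5)
for label, items in grouped.items():
    print(label, len(items))
```

With a real model, `predictions` would be replaced by the list returned from `classifier(texts)`.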

✨ Features

Classifies customer feedback into six categories: Shipping and Delivery, Customer Service, Price and Value, Quality and Performance, Use and Design, and Other.
Uses DistilBERT's transformer-based architecture to capture syntactic patterns in text.

📦 Installation

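The original model card lists no installation steps. The Quick Start example needs the transformers library and a backend such as PyTorch (for example, pip install transformers torch).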

📚 Documentation

Model Description

This fine-tuned DistilBERT model is specifically designed for document classification. It classifies customer feedback into six predefined categories: Shipping and Delivery, Customer Service, Price and Value, Quality and Performance, Use and Design, and Other. By leveraging the transformer-based architecture of DistilBERT, the model efficiently handles the syntactic patterns of text, providing accurate document classification based on content, style, and structure.

Model type: DistilBERT (fine-tuned for text classification)
Language(s) (NLP): English
License: Apache 2.0
Finetuned from model: distilbert/distilbert-base-uncased

Bias, Risks, and Limitations

While the model achieves high accuracy across the six categories, it is limited when categories overlap or when a single document carries more than one label. The model performs single-label classification, so it returns exactly one label per document. If a document has features of several categories (e.g., both 'Quality and Performance' and 'Price and Value'), the model reports only one of them and the other is missed, which can amount to a misclassification.

Recommendations

💡 Usage Tip

Users (both direct and downstream) should be aware of the model's single-label prediction limitation. In cases where a document contains features of multiple categories, additional models or multi-label classification techniques should be considered.
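One lightweight way to spot documents that may span several categories is to look at the full score distribution rather than only the top label. In recent transformers versions the text-classification pipeline returns a score for every label when called with `top_k=None`. The sketch below assumes scores in that shape; `candidate_labels` is a hypothetical helper and the 0.2 margin is an arbitrary illustration, not a tuned value. Softmax scores from a single-label model are not calibrated multi-label probabilities, so this is a heuristic for flagging reviews for a closer look, not a substitute for a multi-label model.

```python
def candidate_labels(scores, margin=0.2):
    """Return the top label, plus the runner-up if its score is within margin."""
    ranked = sorted(scores, key=lambda s: s["score"], reverse=True)
    top = ranked[0]
    labels = [top["label"]]
    if len(ranked) > 1 and top["score"] - ranked[1]["score"] <= margin:
        labels.append(ranked[1]["label"])
    return labels


# Hand-written stand-ins for classifier(text, top_k=None); values are made up.
mixed_review = [
    {"label": "Quality and Performance", "score": 0.48},
    {"label": "Price and Value", "score": 0.41},
    {"label": "Other", "score": 0.11},
]
clear_review = [
    {"label": "Shipping and Delivery", "score": 0.93},
    {"label": "Customer Service", "score": 0.04},
    {"label": "Other", "score": 0.03},
]

mixed_result = candidate_labels(mixed_review)
clear_result = candidate_labels(clear_review)
print(mixed_result)
print(clear_result)
```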

Training Data

A custom synthetic dataset was created for this task, focusing on the structural features of text. The dataset provides examples from six categories, helping the model learn from both the syntactic organization and the meaning of the text.

Training Hyperparameters

Hyperparameter	Value
Base Model	distilbert/distilbert-base-uncased
Learning Rate	3e-5
Epochs	7
Train Batch Size	16
Gradient Accumulation Steps	2
Weight Decay	0.015
Warm-up Ratio	0.1
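To make the interaction of these settings concrete, the sketch below works out the effective batch size and an approximate warm-up length. It assumes single-device training, and the dataset size of 1000 examples is purely hypothetical (the source does not state it). Exact step counts depend on how the training framework rounds partial batches, so the numbers should be read as estimates.

```python
import math

# Values taken from the hyperparameter table above.
TRAIN_BATCH_SIZE = 16
GRAD_ACCUM_STEPS = 2
EPOCHS = 7
WARMUP_RATIO = 0.1


def schedule(num_examples):
    """Estimate optimizer-step counts for a dataset of num_examples."""
    # Gradients from GRAD_ACCUM_STEPS batches are summed before each update.
    effective_batch = TRAIN_BATCH_SIZE * GRAD_ACCUM_STEPS
    batches_per_epoch = math.ceil(num_examples / TRAIN_BATCH_SIZE)
    updates_per_epoch = math.ceil(batches_per_epoch / GRAD_ACCUM_STEPS)
    total_updates = updates_per_epoch * EPOCHS
    # The learning rate ramps up over the first WARMUP_RATIO share of updates.
    warmup_updates = math.ceil(total_updates * WARMUP_RATIO)
    return {
        "effective_batch": effective_batch,
        "updates_per_epoch": updates_per_epoch,
        "total_updates": total_updates,
        "warmup_updates": warmup_updates,
    }


plan = schedule(1000)  # 1000 is a hypothetical dataset size
print(plan)
```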

Evaluation

The model was evaluated on a custom dataset covering the same six document categories. Performance was measured by accuracy, precision, recall, and F1-score across the categories.

Metrics

Metric	Value
Accuracy	0.947
Precision	0.948
Recall	0.948
F1-Score	0.948
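For readers who want to compute the same four metrics on their own labelled feedback, here is a minimal pure-Python sketch. It uses macro averaging (an unweighted mean over classes), which is an assumption: the source does not say how precision, recall, and F1 were averaged. `macro_metrics` is a hypothetical helper and the toy labels are made up.

```python
def macro_metrics(y_true, y_pred):
    """Compute accuracy and macro-averaged precision, recall, and F1."""
    pairs = list(zip(y_true, y_pred))
    labels = sorted(set(y_true) | set(y_pred))
    accuracy = sum(t == p for t, p in pairs) / len(pairs)

    precisions, recalls, f1s = [], [], []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Toy gold labels and predictions, for illustration only.
y_true = ["Other", "Other", "Price and Value", "Price and Value",
          "Use and Design", "Use and Design"]
y_pred = ["Other", "Price and Value", "Price and Value", "Price and Value",
          "Use and Design", "Other"]

metrics = macro_metrics(y_true, y_pred)
print({name: round(value, 3) for name, value in metrics.items()})
```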

For access to the synthetic dataset used, please contact: [deniz.bilgin@uni-konstanz.de].

📄 License

This model is licensed under the Apache 2.0 license.
