Vinilm-2021-from-large Open-source Pretrained Language Model - Fast Inference without Compromising Performance

Developed by VMware

A compact pretrained language model that VMware distilled from vBERT-2021-large using MiniLMv2 distillation, improving inference speed while largely preserving performance.

Large Language Model

Transformers

Language: English
Open Source License: Apache-2.0
Tags: #BERT distilled model #Enterprise NLP #Efficient inference

Downloads: 23

Release Time: 5/23/2022

Model Overview

This is VMware's customized pre-trained language model, compressed from a large model through knowledge distillation, suitable for enterprise-level NLP tasks

Model Features

Efficient distillation

Uses MiniLMv2 distillation technology to significantly improve inference speed without noticeable performance loss

Domain optimization

Specifically trained on VMware technical documents and blog content, suitable for enterprise technical scenarios

Lightweight deployment

Smaller in size compared to the original large model, more suitable for production environment deployment

Model Capabilities

Text feature extraction

Information retrieval

Text classification

Use Cases

Enterprise document processing

Technical document retrieval

Used for intelligent search and retrieval of internal technical documents

Content classification

Automatic classification of technical blogs and documents

🚀 viniLM-2021-from-large

A VMware-specific pretrained language model distilled from vBERT-2021-large for faster inference.

🚀 Quick Start

The viniLM-2021-from-large model is a VMware-specific Language Model. It is distilled from vBERT-2021-large to achieve faster inference times without significant performance loss.

✨ Features

Distilled Model: vBERT-2021-large is distilled into a smaller MiniLMv2 model using MiniLMv2 distillation.
Fast Inference: Enables quicker inference while maintaining good performance.
VMware-specific: Designed for VMware-related NLP tasks.

📦 Installation

To use this model, you need the transformers library, plus PyTorch or TensorFlow depending on which example you run. You can install transformers using pip:

pip install transformers

💻 Usage Examples

Basic Usage

Here is how to use this model to get the features of a given text in PyTorch:

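from transformers import BertTokenizer, BertModel
# Load the tokenizer and the distilled model from the Hugging Face Hub
tokenizer = BertTokenizer.from_pretrained("VMware/vinilm-2021-from-large")
model = BertModel.from_pretrained("VMware/vinilm-2021-from-large")
text = "Replace me by any text you'd like."
# Tokenize the text and return PyTorch tensors
encoded_input = tokenizer(text, return_tensors="pt")
# output.last_hidden_state holds one feature vector per input token
output = model(**encoded_input)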

And in TensorFlow:

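from transformers import BertTokenizer, TFBertModel
# Load the tokenizer and the TensorFlow version of the model
tokenizer = BertTokenizer.from_pretrained("VMware/vinilm-2021-from-large")
model = TFBertModel.from_pretrained("VMware/vinilm-2021-from-large")
text = "Replace me by any text you'd like."
# Tokenize the text and return TensorFlow tensors
encoded_input = tokenizer(text, return_tensors="tf")
# output.last_hidden_state holds one feature vector per input token
output = model(encoded_input)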
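
The examples above return one feature vector per token. For retrieval or classification you usually want a single vector per text. The model card does not say how the authors pool token features, so the following is only a minimal sketch of one common approach: mean pooling over non-padded tokens, then cosine similarity between texts. The helpers `mean_pool`, `cosine_similarity`, and `embed` are illustrative names, not part of transformers or of the VMware release, and `embed` assumes the PyTorch tokenizer and model loaded as shown earlier.

```python
import math


def mean_pool(token_vectors, attention_mask):
    """Average token vectors, ignoring positions where the mask is 0 (padding)."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def embed(text, tokenizer, model):
    """Hypothetical helper: one mean-pooled vector for `text` (PyTorch model)."""
    import torch  # imported lazily so the pooling helpers work without PyTorch

    encoded = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():  # inference only, no gradients needed
        output = model(**encoded)
    return mean_pool(
        output.last_hidden_state[0].tolist(),
        encoded["attention_mask"][0].tolist(),
    )
```

With the tokenizer and model loaded, `cosine_similarity(embed(query, tokenizer, model), embed(doc, tokenizer, model))` gives a rough relevance score that can be used to rank documents. Whether mean pooling or the `[CLS]` vector works better for this model is something to check on your own data.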

📚 Documentation

Model Info

Property	Details
Authors	R&D AI Lab, VMware Inc.
Model Date	Jun 2022
Model Version	2021-distilled-from-large
Model Type	Pretrained language model
License	Apache 2.0

Motivation

Using MiniLMv2 distillation, we distilled vBERT-2021-large into a smaller MiniLMv2 model to get faster inference times without a significant loss of performance.

Intended Use

The model functions as a VMware-specific Language Model.

Training

Distilled From: vBERT-2021-large
Initial Weights: nreimers/MiniLMv2-L6-H768-distilled-from-BERT-Large
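
The model card does not describe the distillation code. As a rough illustration of the idea behind MiniLMv2, the sketch below computes a self-attention relation loss for a single relation head: each model's vectors (queries, keys, or values) are turned into a position-by-position relation matrix, and the student is trained to match the teacher's matrix under KL divergence. This is a simplified, assumed reading of the published method, not VMware's training code. The function names are hypothetical, and the real method also splits vectors into several relation heads and sums over query, key, and value relations.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def relation(vectors):
    """Row-wise softmax of scaled dot products between all pairs of positions."""
    scale = math.sqrt(len(vectors[0]))
    return [
        softmax([sum(a * b for a, b in zip(u, v)) / scale for v in vectors])
        for u in vectors
    ]


def relation_kl(teacher_vectors, student_vectors):
    """Mean over positions of KL(teacher relation row || student relation row)."""
    t_rel = relation(teacher_vectors)
    s_rel = relation(student_vectors)
    total = 0.0
    for t_row, s_row in zip(t_rel, s_rel):
        total += sum(p * math.log(p / q) for p, q in zip(t_row, s_row) if p > 0.0)
    return total / len(t_rel)


# Toy inputs: 3 positions, teacher width 4, student width 2 (made-up numbers).
teacher = [[1.0, 0.0, 0.5, 0.2], [0.0, 1.0, 0.1, 0.3], [0.5, 0.5, 0.0, 1.0]]
student = [[0.9, 0.1], [0.1, 0.8], [0.4, 0.6]]
loss = relation_kl(teacher, student)
```

Both relation matrices have shape positions by positions regardless of vector width, which is why a narrower student can be compared directly with a wider teacher.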

Datasets

Publicly available VMware text data, such as VMware Docs and blogs, was used to distill the teacher model vBERT-2021-large into vinilm-2021-from-large. The data was sourced in May 2021 (~320,000 documents).

Preprocessing

Decoding HTML
Decoding Unicode
Stripping repeated characters
Splitting compound words
Spelling correction
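
The vNLP Preprocessor itself is not public, so the steps listed above cannot be reproduced exactly. The following is only a rough standard-library approximation of the first three steps, under stated assumptions: "decoding HTML" is taken to mean unescaping HTML entities, "decoding Unicode" is taken to mean NFKC normalization, and runs of three or more identical letters are collapsed to two. The function name `basic_preprocess` and the `max_repeat` parameter are hypothetical. Compound-word splitting and spelling correction need dictionaries or models and are left out.

```python
import html
import re
import unicodedata


def basic_preprocess(text, max_repeat=2):
    """Approximate the first three listed steps; not the actual vNLP Preprocessor."""
    # Decoding HTML (assumed): turn entities such as &amp; into plain characters.
    text = html.unescape(text)
    # Decoding Unicode (assumed): fold compatibility forms such as full-width letters.
    text = unicodedata.normalize("NFKC", text)
    # Stripping repeated characters: collapse long runs of the same letter.
    # Only letters are matched, so numbers such as 10000 are left alone.
    pattern = r"([^\W\d_])\1{%d,}" % max_repeat
    text = re.sub(pattern, lambda m: m.group(1) * max_repeat, text)
    # Collapse runs of whitespace left over from the steps above.
    return " ".join(text.split())
```

For example, `basic_preprocess("vSphere &amp; vSAN are sooooo   fast")` returns `"vSphere & vSAN are soo fast"`.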

Model performance measures

We benchmarked vinilm on various VMware-specific downstream NLP tasks (IR, classification, etc.).

🔧 Technical Details

Because the model is distilled from vBERT, which is itself based on BERT, it may carry the same biases as the original BERT model. To get the best performance, input data needs to be preprocessed with our internal vNLP Preprocessor (not available to the public).

📄 License

This model is licensed under the Apache 2.0 license.
