Lyrics-BERT Open-Source Sentence Transformer Model - Trained on English lyrics to generate sentence embedding vectors for free

Home

Lyrics Bert

Developed by brunokreiner

A sentence transformer model trained on 480,000 English lyrics for generating 300-dimensional sentence embeddings

Text Embedding

Transformers

#Lyrics Semantic Analysis #English Text Embedding #Contrastive Learning Training

Downloads 568

Release Time : 1/18/2023

Model Overview

This is a BERT-based sentence embedding model, specifically optimized for lyrics text, suitable for semantic search and text clustering tasks. The model was trained using contrastive learning on a large dataset of English lyrics.

Model Features

Lyrics Domain Optimization

Specially trained for the characteristics of lyrics text, delivering superior performance in music-related text processing

Efficient Embedding

Converts sentences into 300-dimensional dense vectors, suitable for downstream tasks

Contrastive Learning Training

Trained using multiple negative ranking loss to enhance semantic representation capabilities

Model Capabilities

Sentence Embedding Generation

Semantic Similarity Calculation

Text Clustering

Semantic Search

Use Cases

Music Recommendation System

Lyrics-Based Similar Song Recommendation

Recommends songs with similar styles by calculating semantic similarity of lyrics

Music Information Retrieval

Semantic-Based Lyrics Search

Enables deep semantic lyrics search beyond keyword matching

🚀 {lyrics-bert}

This is a sentence-transformers model that maps sentences and paragraphs to a 300-dimensional dense vector space. It can be utilized for tasks such as clustering or semantic search.

✨ Features

Vector Mapping: Maps sentences and paragraphs to a 300-dimensional dense vector space.
Training Data: Trained on approximately 480,000 lyrics classified as English by the Python langdetect package.
Training Method: Trained from scratch using contrastive learning and multiple negatives ranking loss.
Base Model: Based on the bert-base-uncased model.

📦 Installation

Using this model becomes easy when you have sentence-transformers installed:

pip install -U sentence-transformers

💻 Usage Examples

Basic Usage (Sentence-Transformers)

from sentence_transformers import SentenceTransformer
sentences = ["This is an example sentence", "Each sentence is converted"]

model = SentenceTransformer('brunokreiner/lyrics-bert')
embeddings = model.encode(sentences)
print(embeddings)

Advanced Usage (HuggingFace Transformers)

Without sentence-transformers, you can use the model like this: First, you pass your input through the transformer model, then you have to apply the right pooling-operation on-top of the contextualized word embeddings.

from transformers import AutoTokenizer, AutoModel
import torch


#Mean Pooling - Take attention mask into account for correct averaging
def mean_pooling(model_output, attention_mask):
    token_embeddings = model_output[0] #First element of model_output contains all token embeddings
    input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
    return torch.sum(token_embeddings * input_mask_expanded, 1) / torch.clamp(input_mask_expanded.sum(1), min=1e-9)


# Sentences we want sentence embeddings for
sentences = ['This is an example sentence', 'Each sentence is converted']

# Load model from HuggingFace Hub
tokenizer = AutoTokenizer.from_pretrained('{MODEL_NAME}')
model = AutoModel.from_pretrained('{MODEL_NAME}')

# Tokenize sentences
encoded_input = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')

# Compute token embeddings
with torch.no_grad():
    model_output = model(**encoded_input)

# Perform pooling. In this case, mean pooling.
sentence_embeddings = mean_pooling(model_output, encoded_input['attention_mask'])

print("Sentence embeddings:")
print(sentence_embeddings)

📚 Documentation

Evaluation Results

For an automated evaluation of this model, see the Sentence Embeddings Benchmark: https://seb.sbert.net

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 300, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False})
)

🔧 Technical Details

This model was trained on around 480,000 lyrics that were classified as English by the Python langdetect package. It was trained from scratch using contrastive learning and multiple negatives ranking loss. The base model was a bert-base-uncased model. It trained for 17 epochs after the loss function stagnated at $1.16 * 10^{-4}$.

📄 Citing & Authors

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご