XLMRoBERTa-SQuADv2: Open-Source Question-Answering Model

Xlmroberta Squadv2

Developed by aware-ai

This is an XLM-RoBERTa-large model fine-tuned on the SQuADv2 dataset for question answering.

Question Answering System

Transformers

#Multilingual Q&A #Long-text comprehension #SQuADv2 fine-tuning

Downloads 15

Release Time : 3/2/2022

Model Overview

This model is based on the XLM-RoBERTa architecture and fine-tuned on the SQuADv2 dataset for extractive question answering. SQuADv2 itself is an English dataset; the multilingual capability comes from the underlying XLM-RoBERTa model.

Model Features

Multilingual support

Built on the XLM-RoBERTa architecture, which has strong cross-lingual understanding capabilities

Optimized for Q&A tasks

Specifically fine-tuned on the SQuADv2 question answering dataset

Long-text processing

Processes input sequences (question plus context) of up to 512 tokens
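
A 512-token limit means that a longer passage has to be split into windows before the model sees it; the `max_seq_length` and `doc_stride` entries in the training configuration control this kind of splitting. The sketch below illustrates the general sliding-window idea on a toy token list. It is a simplified illustration, not the tokenizer's or simpletransformers' actual implementation: the function name `split_into_windows` and the toy sizes are hypothetical, and `stride` is taken here to mean how far the window start advances.

```python
def split_into_windows(context_tokens, max_context_len, stride):
    """Split a token list into windows of at most max_context_len tokens.

    Returns a list of (start_offset, chunk) pairs. The window start advances
    by `stride` tokens each time, so stride < max_context_len gives overlap.
    """
    windows = []
    start = 0
    while start < len(context_tokens):
        chunk = context_tokens[start:start + max_context_len]
        windows.append((start, chunk))
        # Stop once this window reaches the end of the context
        if start + max_context_len >= len(context_tokens):
            break
        start += min(stride, max_context_len)
    return windows


# Toy example: 10 tokens, windows of 4, advancing 2 tokens at a time
tokens = [f"tok{i}" for i in range(10)]
windows = split_into_windows(tokens, max_context_len=4, stride=2)
for start, chunk in windows:
    print(start, chunk)
```

When the stride equals the window length, as with the equal `max_seq_length` and `doc_stride` values in the training configuration, consecutive windows in this sketch do not overlap.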

Model Capabilities

Text comprehension

Question answering system

Cross-lingual processing

Use Cases

Intelligent Q&A

Fact-based Q&A

Answer specific questions based on given text

In the usage example, the model answers a question about Jim Henson from a one-sentence context

Educational applications

Reading comprehension assistance

Helps students understand article content and answer questions

🚀 XLM-RoBERTa-large fine-tuned on SQuADv2

This is an XLM-RoBERTa-large model fine-tuned on the SQuADv2 dataset for question answering.

🚀 Quick Start

To try the model, load it with the transformers library and run question answering as shown in the "Usage Examples" section.

✨ Features

Fine-tuned on the SQuADv2 dataset for question-answering tasks.
Based on the XLM-RoBERTa architecture, which has strong cross-lingual understanding capabilities.

📚 Documentation

🔍 Model details

XLM-RoBERTa was introduced in "XLM-R: State-of-the-art cross-lingual understanding through self-supervision".

🏋️‍♂️ Model training

This model was trained with the following parameters using the simpletransformers wrapper:

train_args = {
    'learning_rate': 1e-5,
    'max_seq_length': 512,
    'doc_stride': 512,
    'overwrite_output_dir': True,
    'reprocess_input_data': False,
    'train_batch_size': 8,
    'num_train_epochs': 2,
    'gradient_accumulation_steps': 2,
    'no_cache': True,
    'use_cached_eval_features': False,
    'save_model_every_epoch': False,
    'output_dir': "bart-squadv2",
    'eval_batch_size': 32,
    'fp16_opt_level': 'O2',
}
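
One consequence of these settings is worth spelling out: with gradient accumulation, the optimizer updates the weights once per `train_batch_size` × `gradient_accumulation_steps` examples. A minimal sketch of that arithmetic, assuming single-device training (the source does not say how many GPUs were used):

```python
# Subset of the training arguments listed above
train_args = {
    "train_batch_size": 8,
    "gradient_accumulation_steps": 2,
}

# Examples seen per optimizer update (assumption: one device)
effective_batch_size = (
    train_args["train_batch_size"] * train_args["gradient_accumulation_steps"]
)
print(effective_batch_size)  # 16
```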

📊 Results

{"correct": 6961, "similar": 4359, "incorrect": 553, "eval_loss": -12.177856394381962}
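
The three counts are easier to read as shares of the evaluation set. The sketch below only does the arithmetic on the numbers reported above. The counts sum to 11,873, which matches the size of the SQuAD v2.0 dev set, so that is presumably the evaluation data, although the source does not say so.

```python
# Counts copied from the reported results (eval_loss omitted)
results = {"correct": 6961, "similar": 4359, "incorrect": 553}

total = sum(results.values())
shares = {name: count / total for name, count in results.items()}

print("total:", total)
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

This prints roughly 58.6% correct, 36.7% similar, and 4.7% incorrect.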

💻 Usage Examples

📋 Basic Usage

from transformers import XLMRobertaTokenizer, XLMRobertaForQuestionAnswering
import torch

# Load the fine-tuned tokenizer and model
tokenizer = XLMRobertaTokenizer.from_pretrained('a-ware/xlmroberta-squadv2')
model = XLMRobertaForQuestionAnswering.from_pretrained('a-ware/xlmroberta-squadv2')
model.eval()

question, text = "Who was Jim Henson?", "Jim Henson was a nice puppet"
encoding = tokenizer(question, text, return_tensors='pt')
input_ids = encoding['input_ids']
attention_mask = encoding['attention_mask']

# Inference only, so skip gradient tracking
with torch.no_grad():
    outputs = model(input_ids, attention_mask=attention_mask)

# Most likely start and end token positions of the answer span
start_idx = int(torch.argmax(outputs.start_logits))
end_idx = int(torch.argmax(outputs.end_logits))

# Decode the answer directly from the selected input ids
answer = tokenizer.decode(input_ids[0, start_idx:end_idx + 1], skip_special_tokens=True)
#answer => 'a nice puppet'
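
The example above takes the argmax of the start and end scores independently, which can yield an end position before the start, and it does not handle the unanswerable questions that SQuADv2 contains. A common post-processing approach is to score every valid span jointly and compare the best one against a "no answer" score. The sketch below shows that idea on plain Python lists. It makes several assumptions: position 0 (the first special token) is treated as the no-answer position, as is conventional for SQuADv2-style models; the function name `best_span` and the `max_answer_len` and `null_threshold` parameters are hypothetical; and it does not restrict spans to the context portion of the input, which a fuller implementation would.

```python
def best_span(start_logits, end_logits, max_answer_len=30, null_threshold=0.0):
    """Return (start, end) of the best valid span, or None for 'no answer'."""
    # Score of predicting "no answer" (both pointers on position 0)
    null_score = start_logits[0] + end_logits[0]

    best_score, best = float("-inf"), None
    for s in range(1, len(start_logits)):
        # Only consider ends at or after the start, within the length limit
        for e in range(s, min(s + max_answer_len, len(end_logits))):
            score = start_logits[s] + end_logits[e]
            if score > best_score:
                best_score, best = score, (s, e)

    # Prefer "no answer" when it beats the best span by more than the threshold
    if best is None or null_score - best_score > null_threshold:
        return None
    return best


# Toy logits: the best span runs from position 2 to position 4
print(best_span([0.1, 0.2, 5.0, 0.3, 0.1], [0.1, 0.1, 0.2, 0.4, 6.0]))
# Toy logits where the no-answer position dominates
print(best_span([9.0, 0.1, 0.2], [9.0, 0.3, 0.1]))
```

With real model output, `outputs.start_logits[0].tolist()` and `outputs.end_logits[0].tolist()` could be passed in, and the returned indices used to slice `input_ids` before decoding.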

Created with ❤️ by A-ware UG
