Msmarco T5 Small V1

Developed by doc2query
T5-based doc2query model for document expansion and training data generation
Downloads 23
Release Time: 3/2/2022

Model Overview

This model is based on the T5 architecture and generates relevant queries for a given input text. It is primarily used for document expansion and for generating domain-specific training data.

Model Features

Document Expansion
Generates 20-40 relevant queries per paragraph; indexing them together with the paragraph helps close the vocabulary gap in lexical search
Training Data Generation
Can be used to generate (query, text) pairs for training powerful dense embedding models
Based on T5 Architecture
Fine-tuned from the google/t5-v1_1-small checkpoint; the small model size keeps query generation efficient
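
The query generation described above can be sketched with the Hugging Face transformers library. This is a minimal sketch, not the authors' reference code: the Hub id doc2query/msmarco-t5-small-v1, the sampling settings (top_p=0.95), the token limits (320 input, 64 output), and the dedupe helper are all assumptions chosen for illustration.

```python
from typing import List


def dedupe(queries: List[str]) -> List[str]:
    """Drop empty and repeated queries (case-insensitive), keeping first-seen order."""
    seen = set()
    unique = []
    for q in queries:
        key = q.strip().lower()
        if key and key not in seen:
            seen.add(key)
            unique.append(q.strip())
    return unique


def generate_queries(
    text: str,
    model_name: str = "doc2query/msmarco-t5-small-v1",  # assumed Hub id
    num_queries: int = 5,
) -> List[str]:
    """Sample several candidate queries for one passage."""
    # Imported here so dedupe() stays usable without torch/transformers installed.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # Assumed limits: truncate long passages, keep generated queries short.
    inputs = tokenizer(text, max_length=320, truncation=True, return_tensors="pt")
    with torch.no_grad():
        outputs = model.generate(
            **inputs,
            max_length=64,
            do_sample=True,  # sampling yields more varied queries than greedy decoding
            top_p=0.95,
            num_return_sequences=num_queries,
        )
    decoded = tokenizer.batch_decode(outputs, skip_special_tokens=True)
    return dedupe(decoded)
```

Calling generate_queries(paragraph, num_queries=20) would approximate the lower end of the 20-40 range mentioned above. Because sampling can repeat itself, the number of queries returned after deduplication may be smaller than requested.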

Model Capabilities

Text Generation
Query Generation
Document Expansion
Training Data Generation

Use Cases

Information Retrieval
Search Engine Optimization
Generate relevant queries for each document and index them together with the document text to improve the effectiveness of traditional BM25 search engines
Performs well on the BEIR benchmark
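
As an illustration of how expansion bridges the vocabulary gap, the sketch below appends queries to a passage before indexing. The passage and the two queries are hand-written placeholders, not actual model output, and the helpers tokenize and expand_document are hypothetical names introduced for this example.

```python
import re
from typing import List, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphanumeric terms."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def expand_document(passage: str, queries: List[str]) -> str:
    """Append generated queries to the passage; the result is what gets indexed."""
    if not queries:
        return passage
    return passage + "\n" + "\n".join(queries)


# Placeholder data for illustration only (not produced by the model).
passage = "Python is an interpreted, high-level programming language."
queries = [
    "what kind of coding language is python",
    "is python compiled or interpreted",
]

expanded = expand_document(passage, queries)

# A user searching for "coding" would miss the original passage in a purely
# lexical index, but would match the expanded text.
in_original = "coding" in tokenize(passage)
in_expanded = "coding" in tokenize(expanded)
```

The expanded string can then be fed to any BM25 index in place of the original passage, while the original text is kept separately for display.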
Machine Learning
Embedding Model Training
Generate (query, text) pairs as training data for dense embedding models
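
One possible way to assemble such pairs is sketched below. The query generator is passed in as a plain callable, so any function that maps a passage to a list of queries will do; build_pairs, to_jsonl, and the JSON field names "query" and "text" are assumptions made for this example, not a format prescribed by the model.

```python
import json
from typing import Callable, Iterable, List, Tuple


def build_pairs(
    passages: Iterable[str],
    query_fn: Callable[[str], List[str]],
) -> List[Tuple[str, str]]:
    """Pair every generated query with the passage it was generated from."""
    pairs = []
    for passage in passages:
        for query in query_fn(passage):
            pairs.append((query, passage))
    return pairs


def to_jsonl(pairs: Iterable[Tuple[str, str]]) -> str:
    """Serialize pairs as one JSON object per line (assumed field names)."""
    return "\n".join(json.dumps({"query": q, "text": t}) for q, t in pairs)


def stub_query_fn(passage: str) -> List[str]:
    """Stand-in for the real model so the sketch runs offline."""
    return ["about " + passage.split()[0].lower()]


example_pairs = build_pairs(["Cats purr.", "Dogs bark."], stub_query_fn)
example_jsonl = to_jsonl(example_pairs)
```

In practice, stub_query_fn would be replaced by a function that calls the model, and the resulting file would serve as positive pairs for training an embedding model.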