A

All With Prefix T5 Base V1

Developed by doc2query
T5-based doc2query model for document expansion and training data generation
Downloads 574
Release Time : 3/2/2022

Model Overview

This model, based on the T5 architecture, can generate relevant queries for documents to enhance search engine effectiveness or create training data.

Model Features

Multi-prefix Support
Supports multiple prefix inputs, enabling generation of different types of output text based on different prefixes
Document Expansion
Can generate 20-40 relevant queries for a document to help bridge the vocabulary gap in lexical search
Training Data Generation
Can be used to generate (query, text) pairs for unlabeled text to train embedding models

Model Capabilities

Text Generation
Query Generation
Document Expansion
Training Data Generation

Use Cases

Information Retrieval
Search Engine Enhancement
Index generated queries alongside original documents to improve BM25 search engine performance
Demonstrated significant improvement in search effectiveness in BEIR evaluations
Machine Learning
Embedding Model Training
Generate (query, text) pairs for training dense embedding models
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase