smt-grandstaff Open Source SMT Model - Free Deployment for High-precision Transcription of Piano Sheet Music Images

Home

Smt Grandstaff

Developed by antoniorv6

This SMT model was fine-tuned on the Camera GrandStaff piano sheet dataset for piano sheet image transcription tasks.

Image-to-Text

Safetensors

Open Source License:MIT #Piano sheet transcription #End-to-end optical recognition #ConvNext-Transformer architecture

Downloads 136

Release Time : 8/13/2024

Model Overview

The SMT model consists of a visual encoder (ConvNext) and a text decoder (classic Transformer), capable of converting sheet music system images into text representations.

Model Features

End-to-end sheet music recognition

Directly generates sheet music text representations from image input without intermediate processing steps

Piano sheet specialization

Specifically optimized for piano sheets in the Grandstaff dataset

Hybrid architecture

Combines the advantages of visual encoders and text decoders to achieve image-to-text conversion

Model Capabilities

Piano sheet image recognition

Sheet music text generation

Optical music recognition

Use Cases

Music education

Sheet music digitization

Convert paper piano sheets into digital format

Improves sheet music archiving and sharing efficiency

Music production

Automatic scoring

Convert handwritten sheet music into editable digital format

Simplifies music production workflow

Property	Details
Pipeline Tag	image-to-text
Datasets	antoniorv6/grandstaff
Tags	omr, camera_grandstaff
arXiv	2402.07596

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Smt Grandstaff

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Sheet Music Transformer (base model, fine-tuned on the Grandstaff dataset)

🚀 Quick Start

✨ Features

📚 Documentation

Intended uses & limitations

BibTeX entry and citation info

📄 License