# VeriGen
VeriGen is a model for automated Verilog RTL code generation. It is a fine-tuned version of an existing code-generation model, trained on a Verilog dataset, and is intended for hardware description language development.
## Quick Start
The model is ready for inference and can be tried directly through the hosted widget; an example input is `module display_hello_word`.
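For a quick local try outside the widget, a minimal sketch using the `transformers` text-generation pipeline is shown below; the checkpoint name is the one used in the usage example later in this card.
```python
# Minimal sketch: run the quick-start prompt locally with the transformers pipeline.
from transformers import pipeline

generator = pipeline("text-generation", model="shailja/fine-tuned-codegen-2B-Verilog")
print(generator("module display_hello_word", max_length=64)[0]["generated_text"])
```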
⨠Features
- Fine - Tuned Model: A 2B parameter fine - tuned version of CodeGen - multi - 2B.
- Trained on Specific Dataset: Trained on Verilog Dataset.
- Long Context Length: Supports a context length of 2048.
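Staying within the 2048-token window means budgeting the prompt length against the requested generation length. The sketch below is only illustrative; it uses the fine-tuned checkpoint from the usage example later in this card.
```python
# Sketch: budget generation length against the 2048-token context window.
from transformers import AutoTokenizer

CONTEXT_LENGTH = 2048
tokenizer = AutoTokenizer.from_pretrained("shailja/fine-tuned-codegen-2B-Verilog")

prompt = "//module half adder "
prompt_tokens = len(tokenizer(prompt).input_ids)
print(f"{prompt_tokens} prompt tokens; up to {CONTEXT_LENGTH - prompt_tokens} tokens available for generation")
```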
## Documentation
### Model Summary
VeriGen is a 2B-parameter model fine-tuned from CodeGen-multi-2B. It was trained on the Verilog dataset with a context length of 2048 tokens.
## Usage Examples
### Basic Usage
The model was trained on Verilog from GitHub and textbooks. It is not an instruction-tuned model, but adding a partial module header such as `module mux` to the prompt can make it a capable Verilog teaching assistant.
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Use the fine-tuned 2B Verilog checkpoint for both tokenizer and model
checkpoint = "shailja/fine-tuned-codegen-2B-Verilog"
device = "cuda"
prompt = "//module half adder "

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint).to(device)
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)
# do_sample=True so that temperature and top_p take effect
sample = model.generate(input_ids, do_sample=True, max_length=128, temperature=0.5, top_p=0.9)
# Truncate at the first "endmodule" and re-append the keyword
print(tokenizer.decode(sample[0], truncate_before_pattern=[r"endmodule"]) + "endmodule")
```
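The model and tokenizer loaded above can be reused with a more structured prompt; the module header below is a hypothetical example in the spirit of the `module mux` suggestion.
```python
# Hypothetical prompt: a partial module header with an explicit port list.
prompt = "module mux(input a, input b, input sel, output out);\n"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)
sample = model.generate(input_ids, do_sample=True, max_length=256, temperature=0.5, top_p=0.9)
print(tokenizer.decode(sample[0], truncate_before_pattern=[r"endmodule"]) + "endmodule")
```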
## Limitations
The model is trained on Verilog source code from open sources. The predominant natural language in the source code is English, though other languages are also present. The generated Verilog code is not guaranteed to work as intended; it may be inefficient and may contain bugs or exploits. For an in-depth discussion of the model's limitations, see [the paper](https://drive.google.com/file/d/1cN-b9GnWtHzQRoE7M7gAEyivY0kl4BYs/view).
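Because a completion may not even parse, a lightweight safeguard is to run generated RTL through a Verilog front end before using it. The sketch below assumes Icarus Verilog (`iverilog`) is installed locally; it is not part of the VeriGen tooling.
```python
# Sketch: syntax-check generated Verilog with Icarus Verilog (assumes `iverilog` is on PATH).
import subprocess
import tempfile

def parses_ok(verilog_source: str) -> bool:
    """Return True if iverilog can parse and elaborate the given source."""
    with tempfile.NamedTemporaryFile("w", suffix=".v", delete=False) as f:
        f.write(verilog_source)
        path = f.name
    # -t null: run the front end only, without producing an output file
    result = subprocess.run(["iverilog", "-t", "null", path], capture_output=True, text=True)
    if result.returncode != 0:
        print(result.stderr)
    return result.returncode == 0
```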
## Technical Details
### Model
- Architecture: GPT-2 model with multi-query attention
- Pretraining steps: 150k
- Pretraining tokens: ~72B
- Precision: fp16
### Hardware
- GPUs: 3 Tesla A100
- Training time: 8 days
## License
The model is licensed under the BigCode OpenRAIL-M v1 license agreement. You can find the full agreement [here](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement).
## Citation
```bibtex
@misc{https://doi.org/10.48550/arxiv.2212.11140,
  doi       = {10.48550/ARXIV.2212.11140},
  url       = {https://arxiv.org/abs/2212.11140},
  author    = {Thakur, Shailja and Ahmad, Baleegh and Fan, Zhenxing and Pearce, Hammond and Tan, Benjamin and Karri, Ramesh and Dolan-Gavitt, Brendan and Garg, Siddharth},
  title     = {Benchmarking Large Language Models for Automated Verilog RTL Code Generation},
  publisher = {arXiv},
  year      = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}
```
| Property | Details |
|----------|---------|
| Pipeline Tag | text-generation |
| Inference | true |
| Model Type | Fine-tuned version of CodeGen-multi-2B |
| Training Data | Verilog Dataset |
| Library Name | transformers |
| License | bigcode-openrail-m |
| Datasets | shailja/Verilog_GitHub |
## Important Note
The pretraining dataset was not filtered for permissive licenses only. The model can generate source code verbatim from its training data, and that code's license may require attribution and/or compliance with other specific terms. Please read the BigCode [OpenRAIL-M license](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement) agreement before accepting it.