🚀 deepseek-coder-1.3b-typescript
CodeGPTPlus/deepseek-coder-1.3b-typescript is a fine-tuned model based on deepseek-ai/deepseek-coder-1.3b-base, crafted by the CodeGPT team. It specializes in generating precise and efficient TypeScript code. With a 16K context window and a fill-in-the-middle (FIM) training task, it offers project-level code completion.
🚀 Quick Start
This model is designed for completion tasks. Here are some usage examples:
💻 Usage Examples
Basic Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained(
    "CodeGPTPlus/deepseek-coder-1.3b-typescript", trust_remote_code=True
)
model = AutoModelForCausalLM.from_pretrained(
    "CodeGPTPlus/deepseek-coder-1.3b-typescript", trust_remote_code=True
).cuda()

# Fill-in-the-middle prompt: the model infills the code at <|fim▁hole|>,
# between the <|fim▁begin|> and <|fim▁end|> markers.
input_text = """<|fim▁begin|>function quickSort(arr: number[]): number[] {
  if (arr.length <= 1) {
    return arr;
  }
  const pivot = arr[0];
  const left = [];
  const right = [];
<|fim▁hole|>
  return [...quickSort(left), pivot, ...quickSort(right)];
}<|fim▁end|>"""

inputs = tokenizer(input_text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_length=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
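The call above prints the prompt together with the completion. If you only want the text the model actually generated (the infilled middle), you can decode just the newly generated tokens; a minimal sketch building on the variables above:

```python
# Decode only the tokens generated after the prompt, i.e. the infilled middle.
prompt_length = inputs["input_ids"].shape[1]
completion = tokenizer.decode(outputs[0][prompt_length:], skip_special_tokens=True)
print(completion)
```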
Advanced Usage
Running with Ollama
Model: https://ollama.ai/codegpt/deepseek-coder-1.3b-typescript
```bash
ollama run codegpt/deepseek-coder-1.3b-typescript
```
Running with Ollama and CodeGPT Autocomplete in VSCode
Documentation: https://docs.codegpt.co/docs/tutorial-features/code_autocompletion
Select "Ollama - codegpt/deepseek-coder-1.3b-typescript" in the autocomplete model selector. Then, write any code or comment in the vscode text editor, and the model will provide you with code suggestions through the CodeGPT code autocomplete.

✨ Features
- Specialized for TypeScript: fine-tuned on a TypeScript dataset, so it generates precise and efficient code in that language.
- Project-level Completion: a 16K context window and a fill-in-the-middle training task enable project-level code completion.
- Good Performance: reaches a validation loss of 0.7681 on the evaluation set.
📚 Documentation
Model Details
| Property | Details |
|----------|---------|
| Model Developers | CodeGPT Team |
| Variations | 1.3B |
| Input | The model only accepts text input. |
| Output | The model only generates text output. |
Evaluation Results
It achieves the following results on the evaluation set:
- Loss: 0.7681
Training Details
Training hyperparameters
- learning_rate: 2e-05
- train_batch_size: 20
- eval_batch_size: 20
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 40
- optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-06
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 261
- num_epochs: 1
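For readers who want to reproduce a similar setup, the hyperparameters above map directly onto Hugging Face `TrainingArguments`. A minimal sketch; the output directory is a placeholder, and the dataset/model wiring is not part of the reported configuration:

```python
from transformers import TrainingArguments

# Sketch of the reported hyperparameters; "./output" is a placeholder path.
training_args = TrainingArguments(
    output_dir="./output",
    learning_rate=2e-5,
    per_device_train_batch_size=20,  # train_batch_size: 20
    per_device_eval_batch_size=20,   # eval_batch_size: 20
    seed=42,
    gradient_accumulation_steps=2,   # total batch: 20 * 2 = 40 on one device
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-6,
    lr_scheduler_type="cosine",
    warmup_steps=261,
    num_train_epochs=1,
)
```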
Training results
| Training Loss | Epoch | Step | Validation Loss |
|---------------|-------|-------|-----------------|
| 1.0745 | 0.0  | 1     | 0.8681 |
| 1.2267 | 0.05 | 1308  | 0.8130 |
| 1.1594 | 0.1  | 2616  | 0.8018 |
| 0.7674 | 0.15 | 3924  | 0.7942 |
| 0.6443 | 0.2  | 5232  | 0.7889 |
| 0.9155 | 0.25 | 6540  | 0.7847 |
| 0.7501 | 0.3  | 7848  | 0.7819 |
| 0.8835 | 0.35 | 9156  | 0.7792 |
| 0.7261 | 0.4  | 10464 | 0.7769 |
| 0.9746 | 0.45 | 11772 | 0.7748 |
| 0.6884 | 0.5  | 13080 | 0.7734 |
| 0.6104 | 0.55 | 14388 | 0.7722 |
| 0.8876 | 0.6  | 15696 | 0.7710 |
| 0.9567 | 0.65 | 17004 | 0.7703 |
| 0.6915 | 0.7  | 18312 | 0.7696 |
| 0.8874 | 0.75 | 19620 | 0.7691 |
| 0.6124 | 0.8  | 20928 | 0.7686 |
| 0.8147 | 0.85 | 22236 | 0.7684 |
| 0.8021 | 0.9  | 23544 | 0.7683 |
| 0.8665 | 0.95 | 24852 | 0.7681 |
Framework versions
- Transformers 4.37.0.dev0
- Pytorch 2.0.1+cu118
- Datasets 2.16.1
- Tokenizers 0.15.0
📄 License
The model is released under a custom license (listed as `other` on Hugging Face); refer to the model repository for the full terms.