GECKO: Generative Language Model for English, Code and Korean
GECKO is a generative language model for English, code, and Korean. Trained on a large-scale corpus, it offers high-quality text generation.
Quick Start
GECKO-7B is a 7B-parameter decoder-only transformer pretrained on 200 billion tokens of Korean, English, and code, including terabytes of Korean text. It is an open-source model released under the Apache 2.0 License. For more details, refer to our technical report.
Features
- Llama Architecture: GECKO uses the Llama architecture, so it integrates easily with other frameworks that support Llama.
Installation
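The README does not pin specific dependencies, so the following is an assumed minimal setup using the standard Hugging Face stack, not an official requirements list:

```shell
# transformers loads the model, accelerate enables device_map="auto"
pip install torch transformers accelerate
```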
Loading the model in half precision (float16 or bfloat16) requires a minimum of roughly 14 GB of memory.
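The ~14 GB figure follows directly from the parameter count: each of the 7B parameters takes 2 bytes in float16/bfloat16, before any activation or framework overhead. A quick back-of-the-envelope check:

```python
# Rough half-precision memory estimate for a 7B-parameter model.
params = 7_000_000_000
bytes_per_param = 2  # float16 / bfloat16
gib = params * bytes_per_param / 1024**3
print(f"{gib:.1f} GiB")  # weights alone: ~13 GiB, hence the ~14 GB guidance
```

The remaining headroom covers activations, the KV cache, and framework overhead during generation.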
Usage Examples
Basic Usage
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
model_id = 'kifai/GECKO-7B'
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")
text = """Explain what this HTML code does, and provide the explanation in English.
```html
<button onclick="alert('Welcome!')">Click Me</button>
```
"""
inputs = tokenizer(text, return_tensors='pt').input_ids.to(model.device)
output = model.generate(inputs, max_new_tokens=512, repetition_penalty=1.2)
print(tokenizer.decode(output[0], skip_special_tokens=True))
Documentation
Model Details
| Property | Details |
|---|---|
| Model Type | GECKO |
| Training Data | A mix of publicly available online data |
| Params | 7B |
| Context Length | 8k |
| GQA | ✗ |
| Tokens | 200B |
| LR | 3.0 × 10⁻⁴ |
Technical Details
GECKO is a generative language model using the Llama architecture. This architecture allows the model to be easily integrated with other frameworks that support Llama.
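This compatibility can be illustrated with the Llama classes in `transformers`. The snippet below builds a tiny, randomly initialized Llama model (the small dimensions are illustrative, not GECKO's actual configuration) to show the architecture-level interface that a full-scale checkpoint like GECKO-7B plugs into:

```python
import torch
from transformers import LlamaConfig, LlamaForCausalLM

# Tiny illustrative config -- GECKO-7B uses the same architecture at full scale.
config = LlamaConfig(
    vocab_size=1000, hidden_size=64, intermediate_size=128,
    num_hidden_layers=2, num_attention_heads=4, num_key_value_heads=4,
)
model = LlamaForCausalLM(config)

# A forward pass returns next-token logits over the vocabulary.
input_ids = torch.randint(0, 1000, (1, 8))
logits = model(input_ids).logits
print(logits.shape)  # (batch, sequence, vocab) = (1, 8, 1000)
```

Because the checkpoint is published in this format, any runtime that consumes Llama-architecture weights can load `kifai/GECKO-7B` without model-specific code.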
Limitations
GECKO is a generative language model and carries the risks common to such models. Testing has mainly been conducted in Korean and has not covered all possible scenarios. As with all large language models, GECKO's outputs cannot be predicted in advance and may sometimes be inaccurate, biased, or otherwise problematic. Developers should therefore conduct safety testing and fine-tune the model for their intended uses before deployment.
License
GECKO is released under the Apache 2.0 license.
Citation
@misc{oh2024gecko,
      title={GECKO: Generative Language Model for English, Code and Korean},
      author={Sungwoo Oh and Donggyu Kim},
      year={2024},
      eprint={2405.15640},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
Acknowledgement
Training was supported by the TPU Research Cloud program.
Contact
We look forward to hearing from you and collaborating with you.