đ li-14b-v0.4-slerp0.1
This is a merged pre - trained language model, created by combining multiple base models using the mergekit tool, aiming to provide better performance in text - generation tasks.
đ Quick Start
This README provides detailed information about the merged model li-14b-v0.4-slerp0.1
, including its base models, merge details, evaluation results, and information about the company behind it.
⨠Features
- Model Merging: Utilizes the mergekit tool to merge pre - trained language models.
- Diverse Evaluation: Evaluated on multiple datasets such as IFEval, BBH, MATH, GPQA, MuSR, and MMLU - PRO, providing a comprehensive view of its performance.
- Company Backing: Supported by Century Innovation, a leading "Industrial Internet" printing enterprise with a focus on technological innovation.
đĻ Installation
No installation steps are provided in the original document, so this section is skipped.
đģ Usage Examples
No code examples are provided in the original document, so this section is skipped.
đ Documentation
Merge Details
Merge Method
This model was merged using the SLERP merge method.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
base_model: wanlige/li-14b-v0.4
merge_method: slerp
tokenizer_source: base
dtype: float32
out_dtype: bfloat16
parameters:
t:
- filter: self_attn
value: [ 0.00, 0.50, 0.30, 0.70, 1.00 ]
- filter: mlp
value: [ 1.00, 0.50, 0.70, 0.30, 0.00 ]
- value: [ 0.00, 0.00, 0.00, 0.00, 0.04, 0.08, 0.12, 0.16, 0.24, 0.32, 0.40, 0.48, 0.56, 0.64, 0.72, 0.72, 0.72, 0.72, 0.72, 0.72, 0.72, 0.72, 0.64, 0.56, 0.48 ]
slices:
- sources:
- model: wanlige/li-14b-v0.4
layer_range: [ 0, 48 ]
- model: sthenno-com/miscii-14b-0218
layer_range: [ 0, 48 ]
Detailed results can be found here
Metric |
Value |
Avg. |
42.91 |
IFEval (0 - Shot) |
79.23 |
BBH (3 - Shot) |
50.88 |
MATH Lvl 5 (4 - Shot) |
53.32 |
GPQA (0 - shot) |
14.54 |
MuSR (0 - shot) |
11.75 |
MMLU - PRO (5 - shot) |
47.71 |
Company Information
Established on March 9, 2001, and headquartered in Jinan, Shandong Province, Century Innovation has grown over the past two decades by focusing on technological innovation. The company has achieved deep integration of the Internet with the traditional printing industry, pioneering new models and business formats distinct from conventional printing practices.
Century Innovation specializes in the research, design, production, and sales of customized imaging, commercial printing, and packaging products. By combining the Internet, digitalization, automation, and intelligent technologies with the printing industry, the company enables relatively standardized and scalable production for small - batch personalized custom orders. This approach aims to meet the needs of individual consumers and various enterprise users for small - batch customization, providing users with one - stop, scenario - based custom printing services and achieving full - process intelligent manufacturing. As a result, Century Innovation has become a leading "Industrial Internet" printing enterprise.
In the future, Century Innovation will continue to increase investment in technology R & D, deeply integrate the Internet, big data, artificial intelligence, and other next - generation information technologies, and focus on cultivating specialized technical talent. The company will actively adopt digital and intelligent means to optimize innovative business processes and enhance user experience. Through multi - dimensional development, it aims to drive industry collaboration, promote the transformation of old and new drivers in the printing industry, and explore new directions for its growth.
To learn more, visit our official website: Century Innovation
đ§ Technical Details
No specific technical implementation details (more than 50 words) are provided in the original document, so this section is skipped.
đ License
No license information is provided in the original document, so this section is skipped.