Phi 3 Small 8k Instruct Onnx Cuda
MIT
Phi-3 Small is a 7B-parameter lightweight cutting-edge open-source model, optimized for NVIDIA GPUs in ONNX format, supporting 8K context length with strong inference capabilities.
Large Language Model
Transformers