Access Global AI Models - Power Next-Gen Apps
From General to Specialized AI - All Models in One Platform
Hot
Latest
High Likes
Filter

98 models match the criteria

Tiny Random LlamaForCausalLM
This model is based on the transformers library, with no specific details provided.
Large Language Model Transformers
T
trl-internal-testing
773.88k
7
Depth Anything V2 Small Hf
Apache-2.0
Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, featuring fine details and robustness.
3D Vision Transformers
D
depth-anything
438.72k
15
Depth Anything V2 Large
Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on a large amount of synthetic and real images, providing fine depth details and high robustness.
3D Vision English
D
depth-anything
130.54k
94
Tiny Dummy Qwen2
MIT
This model is released under the MIT License, specific details are currently unknown.
Large Language Model Transformers
T
fxmarty
90.78k
1
Depth Anything V2 Large Hf
Depth Anything V2 is currently the most powerful Monocular Depth Estimation (MDE) model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, offering finer details and stronger robustness.
3D Vision Transformers
D
depth-anything
83.99k
19
Depth Anything V2 Small
Apache-2.0
Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on large-scale synthetic and real images. Compared to V1, it captures finer details and is more robust.
3D Vision English
D
depth-anything
55.22k
64
Depthcrafter
Other
DepthCrafter is a model capable of generating temporally coherent long depth sequences for open-world videos with fine details, without requiring additional information such as camera poses or optical flow.
3D Vision
D
tencent
55.08k
91
Depth Anything V2 Base Hf
Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, offering finer details and stronger robustness.
3D Vision Transformers
D
depth-anything
47.73k
1
Yolos Fashionpedia
This is a fine-tuned object detection model for the fashion domain, based on the YOLOS architecture, capable of recognizing and localizing fashion items and their details.
Object Detection Transformers English
Y
valentinafeve
44.05k
125
Style Enhancer Xl Lora
A high-resolution LoRA adapter specifically designed for Animagine XL 2.0 to enhance the quality and details of anime-style images.
Image Generation English
S
Linaqruf
19.40k
23
Ttplanet SDXL Controlnet Tile Realistic
Openrail
A ControlNet Tile model based on the SDXL framework for enhancing or altering original image details, compatible with WebUI and ComfyUI
Image Generation
T
TTPlanet
18.39k
237
Internvideo2 5 Chat 8B
Apache-2.0
InternVideo2.5 is a video multimodal large language model enhanced by Long and Rich Context (LRC) modeling, built upon InternVL2.5. It significantly improves existing MLLM models by enhancing the ability to perceive fine-grained details and capture long-term temporal structures.
Video-to-Text Transformers English
I
OpenGVLab
8,265
60
Ijepa Vith14 1k
I-JEPA is a self-supervised learning method that predicts representations of other parts of an image from partial representations, without relying on manual data transformations or filling in pixel-level details.
Image Classification Transformers
I
facebook
8,239
10
Flamingo 2024
MIT
This model is released under the MIT license, specific details are currently unknown.
Large Language Model Transformers
F
babylm
6,526
1
Fantasy Wizard Witches
Other
A LoRA model customized based on the FLUX.1-dev model, specifically designed for generating fantasy-style wizard and witch images, significantly enhancing image details.
Image Generation
F
Keltezaa
6,121
3
Cue Detr
MIT
This model is released under the MIT License, with specific details currently unknown.
Large Language Model Transformers
C
disco-eth
6,095
0
Canopus LoRA Flux UltraRealism 2.0
Openrail
A hyper-realistic model utilizing LoRA fine-tuning technology, capable of generating high-quality images with realistic textures, lighting, and intricate details.
Image Generation
C
prithivMLmods
5,118
103
Internvl 2 5 HiCo R16
Apache-2.0
InternVideo2.5 is a video multimodal large language model (MLLM) built upon InternVL2.5, enhanced with Long and Rich Context (LRC) modeling, capable of perceiving fine-grained details and capturing long-term temporal structures.
Video-to-Text Transformers English
I
OpenGVLab
1,914
3
Dreamshaper 8
Other
A high-quality stable diffusion model capable of generating realistic images from text descriptions, especially excelling in portrait rendering and complex details.
Image Generation
D
digiplay
1,233
16
Phantasma Anime
Openrail
A model capable of generating lively anime-style illustrations with special effect details, especially suitable for fantasy themes.
Image Generation
P
alvdansen
1,178
76
Bde Cner Batteryonlybert Uncased Base
MIT
This model is released under the MIT license, with specific details currently unknown.
Large Language Model Transformers
B
batterydata
1,128
2
Pointllm 7B V1.2
This model is released under the Creative Commons Attribution-NonCommercial 4.0 International License. For specific details, please refer to the model page.
Large Language Model Transformers
P
RunsenXu
920
3
Floral High Dynamic Range
Apache-2.0
A cutting-edge large-scale image generation model, excelling in producing images with astonishing clarity, precision, and intricate details, particularly suited for high-resolution imagery and realistic scene generation.
Image Generation Supports Multiple Languages
F
future-technologies
851
8
Plushy World Flux
Openrail
This model presents fantasy characters with exaggerated proportions and rounded lines, featuring rich 3D rendering details that blend cute cartoon designs with realistic lighting and textures.
Image Generation
P
alvdansen
708
35
Add Detail Xl
A detail adjustment tool specifically designed for SDXL, enhancing or reducing image generation details through LoRA model
Image Generation
A
highscoregames12018
654
1
Acertainthing
Openrail
Anything3.0 is an overfitted model, excelling in generating character images and specific details, capable of producing high-quality images even with suboptimal prompts.
Image Generation English
A
JosephusCheung
529
188
Flux Realism FineDetailed
Openrail
A super-realistic image generation model based on LoRA fine-tuning technology, focusing on producing high-quality images with lifelike textures and intricate details.
Image Generation
F
prithivMLmods
511
21
Sanskrit5 Multitask
The model card information is incomplete and cannot provide specific details
Large Language Model Transformers
S
chronbmm
433
2
Pony Realism V23 Sdxl
Other
A Stable Diffusion model focused on generating highly realistic pony images, with special emphasis on facial details, skin textures, and lighting effects.
Image Generation English
P
John6666
432
1
Skycaptioner V1
Apache-2.0
SkyCaptioner-V1 is a model specifically designed for generating high-quality structured descriptions of video data. By integrating specialized sub-expert models, multimodal large language models, and manual annotations, it addresses the limitations of general description models in capturing professional film details.
Video-to-Text Transformers
S
Skywork
362
29
Bert Large Maths
Apache-2.0
Open-source model under Apache-2.0 license (specific details unavailable)
Large Language Model Transformers
B
reyvan
330
1
Add Detail Xl
add-detail-xl is a detail adjustment model for SDXL. It can increase or decrease image details by adjusting the weight, bringing more flexibility to image generation.
Image Generation
A
LyliaEngine
327
4
Detail Tweaker XL V1
Openrail
A detail adjustment tool specifically designed for SDXL, enhancing or simplifying image details through LoRA technology
Image Generation
D
AiWise
261
2
Internvl 2 5 HiCo R64
Apache-2.0
A video multimodal large language model enhanced by Long and Rich Context (LRC) modeling, improving existing MLLMs by enhancing the perception of fine-grained details and capturing long-term temporal structures
Video-to-Text Transformers English
I
OpenGVLab
252
2
Bde Abbrev Batteryonlybert Cased Base
MIT
This model is released under the MIT license, specific details are currently unavailable.
Large Language Model Transformers
B
batterydata
244
0
Illustrious SemiRealistic V10
Other
A Stable Diffusion XL-based text-to-image generation model focused on producing high-quality, semi-realistic female portraits with photorealistic details and rich visual expression.
Image Generation English
I
ParahumanSkitter
235
1
Emotion LLaMA
Apache-2.0
This model is released under the Apache-2.0 license, with specific details currently unknown.
Large Language Model Transformers
E
ZebangCheng
213
4
Anime Nouveau Xl Lora
A LoRA adapter designed for Animagine XL 2.0, adding rich Art Nouveau-style details and decorative elements to anime images
Image Generation English
A
Linaqruf
200
11
Slowfast Video Mllm Qwen2 7b Convnext 576 Frame64 S1t4
A video multimodal large language model using a slow-fast architecture, balancing temporal resolution and spatial details, supporting 64-frame video understanding
Video-to-Text Transformers
S
shi-labs
184
0
Bert Base Uncased Ganesh123
A fine-tuned version based on the BERT base model, specific use case and training data details are currently unknown
Large Language Model Transformers
B
stevems1
173
0
Latex Ocr
MIT
This model is released under the MIT License; further details need to be supplemented.
Large Language Model Transformers
L
yhshin
170
11
PRM
Apache-2.0
PRM is a novel large-scale reconstruction model based on photometric stereo vision, capable of reconstructing high-quality mesh models with fine local details.
3D Vision
P
LTT
151
0
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase