Access Global AI Models - Power Next-Gen Apps

From General to Specialized AI - All Models in One Platform

Hot

Latest

High Likes

Filter

Commercial Models

Open Source Models

Classification

Framework

Open Source License

Language

98 models match the criteria

Hot

Latest

High Likes

Tiny Random LlamaForCausalLM

This model is based on the transformers library, with no specific details provided.

Large Language Model

trl-internal-testing

Depth Anything V2 Small Hf

Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, featuring fine details and robustness.

Depth Anything V2 Large

Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on a large amount of synthetic and real images, providing fine depth details and high robustness.

3D Vision English

Tiny Dummy Qwen2

This model is released under the MIT License, specific details are currently unknown.

Large Language Model

Depth Anything V2 Large Hf

Depth Anything V2 is currently the most powerful Monocular Depth Estimation (MDE) model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, offering finer details and stronger robustness.

Depth Anything V2 Small

Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on large-scale synthetic and real images. Compared to V1, it captures finer details and is more robust.

3D Vision English

DepthCrafter is a model capable of generating temporally coherent long depth sequences for open-world videos with fine details, without requiring additional information such as camera poses or optical flow.

Depth Anything V2 Base Hf

Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, offering finer details and stronger robustness.

Yolos Fashionpedia

This is a fine-tuned object detection model for the fashion domain, based on the YOLOS architecture, capable of recognizing and localizing fashion items and their details.

Object Detection

Transformers English

Style Enhancer Xl Lora

A high-resolution LoRA adapter specifically designed for Animagine XL 2.0 to enhance the quality and details of anime-style images.

Image Generation English

Ttplanet SDXL Controlnet Tile Realistic

A ControlNet Tile model based on the SDXL framework for enhancing or altering original image details, compatible with WebUI and ComfyUI

Image Generation

Internvideo2 5 Chat 8B

InternVideo2.5 is a video multimodal large language model enhanced by Long and Rich Context (LRC) modeling, built upon InternVL2.5. It significantly improves existing MLLM models by enhancing the ability to perceive fine-grained details and capture long-term temporal structures.

Transformers English

Ijepa Vith14 1k

I-JEPA is a self-supervised learning method that predicts representations of other parts of an image from partial representations, without relying on manual data transformations or filling in pixel-level details.

Image Classification

This model is released under the MIT license, specific details are currently unknown.

Large Language Model

Fantasy Wizard Witches

A LoRA model customized based on the FLUX.1-dev model, specifically designed for generating fantasy-style wizard and witch images, significantly enhancing image details.

Image Generation

This model is released under the MIT License, with specific details currently unknown.

Large Language Model

Canopus LoRA Flux UltraRealism 2.0

A hyper-realistic model utilizing LoRA fine-tuning technology, capable of generating high-quality images with realistic textures, lighting, and intricate details.

Image Generation

Internvl 2 5 HiCo R16

InternVideo2.5 is a video multimodal large language model (MLLM) built upon InternVL2.5, enhanced with Long and Rich Context (LRC) modeling, capable of perceiving fine-grained details and capturing long-term temporal structures.

Transformers English

A high-quality stable diffusion model capable of generating realistic images from text descriptions, especially excelling in portrait rendering and complex details.

Image Generation

Phantasma Anime

A model capable of generating lively anime-style illustrations with special effect details, especially suitable for fantasy themes.

Image Generation

Bde Cner Batteryonlybert Uncased Base

This model is released under the MIT license, with specific details currently unknown.

Large Language Model

Pointllm 7B V1.2

This model is released under the Creative Commons Attribution-NonCommercial 4.0 International License. For specific details, please refer to the model page.

Large Language Model

Floral High Dynamic Range

A cutting-edge large-scale image generation model, excelling in producing images with astonishing clarity, precision, and intricate details, particularly suited for high-resolution imagery and realistic scene generation.

Image Generation Supports Multiple Languages

future-technologies

Plushy World Flux

This model presents fantasy characters with exaggerated proportions and rounded lines, featuring rich 3D rendering details that blend cute cartoon designs with realistic lighting and textures.

Image Generation

A detail adjustment tool specifically designed for SDXL, enhancing or reducing image generation details through LoRA model

Image Generation

highscoregames12018

Anything3.0 is an overfitted model, excelling in generating character images and specific details, capable of producing high-quality images even with suboptimal prompts.

Image Generation English

Flux Realism FineDetailed

A super-realistic image generation model based on LoRA fine-tuning technology, focusing on producing high-quality images with lifelike textures and intricate details.

Image Generation

Sanskrit5 Multitask

The model card information is incomplete and cannot provide specific details

Large Language Model

Pony Realism V23 Sdxl

A Stable Diffusion model focused on generating highly realistic pony images, with special emphasis on facial details, skin textures, and lighting effects.

Image Generation English

Skycaptioner V1

SkyCaptioner-V1 is a model specifically designed for generating high-quality structured descriptions of video data. By integrating specialized sub-expert models, multimodal large language models, and manual annotations, it addresses the limitations of general description models in capturing professional film details.

Bert Large Maths

Open-source model under Apache-2.0 license (specific details unavailable)

Large Language Model

add-detail-xl is a detail adjustment model for SDXL. It can increase or decrease image details by adjusting the weight, bringing more flexibility to image generation.

Image Generation

Detail Tweaker XL V1

A detail adjustment tool specifically designed for SDXL, enhancing or simplifying image details through LoRA technology

Image Generation

Internvl 2 5 HiCo R64

A video multimodal large language model enhanced by Long and Rich Context (LRC) modeling, improving existing MLLMs by enhancing the perception of fine-grained details and capturing long-term temporal structures

Transformers English

Bde Abbrev Batteryonlybert Cased Base

This model is released under the MIT license, specific details are currently unavailable.

Large Language Model

Illustrious SemiRealistic V10

A Stable Diffusion XL-based text-to-image generation model focused on producing high-quality, semi-realistic female portraits with photorealistic details and rich visual expression.

Image Generation English

ParahumanSkitter

This model is released under the Apache-2.0 license, with specific details currently unknown.

Large Language Model

Anime Nouveau Xl Lora

A LoRA adapter designed for Animagine XL 2.0, adding rich Art Nouveau-style details and decorative elements to anime images

Image Generation English

Slowfast Video Mllm Qwen2 7b Convnext 576 Frame64 S1t4

A video multimodal large language model using a slow-fast architecture, balancing temporal resolution and spatial details, supporting 64-frame video understanding

Bert Base Uncased Ganesh123

A fine-tuned version based on the BERT base model, specific use case and training data details are currently unknown

Large Language Model

This model is released under the MIT License; further details need to be supplemented.

Large Language Model

PRM is a novel large-scale reconstruction model based on photometric stereo vision, capable of reconstructing high-quality mesh models with fine local details.

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase