T

Test2

Developed by mccaly
FoodSeg103 is a dataset containing 7,118 food images, annotated with 104 ingredient categories, with an average of 6 ingredient labels and pixel-level masks per image.
Downloads 22
Release Time : 7/14/2023

Model Overview

This model is used for semantic segmentation of food images, capable of identifying and segmenting multiple ingredients in an image.

Model Features

Large-scale Food Image Dataset
Contains 7,118 images annotated with 104 ingredient categories, with an average of 6 ingredient labels and pixel-level masks per image.
Multimodal Pre-training Method
Proposes the ReLeM multimodal pre-training method, explicitly equipping the segmentation model with rich and semantic food knowledge.
Multiple Baseline Models
Provides multiple baseline models based on dilated convolution, feature pyramid, and vision transformers.

Model Capabilities

Food Image Segmentation
Ingredient Recognition
Pixel-level Mask Generation

Use Cases

Food Industry
Food Ingredient Analysis
Used to analyze ingredients in food images, aiding in nutritional calculation and dietary management.
Accurately identifies and segments multiple ingredients.
Smart Dining
Used for food recognition and ingredient analysis in smart dining systems.
Enhances the automation and intelligence level of dining systems.
Health Management
Diet Recording
Helps users record ingredients and nutritional content in their diet.
Provides accurate ingredient recognition and segmentation results.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase