FastSAM (Fast Segment Anything Model) - Ultralytics YOLOv8 Docs #3417
Replies: 6 comments 15 replies
-
How can I obtain the information of the masks after image inference? |
Beta Was this translation helpful? Give feedback.
-
working on decode from the output |
Beta Was this translation helpful? Give feedback.
-
Unable to use device=[0, 1, 2, 3] for using GPU resources. |
Beta Was this translation helpful? Give feedback.
-
Hello, I am a beginner in machine learning models. After converting the FastSAM.pt model to CoreML format, I used it in the Xcode project. The input parameter is an image of [1 * 3 * 1024 * 1024], and the output parameter yields two multidimensional arrays of [1 * 37 * 21504] and [1 * 3 * 256 * 256]. |
Beta Was this translation helpful? Give feedback.
-
I am using FastSAM (model file: FastSAM-x.pt) for object segmentation in a project where the input images vary in size. FastSAM attempts to adjust image sizes to be multiples of its maximum stride (32) but occasionally fails, leading to errors. Here's the typical warning and adjustment process: WARNING The model automatically updates the image size and usually continues without issues. However, when the original image size is close to a multiple of 32 but not exactly a multiple, FastSAM throws a runtime error after processing hundreds of images successfully. For example: Image Shape before SAM: (374, 144, 3) Interestingly, manually resizing the image to the suggested dimensions (e.g., 384 in this case) before passing it to the model does not solve the issue, and the error persists. |
Beta Was this translation helpful? Give feedback.
-
Given that fast-sam is fast a user may want to run it during training for finetuning on a subtask.
Thank you, |
Beta Was this translation helpful? Give feedback.
-
FastSAM (Fast Segment Anything Model) - Ultralytics YOLOv8 Docs
Explore the Fast Segment Anything Model (FastSAM), a real-time solution for the segment anything task that leverages a Convolutional Neural Network (CNN) for segmenting any object within an image, guided by user interaction prompts.
https://docs.ultralytics.com/models/fast-sam/
Beta Was this translation helpful? Give feedback.
All reactions