| ▲ | clueless 2 hours ago | |
With a avg latency of 4 seconds, this still couldn't be used in real-time video, correct? [Update: should have mentioned I got the 4 second from the roboflow.com links in this thread] | ||
| ▲ | Etheryte 2 hours ago | parent | next [-] | |
Didn't see where you got those numbers, but surely that's just a problem of throwing more compute at it? From the blog post: > This excellent performance comes with fast inference — SAM 3 runs in 30 milliseconds for a single image with more than 100 detected objects on an H200 GPU. | ||
| ▲ | 2 hours ago | parent | prev [-] | |
| [deleted] | ||