Show HN: Detect any object in satellite imagery using a text prompt (useful-ai-tools.com)
17 points by eyasu6464 5 days ago | 7 comments

I built a browser-based tool that uses Vision-Language Models (VLMs) to detect objects in satellite imagery via natural language prompts. Draw a polygon on the map, type what you want to find (e.g., "swimming pools," "oil tanks," "solar panels"), and the system scans tile-by-tile, projecting bounding boxes back onto the globe as GeoJSON.

The pipeline: pick zoom level + prompt → slice map into mercantile tiles → feed each tile + prompt to VLM → create bounding boxes → project to WGS84 coordinates → render on map.
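A rough sketch of the tile-slicing and projection steps above, using only the Python standard library and the standard Web Mercator (XYZ) tile formulas. This is not the tool's actual code: the function names, the 256-pixel tile size, the `(xmin, ymin, xmax, ymax)` pixel box layout, and the use of the polygon's bounding box rather than the polygon itself are all assumptions made for illustration. The VLM call is left out entirely.

```python
import math

TILE_SIZE = 256  # assumed tile edge length in pixels


def tile_to_lonlat(xf, yf, z):
    """Convert (possibly fractional) XYZ tile coordinates to WGS84 lon/lat."""
    n = 2 ** z
    lon = xf / n * 360.0 - 180.0
    lat = math.degrees(math.atan(math.sinh(math.pi * (1.0 - 2.0 * yf / n))))
    return lon, lat


def lonlat_to_tile(lon, lat, z):
    """Return the integer XYZ tile containing a lon/lat point, clamped to the grid."""
    n = 2 ** z
    x = math.floor((lon + 180.0) / 360.0 * n)
    y = math.floor((1.0 - math.asinh(math.tan(math.radians(lat))) / math.pi) / 2.0 * n)
    return min(max(x, 0), n - 1), min(max(y, 0), n - 1)


def tiles_covering(west, south, east, north, z):
    """List every tile at zoom z that intersects a lon/lat bounding box."""
    x0, y0 = lonlat_to_tile(west, north, z)  # top-left tile (y grows southward)
    x1, y1 = lonlat_to_tile(east, south, z)  # bottom-right tile
    return [(x, y, z) for y in range(y0, y1 + 1) for x in range(x0, x1 + 1)]


def box_to_feature(x, y, z, box, label):
    """Project a pixel-space box on tile (x, y, z) to a GeoJSON Polygon feature."""
    xmin, ymin, xmax, ymax = box  # pixel y grows downward, so ymin is the north edge
    west, north = tile_to_lonlat(x + xmin / TILE_SIZE, y + ymin / TILE_SIZE, z)
    east, south = tile_to_lonlat(x + xmax / TILE_SIZE, y + ymax / TILE_SIZE, z)
    # Closed exterior ring in [lon, lat] order, counter-clockwise
    ring = [[west, south], [east, south], [east, north], [west, north], [west, south]]
    return {
        "type": "Feature",
        "properties": {"label": label},
        "geometry": {"type": "Polygon", "coordinates": [ring]},
    }
```

In this sketch, each tile from `tiles_covering` would be fetched and sent to the model with the prompt, and each returned pixel box would go through `box_to_feature` before being collected into a GeoJSON `FeatureCollection` for the map.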

No login required for the demo. Works well for distinct structures zero-shot; struggles with dense/occluded objects where narrow YOLO models still win.

canadiantim 7 minutes ago

Find me large outcroppings of gold, or gold particles in tree canopies please

password4321 3 hours ago

Dupe yesterday with ~50 points and ~20 comments:

https://news.ycombinator.com/item?id=47305979

gnabgib 2 hours ago

That's actually tomorrow (check the submission IDs & hover over the submission time on this). This is an accidental SCP fail.

pinkmuffinere 2 hours ago

They've submitted basically the same thing 4 times: https://news.ycombinator.com/submitted?id=eyasu6464

I assume this is a marketing strategy. It feels a bit dishonest.

jayknight 3 hours ago

Neat. I was recently wondering if there was a way to find houses in my area that had roof-top solar, just to get an idea of how common it was.

jjwiseman 3 hours ago

Since this post is a dupe, here's a video demonstrating a similar but different app I made: https://www.youtube.com/watch?v=EjH0kMEz4YY

fallingmeat 4 hours ago

did you just make Danti?