Remix.run Logo
djhn 7 days ago

But is there actually an API for that? Last I checked the big providers Video Intelligence APIs even distinguishing cats and dogs was still unreliable.

junon 6 days ago | parent | next [-]

Just to see if a bird is in the picture (like the comic states) using chatgpt et al can probably do a sufficient job.

Not condoning people make this app, just thinking about how fast things have moved in just a few short years.

JustExAWS 5 days ago | parent [-]

For a POC, I’ve done animal recognition in a picture with Anthropic and the various Amazon Nova models. It’s around 10 lines of code.

SAI_Peregrinus 6 days ago | parent | prev | next [-]

BirdNET from the Cornell lab of ornithology provides that api.

filoleg 6 days ago | parent [-]

Unless I am missing something massive, BirdNET[0] is for identifying birds by sound, not by images.

Merlin[1] (also from Cornell Lab of Ornithology), on the other hand, has both image and sound ID. I haven't used either, so I cannot compare the quality of results from Merlin vs. BirdNET for sound ID, but afaik only Merlin has image ID.

0. https://birdnet.cornell.edu/

1. https://merlin.allaboutbirds.org/

reaperducer 6 days ago | parent | prev | next [-]

https://merlin.allaboutbirds.org/

MaxikCZ 7 days ago | parent | prev [-]

These days you dont need an api, you can run the stack on tamagochi