There's an important cautionary tale this week. AIs sometimes get it wrong... especially with image search...
A building in downtown Denver. P/C Dan Russell |
If you do a regular old Google Image Search, you get the right answer: This is the El Jebel Shrine, aka the Sherman Event Center in Denver, Colorado. That’s great, and exactly what you’d expect. Easy peasy.
If you search on Bing Image Search, you also get the right answer:
But I wouldn’t write an SRS post about something so obvious.
What IS surprising is what happens when you ask your favorite AI / LLMs about this image.
Mostly, they get it very, very wrong.
Oddly, after getting it right with Bing Image Search, Microsoft's Copilot gives a terrible answer. (Once again, the left hand does not know what the right hand is doing.)
This is NOT the Denver Athletic Club. (It is also made of brick and has arches, but no domes or minaret-like towers.)
But if you use the visual description ability of ChatGPT, you get another very wrong answer (mostly because it’s trying to find buildings near my Palo Alto, California location–an assumption that seems really bad… especially since none of the buildings it suggests look anything like the image I asked about!
I thought that maybe I should give ChatGPT a hint, telling it that the building was in Denver. But that didn't work either. In fact, the answers got worse. The buildings it suggested aren't anywhere near the search target!
Maybe I'm just asking the wrong LLM?
Here's Claude's reply:
Claude gets the style correct, but not a proper identification.
I then asked Google Gemini what the answer might be. AGAIN: It too is really wrong:
I know the Advanced Medicine Center Building in Palo Alto--it looks nothing like this.
Well, I thought, that’s because I’m using Gemini 1.5 Pro. (As you know, there are multiple Gemini models to choose from...)
But when I switched to Gemini 2.0 Flash Experimental (the latest!), I got an even more wrong answer, even though it has a “might not work as expected” disclaimer. Indeed!
The Mosque of Ibn Tulun looks a little like the image with domes and towers, but the color, layout, and materials are all wrong.
HOWEVER, when I tried Gemini 2.0 Experimental Advanced, I finally got the correct answer.
Notice that it took 3 different tries with different Gemini models to get to the right answer. That’s not encouraging. We know that most people will simply accept the first result and not do any follow-on checking.
Since I took the photo I’ll tell you: this really IS the Sherman Street Event Center, aka the El Jebel Shrine, aka the Rocky Mountain Consistory, and as the Scottish Rite Temple is a historic building in the North Capitol Hill neighborhood of downtown Denver.
Here’s a great article about it: https://denverite.com/2024/05/09/el-jebel-shriner-mosque-photos/
The Moorish-inspired building was constructed in 1907, as a meeting hall for the El Jabel chapter of the Ancient Arabic Order of the Nobles of the Mystic Shrine (the Shriners). It was never a true mosque in the Islamic sense. In 1924, having outgrown the building, the Shriners sold it to the Scottish Rite Masons, who renamed it. In 1995, the Scottish Rite sold the building to Eulipions, Inc. who converted it into a catering and events facility, and it’s been bouncing around the Denver real estate market ever since. (Although it seems to have recently landed a permanent owner.)
This is a beautiful example of the Moorish Revival Architecture movement of the early 20th century in the US. This stylistic movement is a variation of Islamic architecture that was introduced in the 17th and 18th centuries in Spain. It's characterized by geometric shapes, arches, and decorative elements like arabesques and ceramic tiles.
Bottom line: The AIs are mostly wrong. Regular search-by-image is much better. (Oddly, Tineye.com found nothing, not even a near miss!)
But what’s worse is that they’re CONFIDENTLY wrong. There’s no hesitation, no questioning of the plausibility of the results. When Microsoft CoPilot says that this is the “Denver Athletic Club,” it doesn’t say that it “might be” the club, or that “this image looks a great deal like the Athletic Club, but I’m not 100% sure.”
Which is disappointing.
Search Research Lessons
1. Don't trust any image identification that the LLMs give you. We've talked about this before, but REALLY... if they can't identify a very visually distinct building, I wouldn't trust their results when foraging for mushrooms, berries, or edible plants! Perhaps one day they'll improve their accuracy... but for the time being, be sure to double-check everything!
As always,
Keep Searching!
Interesting. I missed this one - but I agree that photos can be wrong. I've often tried geo-locating something and got answers I knew were wrong. However I've also got the correct answer so the tip is to repeat and clarify and give what you know and often the AI will apologise and try again, narrowing down enough to get it.
ReplyDeleteI just tried the photo in Perplexity Pro and it got it straight way. My prompt was basic "What building is this" (without any clues at all).
This building is the El Jebel Shrine Building, located in Denver, Colorado. It is a historic structure built in 1907 in the Moorish Revival architectural style. Originally constructed as a meeting hall for the Shriners, a Masonic organization, it features distinctive red brickwork, arched windows, and minaret-like towers, which are characteristic of its exotic design. Today, it is recognized as one of Denver’s architectural landmarks.
I've not checked the rest but from what you wrote, it seems correct.
Interestingly Claude also got it right - at least for me.
"This appears to be the El Jebel Shrine Temple, a historic building in Denver, Colorado. It's a distinctive Moorish Revival-style building with characteristic features like the minaret-like tower, arched windows, and ornate architectural details. The building was constructed in the early 1900s and served as a meeting place for the El Jebel Shriners, a Masonic fraternal organization. The building's reddish-brown brick exterior and unique architectural style make it stand out among the more modern buildings visible in the background.
The El Jebel Shrine Temple is considered an important architectural landmark in Denver and represents a time when fraternal organizations played a significant role in American social life. The building showcases the exotic architectural influences that were popular in American architecture during the early 20th century."
Gemini 1.5 Flash also got it - but with less detail: "The building in the image is the Sherman Street Event Center in El Jebel, Colorado.
Originally built in 1906 as a school, it was later converted into a community center and event space."
Different building date and purpose (school, 1906) but it got the building. So a lesson is to check the supplementary details as at least one of these is wrong! (I didn't bother to check - I just wanted to see what came up).
Copilot also succeeded. BUT ChatGPT tried to find something local to me in the UK (and so failed about it did say it didn't know), Le Chat (Mistral) also didn't have a clue and neither did You.com. DeepSeek apparently couldn't read text in the image so is clearly not up to some of the others (yet).
So the lesson isn't just what you wrote. It seems as though location plays a part but perhaps because you searched, the AI tools updated their answers. OR they are inconsistent in their answers - which is another lesson in using AI (and I suspect this isn't just for image search - as when you do search in different locations with the same prompt you do get different, location dependent, answers. Similarly searches in other languages).
This is interesting... I'm surprised that you got correct answers with Claude. When I just tried redoing the search, I got the same answer I found earlier (that is, it didn't answer it correctly). I agree that Gemini 1.5 Flash gives the right answer, as does Gemini 2.0 Experimental Advanced. But as you know, there are 5 different Gemini models available. Are we supposed to use ALL of them to find the right answer?
ReplyDeleteAgree that the inconsistencies and inaccuracies do not speak well for the LLMs.
Also agree - it's really case of Caveat Emptor. However when your search is really fuzzy in that you've only got an idea and so can't come up with good keywords, AI is a game changer. So for example when you read something 18 months ago but can't recall where or precise details. E.g. "I read a news item about a new restaurant in New York that was founded by refugees from somewhere in South America that's been getting rave reviews". What keywords do you use if the article didn't mention reviews but said the restaurant was Argentinian, was founded by new immigrants (and not refugees) and had just won a Michelin star. Yet the AI is more likely to find it. (This is fictional - but gives an idea on what I mean by fuzzy).
DeleteI used AI for some of the puzzles in the Bellingcat Christmas Challenge and I found it really helped to narrow down options that I could then use more traditional approaches to get or confirm the answer.
surely (or Shirley) a conundrum… use, but don't believe -
ReplyDelete[is that a Reagan echo? doveryai, no proveryai,]
the process leaves me uffish, & in a tewolgow ('darkness, gloominess') state.
there may be multiple Jabberwocks about -
perhaps there will be one (or a clever LLM)
that will have a vorpal sword?
- but it seems the cat is of of the bag,
along with veracity. Such is the genie. (another AI moniker)
"In 1868 Carroll asked his publishers, Macmillan, "Have you any means, or can you find any, for printing a page or two in the next volume of Alice in reverse?" It may be that Carroll was wanting to print the whole poem in mirror writing. Macmillan responded that it would cost a great deal more to do, and this may have dissuaded him."
https://i.imgur.com/toj9Wtq.png
Alice said to send this…
https://en.wikipedia.org/wiki/The_Mouse%27s_Tale
https://en.wiktionary.org/wiki/vorpal
https://en.wikipedia.org/wiki/Jabberwocky
https://en.wikipedia.org/wiki/VSim
https://philosophynow.org/issues/116/Could_There_Be_A_Solution_To_The_Trolley_Problem
https://static1.cbrimages.com/wordpress/wp-content/uploads/2025/01/far_side-dec-6-1994.jpg?q=70&fit=crop&w=750&dpr=1
The cat as an LLM ---
https://en.wikipedia.org/wiki/Schr%C3%B6dinger%27s_cat
"The idea is that the cat's fate depends on the behavior of a subatomic particle, so it remains both alive and dead until the collapse of the wave function. The point, Schrödinger wrote, was that this “ridiculous” scenario should discourage us from “naively accepting as valid a 'blurred model'” of reality."