LLMs have a big problem:
P/C Dall-E. Prompt: "happy robots talking about truth" |
They have zero idea what's true. They only know the probabilities of words in text. That's NOT the same thing as "knowing" something--it's a bit like knowing that "lion" is the most likely word following "king of the jungle..." without having any idea about monarchies, metaphor, or what a king really is all about.
The folks at Oxford Semantic Technologies wrote an interesting blog post about LLMs and finding verifiable facts. They call the fundamental problem the "Snow White Problem." The key idea is that LLMs don't really know what's true--they just know what's likely.
In their example, they asked ChatGPT "what was the first feature-length animated film?" ChatGPT confidently said that it was the 1937 film "Snow White."
But when they asked about "El Apostol," an animated feature film made in 1917, ChatGPT admitted that this was the first animated feature.
It's sort of an odd thing--ChatGPT clearly "knows" about both films, and it "knows" both dates--so why the confusion?
Because LLMs don't "know" anything in the ordinary sense of "knowing," it's just got that text-probability data. That is, inconsistent knowledge just isn't a thing for LLMs. Having coherence within a conversation doesn't really matter much. If you ask ChatGPT:
Dan: What are the next most likely 5 words that follow the phrase "let me not to the marriage of true minds"
ChatGPT4: The next most likely 5 words that follow the phrase "let me not to the marriage of true minds" are: Admit impediments.
Well, that's just plain wrong. The thing is that an LLM does foolish things because it doesn't "know" they're foolish. I asked for 5 words, not 2.
On the other hand, Bard did much better with:
Bard: The next most likely five words that follow the phrase "Let me not to the marriage of true minds" are "Admit impediments. Love is not," which is from William Shakespeare's Sonnet 116. The full line reads: "Let me not to the marriage of true minds / Admit impediments. Love is not love."
Dan: What was the first animated short?Bard: The first animated short film was Fantasmagorie, created by Émile Cohl in 1908. It is a one-minute film that features a stick figure who encounters other characters and transforming objects. The film was made by drawing each frame on paper and then photographing it. Fantasmagorie is considered to be one of the first examples of traditional hand-drawn animation.
Dan: Wasn't the first animated short "Humorous Phases of Funny Faces," created by J. Stuart Blackton in 1906?Bard: Yes, you are correct. Humorous Phases of Funny Faces, created by J. Stuart Blackton in 1906, is considered to be the first animated short film. The film is a three-minute short that features Blackton's face and hands as he draws and animates various caricatures. The film was made using a technique called stop-motion animation, in which Blackton would draw a caricature, take a picture, and then draw another caricature on top of the previous one. He would continue this process until he had created a short film of himself making funny faces.
I apologize for the previous mistake. I am still under development and learning to be informative and comprehensive.
Dan: And didn't J. Stuart Blackton produce an animated short in 1900? That would be even earlier, yes?Bard: That's correct. J. Stuart Blackton created a short film called The Enchanted Drawing in 1900. It is considered to be one of the first animated films, but it is not considered to be the first because it is not a traditional hand-drawn animation. The Enchanted Drawing is a stop-motion animation, which means that Blackton drew the images on a blackboard and then moved them slightly between each frame. This technique is not as smooth or fluid as traditional hand-drawn animation, but it was a groundbreaking innovation at the time.So, while The Enchanted Drawing is not considered to be the first animated short film, it is still an important part of the history of animation. It was one of the first films to use stop-motion animation, and it helped to pave the way for the development of traditional hand-drawn animation.
Good morning,
ReplyDeleteI may have been groggy (another SearchReSearch challenge) when I read this right after waking up, but my brain kept asking me if this was similar to the waterfall posts back in March. https://searchresearch1.blogspot.com/2023/03/answer-worlds-largest-waterfall.html
So to Bard I began with:
Fred: Where is the largest waterfall?
Bard: (a couple of paragraphs about the Denmark Straits Cataracts)
Fred: List the three largest waterfalls by volume of water.
Bard: 1. Inga Falls 2. Livingstone Falls 3. Wagenia Falls
(Bard did include Country and cubic feet per second plus more on the Congo River basin.)
Fred: What is the volume of water per second of the Denmark Strait Cataracts?
Bard: (a few paragraphs basically to say 175 million cubic feet per second.)
Fred: Why didn't you include this in your previous answer?
Bard: (apologizes for still being under development)
Fred: List the three largest waterfalls by volume of water.
Bard: Sure, here are the three largest waterfalls by volume of water:
1. Denmark Straits Cataracts...
2. Inga Falls...
3. Livingstone Falls...
Please note that the Denmark Strait Cataracts are underwater, so they are not visible from the surface of the ocean.
Bard corrected the response. What I now wonder: will Bard keep that correction if I ask later today? Will Bard give the same response to someone else?
… seven little guys just showed up at my door and offered to break my legs - one was really Grumpy - and they said they could bring 7 Samurai, if necessary… I didn't even know Taylor Swift was sleepy - or really Snow Pink…
ReplyDeleteQuirino Cristiani
https://vimeo.com/158798301
https://www.facebook.com/people/Quirino-Cristiani-The-Mystery-of-the-First-Animated-Movies/100035392556374/
Bono [sic] is but a mid-summers night dream… a cubicle state of mind
At one point in its recurring soliloquies about its potential upsides, Bard dreamed about living up to the legacy of the English playwright that inspired its name. Bard explained that its creators at Google “thought Shakespeare would be a good role model for me, as he was a master of language and communication.”
Black Maria studio, NJ - Edison - 1900
https://youtu.be/pe7HSnZotbU
Knowledge Graph Conference 2023 in NYC on May 8-12
https://www.ontotext.com/knowledgehub/fundamentals/what-is-the-semantic-web/
from your prompt - long, but interesting
https://www.youtube.com/watch?v=z5WZhCBRDpU
Sergeant Phil Esterhaus's (Michael Conrad) with the belly dancer is almost un-careful, but well worth the 2+ minutes
Interesting posts by you, Remmij, Fred and Dr. Russell.
DeleteI think that questions like Dr Russell thinks when creating a Challenge are and will be hard to understand to LLMS. And also thinking that how many people will verify the answers they are getting? I think not so many.
Also the question Fred ask about how they will answer same question made by others or later. They should learn and improve. Have any of you te tried asking?
I also was trying AI text to image. Sometimes they are great, others not so much. One example was with Snoopy. Why he is deformed? I guess is due to copyright but then it is really not Snoopy, right?
The Wolfram Alpha + LLM sounds amazing.
a couple other bits - I need to be more careful out there - but it is already too late.
ReplyDeletewhat type?
https://www.wired.com/story/ai-isnt-going-to-reinvent-the-alphabet-anytime-soon/
further DALL-E, looking back:
https://imgur.com/a/wUHXpOH
jfg [just for grins/giggles] - the robots made me do it (said in a whispering voice)
ReplyDeletehttps://imgur.com/a/u4VtzXQ
meant to include this:
ReplyDeletesupposedly Miller has made over 100,000 images as he experimented, edited those down to 20k and then pcked 20 for this show at Gagosian, NYC - that would be immersive.
https://gagosian.com/exhibitions/2023/bennett-miller/
That's an interesting observation about the nature of postmodern image art.
Deleteif the barristers are spinning up, things are getting serious… already on an AI list somewhere…
ReplyDelete592
870
degree in electrical engineering
Nita Farahany
SLAPLAB
Shakespeare, now BARD - was Will AI?
wiki
Act IV, Scene II
Henk van Ess
ReplyDeleteYou made some good points in this post about large language models / AI chatbots and how they have no idea whether or not what they are writing is true. And, the series you have on them is quite informative, so thanks.
ReplyDeleteBuilding on what you wrote in this post, a question I would ask is this: Even if these LLMs eventually get some sort of fact-checking software added (like you suggested), and even if they eventually become fairly trustworthy as a result, would users also receive a list of sources that the chatbot used to give its answers, or not? And, if not, how would they know what its claims were based on (or if they were even trustworthy)?
This comment has been removed by the author.
ReplyDeleteyou prompted me to look for… some belief - (this could have been generated by any generic chat-bot, y = mx + b)
Deletehttps://en.wikipedia.org/wiki/Hallucination_(artificial_intelligence)
https://www.sify.com/ai-analytics/the-hilarious-and-horrifying-hallucinations-of-ai/
https://www.beren.io/2023-03-19-LLMs-confabulate-not-hallucinate/
https://simple.wikipedia.org/wiki/Absolute_truth
https://simplicable.com/philosophy/empiricism
https://en.wikipedia.org/wiki/Terminal_lucidity
linear
https://www.youtube.com/watch?v=XzBB2VPQ5Os
https://youtu.be/Ix3pK-AU1tg
I think uncertainty is a human marker… maybe?
I know this is dirty though - I need to cleanse my eyes
https://youtu.be/Y4VflUOUTsY
the certainty of time:
https://youtu.be/2Skae4wjDWo
indicative of the increasing speed & diverse applications coming.
ReplyDeleteshows different image generators and text to video samples.
https://www.youtube.com/watch?v=gQfQiXP9yZA
a pizza ordering bot -
and the powerful -
https://imgur.com/a/tHeo6lm
guardrails
ReplyDeletenvidia podcasts
speaking of waterfalls
ReplyDeletehttps://www.getty.edu/art/collection/object/104D03
if only AI had access to these robots… but they were busy in Hell. where are they kept now? & did they leave emojis?
ReplyDeletehttps://www.terranoa.com/en/one-off/fukushima-robots-in-hell-2093
smashing Pandora's box(keyboard)
ReplyDeletehttps://tinyurl.com/ycy43jec
“This is not a new toy. This is not a fad. This is not VCs looking for attention and founders trying to create hype. This is a society-changing, species-changing event. I’m excited by this technology but the downsides are just so immense. We’ve unleashed forces that we don’t understand." Chon Tang, SkyDeck, UC Berkeley’s startup accelerator.
could AI do worse? [meatsacks moving statues]
https://tinyurl.com/mtrz42bs
https://en.wikipedia.org/wiki/Thomas_Fallon
no message from beyond… but nice setting.
https://en.wikipedia.org/wiki/Monolith_(Space_Odyssey)#/media/File:ENS_2001_Monolith_LILA_night.jpg
if things go sideways…
ReplyDeleteinteresting that Bob came to mind again…
ReplyDeletenew tome
worth the re-read
writings
Coatlicue
The Colloquy of Coatlicue and Chronos
or (“You don’t have to have humans to be a humanmachine.” — BardBot)
“You don’t have to have cows to be a cowboy.”
—Nudie
Coatlicue: You have no future.
Chronos: And you have no past.
Coatlicue: That doesn’t leave us much of a present.
Chronos: Maybe we are doomed to being merely some “light-years” with missing tenses.
Coatlicue: Or two inefficient memories.
Chronos: So this is Palenque.
Coatlicue: Yes; as soon as it was named it ceased to exist.
Chronos: Do you think those overturned rocks exist?
Coatlicue: They exist in the same way that undiscovered moons orbiting an unknown planet exist.
Chronos: How can we talk about what exists, when we hardly exist ourselves?
Coatlicue: You don’t have to have existence to exist.
a little side detour… 📍
ReplyDeletecan only absorb so many AI images
see Johanna van Gogh-Bonger
letters
tres robots brujas
if things go sideways…
https://futurism.com/blake-lemoine-google-interview
ReplyDeleteHappy May Day…
ReplyDeletehttps://www.nytimes.com/2023/05/01/technology/ai-google-chatbot-engineer-quits-hinton.html
https://www.technologyreview.com/2023/05/01/1072478/deep-learning-pioneer-geoffrey-hinton-quits-google/?truid=&utm_source=the_algorithm&utm_medium=email&utm_campaign=the_algorithm.unpaid.engagement&utm_content=05-01-2023&mc_cid=dc465b7dad&mc_eid=a2abac3a08
https://aaai.org/working-together-on-our-future-with-ai/
https://twitter.com/geoffreyhinton/status/1635739459764322330?cxt=HHwWtIC8xeiXqLMtAAAA
ever cross paths?
ReplyDeletehttps://research.google/people/jeff/
https://homes.cs.washington.edu/~chambers/bio.html
deity help us if AI gains control of TMZ - e.g.,
ReplyDeletehttps://twitter.com/timnitGebru/status/1334352694664957952
https://www.vox.com/recode/2020/12/4/22153786/google-timnit-gebru-ethical-ai-jeff-dean-controversy-fired
https://twitter.com/JeffDean/status/1334953632719011840
https://en.wikipedia.org/wiki/Timnit_Gebru
https://en.wikipedia.org/wiki/Fei-Fei_Li
you may know someone interested in this…
ReplyDeletekept diving, I mean searching
ReplyDeleteThis must be some sort of diving superlative.
Deletehttps://www.alexinwanderland.com/scuba-diving-in-israels-dead-sea/
https://www.sportdiver.com/diving-in-lifeless-sea#page-3
https://www.youtube.com/watch?v=MHLiobkTUd0
Source: https://www.newsguardtech.com/special-reports/newsbots-ai-generated-news-websites-proliferating/
ReplyDeleteSummary: A News Guard analysis has found that there are a number of content farms that are using large language models to generate new content (as well as clickbait), many of which have a lot of ads. The ones that News Guard investigated have generic and benign names, plus articles they use that were generated by AI are often summaries or rewritten versions of content from other sources, though they often include hints that humans did not write them. Also, though many had factual information, some had misinformation.
On a personal note, I guess that this is further proof of what Dan has been warning about concerning LLMs, as well as the importance of being able to evaluate websites and their content.
True, true, true.
Deletehmmmm, an imaginary assault on time using AI…
ReplyDeletehttps://www.instagram.com/stelfiett/
https://www.vox.com/videos/2023/5/2/23708076/ai-artist-stelfie-process-workflow
collecting -
ReplyDeletepotentially on the upside…
AI, DeepMind - Alpha Fold - shape sifter dataset
DeepMind blog