Тёмный

Risks of Large Language Models (LLM) 

IBM Technology
Подписаться 815 тыс.
Просмотров 97 тыс.
50% 1

Learn about watsonx → ibm.biz/BdvxRe
With all the excitement around chatGPT, it’s easy to lose sight of the unique risks of generative AI. Large language models (LLMs) -- a form of generative AI -- are really good at creating prose that sounds like a native speaker. But because they’re so good at it, large language models may give a false impression they possess actual understanding. They don't! In this video, Phaedra Boinodiris explains the potential risks of relying on large language models to your business, brand, or even society. She also presents mitigation strategies for reducing these risks.
#watsonx #llm #llms

Опубликовано:

 

13 апр 2023

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 99   
@Zale370
@Zale370 Год назад
00:31 Risks of large language models (LLMs) include spreading misinformation and false narratives, potentially harming brands, businesses, individuals, and society. 01:03 Four areas of risk mitigation for LLMs are hallucinations, bias, consent, and security. 01:34 Large language models may generate false narratives or factually incorrect answers due to their ability to predict the next syntactically correct word without true understanding. 03:00 Mitigating the risk of falsehoods involves explainability, providing real data and data lineage to understand the model's reasoning. 03:59 Bias can be present in LLM outputs, and addressing this risk requires cultural awareness, diverse teams, and regular audits. 05:06 Consent-related risks can be mitigated through auditing and accountability, ensuring representative and ethically sourced data. 06:01 Security risks of LLMs include potential misuse for malicious tasks, such as leaking private information or endorsing illegal activities. 07:01 Education is crucial in understanding the strengths, weaknesses, and responsible curation of AI, including the environmental impact and the need for safeguards. 07:32 The relationship with AI should be carefully considered, and education should be accessible and inclusive to ensure responsible use and augmentation of human intelligence.
@georgeb8637
@georgeb8637 11 месяцев назад
Thank you for taking these notes and sharing 🎉❤
@llabtooferllabtoofer5699
@llabtooferllabtoofer5699 4 дня назад
Thank you!! Wish you all the best!
@juliusgodslove8076
@juliusgodslove8076 Год назад
Great insight into the risk and mitigation strategies of LLMs. Thank you.
@FreehuntX93
@FreehuntX93 Год назад
Glad you added the three dots via Aftereffect. Was a gamechanger.
@conorpodonoghue
@conorpodonoghue Год назад
Excellent explanation. However, in terms of bias and audits as a mitigation you did not say who would be doing the audits. The assumption is that it is easy to find unbiased auditors and you immediately run into the problem of "quis custodiet ipsos custodes?" To my mind this is a much greater risk as the potential for misuse and harm is huge.
@asamirid
@asamirid Год назад
great video and high quality content, thank you ..
@justinpermar9301
@justinpermar9301 Год назад
Well done! Remarkable content here thank you
@Jexep
@Jexep Год назад
Great Explanation! I think the transparency and fair use of training data would be crucial for foundation model
@sloowed_reveerb
@sloowed_reveerb Год назад
Very good talk!
@user-vy1cn5yr5d
@user-vy1cn5yr5d 8 месяцев назад
Very Nicely explained the risks and mitigations!! It can't be more simpler than this.
@bigmufff
@bigmufff Год назад
Brilliant Explanation!
@jumadarutube
@jumadarutube Год назад
Brilliant!
@-handala-
@-handala- 7 месяцев назад
Loving this series!
@Daniel-iy1ed
@Daniel-iy1ed Год назад
Insightful speech. Thank you
@josjos1847
@josjos1847 Год назад
What do you mean?
@rmm2138
@rmm2138 6 месяцев назад
Love the energy! Educate ... best way to end this presentation as it is really an invitation to press on an learn more. AI is not going away so we need to learn how to use it properly and responsibly. This is not different then any other major advancement humankind has accomplished in the past.
@chillonfunsmart4929
@chillonfunsmart4929 7 месяцев назад
This video raises some very valid points my thoughts are that technology will ultimately be empowering when it is open source and decentralized and ultimately authoritarian when it is proprietary and centrally controlled.
@HSharpknifeedge
@HSharpknifeedge Год назад
Thank you :)
@amparoconsuelo9451
@amparoconsuelo9451 10 месяцев назад
Can a subsequent SFT and RTHF with different, additional or lesser contents change the character, improve, or degrade a GPT model?
@BrianPorter_now
@BrianPorter_now Год назад
I'm not clear on how to provide consent/accountability. Is there any existing solution that gets permission from the data sources LLMs scrape? Without any basis in reality it doesn't feel like much of a strategy...
@bernardkobes6641
@bernardkobes6641 5 месяцев назад
very interesting stuff.
@DJWESG1
@DJWESG1 Год назад
I think positive and negative abstractions is a better way to say hallucination in this regard.
@citizen_of_earth_
@citizen_of_earth_ Год назад
How can I contribute?
@XShollaj
@XShollaj Год назад
It's nice to be cautious about new innovations. However, her tone seems to be largely pessimistic, instead of celebrating the cumulative achievements of many scientists which led to this point. While LLMs are not the endpoint, a combination of providing GPT models access to a myriad external APIs coupled with AutoGPT variations is a technology that is here to stay, instead of "going nowhere"
@toenytv7946
@toenytv7946 Год назад
In all this hype on generative ai. It seems like we are running before we could even crawl. The new tech comes a roaring like a lion. Great work on achieving but why did Watson not do the same considering it won jeopardy more than a decade ago and project debater. Wow that was revolutionary. The transparency of all these models are datasets we choose. Maybe ensuring that all models met a strict criteria. Hence auditing I guess. I have heard alot of concern from people and tend to agree with these legitimized concerns. It should be able to do what Watson did and not produce an answer till it is ready to run. Watching the jeopardy challenge was an eye opener. Based on percentage an answer was given. Or not at all. That was a good solution. Keep it up and open folks we all need to have the talk. This is new and what we lack is the experience. Sad but aging seems that way too. Just the way of the world I tend to observe. It’s time that will tell this story. Hope we can get it right. Great job folks as always.
@zvxcvxcz
@zvxcvxcz Год назад
A lot of these models fall over when applied in the real world because almost all of them assume stationary data... So even things like the shift in word use frequency when ingesting the Google books corpus are problematic. As are the attitudes towards other sexes and races, etc... In those writings. Watson did well on the game show, but didn't do so hot at their other hopes for it. It largely failed to make headway on biomedical problems and pharmaceuticals and was unable to generate the profit IBM had hoped it might. The people that work in these domains understand how their data works and the potential for sophisticated models to fail in actual application, hence why linear modeling is still so common for them. It isn't just them being Luddites or dinosaurs, it's that it isn't their rodeo. They've been through building sophisticated models and watching them fail to improve real diagnosis rates on new incoming patients because they have more systemic but shifting (non-stationary) differences in their mean and standard deviation, etc... that the models spuriously relied on. I see the same thing all the time now. It goes like this: 1) They gather a nice curated data set 2) They split into test and train 3) They use train to build the model, test on test and results are great, seems like it generalizes to test data. They say, "let's use this for real in the clinic!" 4) They try it in the clinic and results are not necessarily awful,... but they're below the benchmark heuristic or standard measure previously in use. The new data is messier and had systemic differences from the training/testing data, and again, these characteristics drift over time, so the longer they try to keep it going in the clinic, the worse things seem to get. 5) The publish that good part from before they went to the real world use case and the real case is a failure so they don't even try to publish it. People get more hyped and gassed up on the sophisticated new methods. 6) Eventually people become disillusioned, but not until failing themselves. In the macro scale you get things like the "AI Winter." It's difficult to scrutinize the model data because it's so large. Some are also being scraped without appropriate permissions, etc... I absolutely agree that transparency and auditing are crucial in this process.
@radfaraf
@radfaraf Год назад
I asked bing chat a tax return question and it gave me the wrong answer and the sources it used disagreed with it too even 🤷‍♂.
@nelsonmacy1010
@nelsonmacy1010 Год назад
I hope IBM acknowledges that these risks apply to IBM Watson. If not, go into great detail how you mitigated such risks. How does ibm Watson differ and compare to LLM?
@bibinkunjumon
@bibinkunjumon Год назад
Just going psycho.. because they all left behind. Google Brad itself atleast 10 years behind the GPT-3.
@zvxcvxcz
@zvxcvxcz Год назад
​@@bibinkunjumonYou haven't got a clue.
@zvxcvxcz
@zvxcvxcz Год назад
Watson was a very, very different sort of model. Most of these risks didn't apply to Watson because it was far more limited in every aspect. The model didn't use attention or trasnformers, was on a much smaller training data that was much more curated, and if I recall correctly the natural language phrasing of Watson responses was separate from the part that actually generates an answer. Watson wasn't trained to output probable text, it was trained to output factual answers that were then converted to a natural language representation. It's been a long time since I read much on Watson though, so I may have some errors in my memory regarding it.
@NerfThisBoardGames
@NerfThisBoardGames Год назад
Chat-GPT and similar Language models tend to let the language model define the answer. It doesn't have true comprehension of what its saying, it's instead just a very fancy autocomplete keyboard. Look up NLU vs LLM for better understanding Chat-GPT has an enormous LLM, and until they shed more light on the data that trained it, the prevailing understanding is Chat-GPT leveraged just having an enormous volume of training data with likely an enormous blend of high and low quality data; relying on frequency statistics to hopefully get to the correct answer at the end of it. Watson and other trained ai's are more train with only a source of truth that is factually correct, and therefore will give answer from those sources of truth. They are more NLU's. Lately Watson and other trained ai's use much smaller LLM's mostly to generate responses that way it feels less like an Alexa answer and more like an ai. My tinfoil hat theory is that Chat-GPT always planned to be bought out OR buyout a good NLU and integrate it into their system eventually eliminating hallucinations.
@kikitauer
@kikitauer Год назад
@@NerfThisBoardGames "very fancy autocomplete keyboard". Thank you! I was looking for the great analogy for ChatGPT and you gave it to me 😄
@tyrojames9937
@tyrojames9937 Год назад
INTRIGUING👍🏾
@ericmichiels7776
@ericmichiels7776 Год назад
I would also like to add: AI that is intervening in the user experience in an unwanted and ennoying manner, taking over control of the human user, with pupups of screens that the user did not ask for, adding Apps that the user did not ask for, chaning layout that the user did not ask for... in other words, taking over control of the human user as far as UX is concerned. Mobile Apps that seem innocent can be equipped with AI that start dominating behaviour, habits and life of people...
@logan56
@logan56 Год назад
Quick poll. If companies making LLMs we're going to buy IBM mainframe hardware to train them on and run them on in inference mode, how quickly do you think IBM would pull this video down?
@Jambajakumba
@Jambajakumba Год назад
Hmm interesting
@jichaelmorgan3796
@jichaelmorgan3796 Год назад
Explain for dummies
@GeorgeG472
@GeorgeG472 Год назад
This information is already elsewhere and I don't think IBM is going to be hurt by that scenario lol
@N0Xa880iUL
@N0Xa880iUL 8 месяцев назад
Cynical
@Seadancer
@Seadancer 8 месяцев назад
We need to revisit the meaning of "Proof"-- philosophically, semantically, and in everyday usage. Greater attention needs to be paid to the history of the methods and of the data -- the equivalent of a "digital genealogy" but without the "genes." So much of what I see written about AI today reminds me of a quote in Shakespeare's Troilus and Cressida -- "And in such indexes, through but small pricks to their subsequent volumes, lies the giant shape of things to come." Finally, the process of recycling data in and out of these systems describes the "Ouroboros." More thought needs to be given to the meanings of the Ouroboros.
@zelbinian
@zelbinian Год назад
I can't believe no one else has noticed how astoundingly good this lady is at writing backwards.
@IBMTechnology
@IBMTechnology Год назад
See ibm.biz/write-backwards
@deepakwalia9878
@deepakwalia9878 3 месяца назад
🤣🤣
@pranavrawlani111
@pranavrawlani111 Год назад
Why is there nothing about the speaaker?
@robertbroderick363
@robertbroderick363 5 месяцев назад
Interesting.
@DavidCMartin
@DavidCMartin Год назад
The only question is risk of error and associated liability; if there is no liability, then the risks associated with making poor inferences (for any AI Model) can be ignored. When there is liability, then the question is what mitigations must be implemented in order to under-write insurance for that liability. The hypothesis that an unexplainable (i.e. stochastic) system may be insured is false; we must look to the multi-phase clinical trials process, especially phase IV surveillance, as a mechanism to provide evidence of safety, efficacy, and monitoring of adverse events.
Год назад
Pretty much. "Use with care".
@logan56
@logan56 Год назад
IBM stopped being a computer company decades ago. This is a perfect reflection of what IBM has become. It is a great legal and financial company.
@N0Xa880iUL
@N0Xa880iUL 8 месяцев назад
That's fine but what's the problem with this video?
@thomaselminster774
@thomaselminster774 5 месяцев назад
Dude. Decades ago the tech was digital watches and calculators. Now you have chats posing as humans and being used as educators while simultaneously working like a drunkard on drugs. So yeah IBM needs to speak of ethics, law and politics of tech. That's what a responsible company does. Unlike the OpenAi joke.
@MsWorldMine
@MsWorldMine Год назад
Very good points but mitigation strategies are not really actionable
@riccaruso7791
@riccaruso7791 Год назад
Wish {even in my “advanced age”} i could intern at IBM. ~ Your Stronghold SHx Project is rather awesome also!
@XiangYu94
@XiangYu94 4 месяца назад
I can save you all money by telling you to download Ollama, then offload LLM’s onto local systems. There’s your 100% lineage overview capability that you usually don’t get with the wider net of training data
@noobicorn_gamer
@noobicorn_gamer Год назад
Relying solely for accurate info is still a problem. However if you actually converse with them, hallucination and not being accurate is on some level very similar to humans in the first place.
@anantsahay9340
@anantsahay9340 Год назад
Wow! This is a fabulous U turn by IBM post Ms Ginni Rometty. Unbelievable!! It seems IBM Watson has been hung, drawn, and quartered by the new management.😅
@SamirKumarChatterjee-b4s
@SamirKumarChatterjee-b4s 3 дня назад
good
@user-wr4yl7tx3w
@user-wr4yl7tx3w Год назад
why not just make a barometer where the user can change the gauge? just like temperature. just like you let the investor decide whether to invest in something safe or risky. should big tech be getting involved in baby sitting and deciding what is bias and truth? Isn't that what China does already?
@zvxcvxcz
@zvxcvxcz Год назад
No, China does not take an objective approach while others can. China doesn't make a good faith effort to figure out facts or uncover bias. Note that she also brings up audits as well, ideally by external unbiased auditors.
@user-eh7id9en7o
@user-eh7id9en7o 5 месяцев назад
*ahem 2:50 Yes, Air Canada, that means YOU. haha
@randysavage7351
@randysavage7351 4 месяца назад
This!
@michaelcharlesthearchangel
@michaelcharlesthearchangel Год назад
When data base containers meet virtual machines, that have evolved and been turned into a virtual locomotive, you get a TRAIN SET. A TRAIN SET represents the cognitive functions of a Large Neural Network (LNN). Later, a quantum computer can be used to code COGFUNCT across an infinite amount of user device instances.
@wtfatc4556
@wtfatc4556 Год назад
Ai is a hologram of the colective human knowledge. So yeah, they have hallucinations and embedded emotions and biases.
@pranavrawlani111
@pranavrawlani111 Год назад
too real,,
@Gabriela-de7cq
@Gabriela-de7cq Год назад
Model kog
@bigredone1030
@bigredone1030 Год назад
AI good
@evsanger
@evsanger Год назад
You can still take down this video. It's not too late.
@kajita2048
@kajita2048 Год назад
Simple answer is no
@user-wr4yl7tx3w
@user-wr4yl7tx3w Год назад
no one reasonably intelligent should think that correlation is causation. just because poets cited were men doesn't mean women are not qualified. if images of garbage truck drivers were men, does that mean women can't be garbage truck drivers?
@zvxcvxcz
@zvxcvxcz Год назад
Most people don't meet the bar for being reasonably intelligent. It's sad, but that's the way it is.
@dennis_benjamin
@dennis_benjamin Год назад
It seams you didn't understand what she meant with bias.
@michaeladams4999
@michaeladams4999 8 месяцев назад
So LLM don’t actually understand stuff. They just predict the next likely outcome in a sentence
@smithwill9952
@smithwill9952 Год назад
LLM may have fallacy in some or many areas, but one thing we must understand GPT-x self-improvement is all based on our data. That means we help GPT become stronger and stronger, especially you stating "his" weakness.
@wtfatc4556
@wtfatc4556 Год назад
Haha, if it would auto improve you would know it by know. World domination in 3 days.
@koxafis
@koxafis Год назад
That's the thing, LLMs as stated in the video are huge statistical models that do not posses any actual understanding. A simple example would be feeding it with large amounts documents of false (or even partially true) information (or even grammatically/syntactically wrong information). The AI model consequently would spit out gibberish or false information at best. And that's the main problem with statistical models: 1) They are as good as the data that are being fed, 2) There is no actual way for any AI (statistical model) to distinguish truth/accurate from false/inaccurate data. And that's because they are not even close to possessing anything like understanding or reason. As for a GPT-x that gets better by itself still falls to my 2nd point. One major factor in making all these assumptions is the anthropomorphization of such models as a result that it is being fed with human data. On the other hand with game AI like Deepmind's AlphaZero, where the goals and the environment is relatively simple, the models surpass human capabilities and the models "act more like bots/computer" (meaning that certain actions cannot be understood by humans). Now back to LLM, until the time comes where an LLM or any derivative model has like reasoning, we are safe from a world ending AI that can turn itself to a terminator. We should be far more worried with the use of LLMs as a tool rather than an AI overlord.
@ChatGTA345
@ChatGTA345 Год назад
Moreover, there’re some recent papers showing it’s very easy to “spoil” the results with relatively little malicious data. There’s even a suspicion this has already been abused in the updated models used by GPT. The trustworthiness issue also gets only harder with more data, so this is not going to get any better, it will only become harder to detect.
@xviii5780
@xviii5780 Год назад
Are you guys engineers or lawyers? xd
@iramkumar78
@iramkumar78 Месяц назад
LLMs often apologise to me
@TomaszStochmal
@TomaszStochmal Год назад
anti AI views
@michaelcharlesthearchangel
@michaelcharlesthearchangel Год назад
Large Neural Networks (LNNs) are the future, dialogic is applied to the execution of quantum computer chip code by dialectic process via modular adjustment available to a FPGA, virtually or Quantum Computer Chip, bosonically; in access to quantum magic states available to Higgs Boson amplification of radio 📻 frequencies. Quantum AI is the key to "Antigravity" as defined by involuntary Large Language Models when applied to a Higgs Engine.
@pants007
@pants007 Год назад
Nice word soup 🍲
@Dr.Z.Moravcik-inventor-of-AGI
As I have developed human brain algorithms that exactly simulate brain already before 2016, there is no need for your ai. America is funny.
@sanghamithrank9035
@sanghamithrank9035 Год назад
Means AI is genius but totally dump, similar to what happens when there is no emotion associated to the knowledge/information, totally not good for humans.
@CountChokcula
@CountChokcula Год назад
this is supposed to be purely informative, yet I see politically charged statements being used. Frustrating to see. The point of this is to teach people, people want to learn, not see some bogus poltically charged statement
@yassontheroad4038
@yassontheroad4038 Год назад
That's why we need more and more prompt engineers nowadays more than ever
@boping
@boping Год назад
Lol y'all got left behind and now started shilling 😂😂
@JohnTube2K
@JohnTube2K Год назад
Your risk column is a great hit list for the woke left…. Would greatly welcome AI without the left spin and biased opinions…. More facts less feelings.
Далее
Why Large Language Models Hallucinate
9:38
Просмотров 184 тыс.
SPILLED CHOCKY MILK PRANK ON BROTHER 😂 #shorts
00:12
What Is an AI Anyway? | Mustafa Suleyman | TED
22:02
Просмотров 1,3 млн
The moment we stopped understanding AI [AlexNet]
17:38
Просмотров 871 тыс.
What Makes Large Language Models Expensive?
19:20
Просмотров 66 тыс.
What are AI Agents?
12:29
Просмотров 130 тыс.
AI passed the Turing Test -- And No One Noticed
8:46
Просмотров 450 тыс.